Bayesian Active Remote Sensing Image Classification

In recent years, kernel methods, in particular support vector machines (SVMs), have been successfully applied to remote sensing image classification. Their properties make them well suited to problems with a high number of image features and a low number of available labeled spectra. Alternative approaches based on (parametric) Bayesian inference have been scarce in recent years. Assuming a particular prior data distribution may lead to poor results in remote sensing problems because of the specificities and complexity of the data. In this context, the emerging field of nonparametric Bayesian methods constitutes a proper theoretical framework for the remote sensing image classification problem. This paper exploits the Bayesian modeling and inference paradigm to tackle kernel-based remote sensing image classification. The Bayesian methodology is appropriate for both finite- and infinite-dimensional feature spaces. The particular problem of active learning is addressed by proposing an incremental/active learning approach based on three different criteria: 1) the maximum differential of entropies; 2) the minimum distance to the decision boundary; and 3) the minimum normalized distance. Parameters are estimated by using the Bayesian evidence approach, the kernel trick, and the marginal distribution of the observations instead of the posterior distribution of the adaptive parameters, which allows us to deal with infinite-dimensional feature spaces. The proposed approach is tested on the challenging problem of urban monitoring from multispectral and synthetic aperture radar data and on multiclass land cover classification of hyperspectral images, in both purely supervised and active learning settings. In the supervised mode, the results are similar to those of SVMs, with the added advantages of posterior estimates for classification and automatic parameter learning. Comparison with random sampling as well as standard active learning methods such as margin sampling and entropy-query-by-bagging reveals a systematic overall accuracy gain and faster convergence with the number of queries.
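
The three query criteria named in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a Gaussian-process-style kernel regressor on labels in {-1, +1} with an RBF kernel and fixed hyperparameters, whereas the paper estimates them by the evidence approach. The function names (`rbf_kernel`, `predictive_moments`, `select_query`) and the parameters `gamma` and `noise_var` are hypothetical. For the entropy criterion the sketch uses the standard Gaussian-model information gain, 0.5 * log(1 + var / noise_var), and for the normalized distance it divides the absolute predictive mean by the predictive standard deviation. Both are assumptions about what the abstract's criteria might look like in this simplified setting.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Squared Euclidean distances between rows of A and rows of B.
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def predictive_moments(X_train, y_train, X_pool, noise_var=0.1, gamma=1.0):
    # Predictive mean and variance of a kernel regressor on +/-1 labels
    # (assumed model; hyperparameters are fixed here, not learned).
    K = rbf_kernel(X_train, X_train, gamma)
    Ks = rbf_kernel(X_pool, X_train, gamma)
    C = K + noise_var * np.eye(len(X_train))
    mean = Ks @ np.linalg.solve(C, y_train)
    V = np.linalg.solve(C, Ks.T)
    # k(x, x) = 1 for the RBF kernel.
    var = 1.0 - np.einsum("ij,ji->i", Ks, V)
    return mean, np.maximum(var, 1e-12)

def select_query(mean, var, noise_var=0.1, criterion="entropy"):
    # Returns the index of the pool sample to label next.
    if criterion == "entropy":
        # Entropy reduction of a Gaussian model when one sample is added.
        return int(np.argmax(0.5 * np.log1p(var / noise_var)))
    if criterion == "distance":
        # Sample closest to the decision boundary (mean = 0).
        return int(np.argmin(np.abs(mean)))
    if criterion == "normalized":
        # Distance to the boundary scaled by predictive uncertainty
        # (assumed normalization).
        return int(np.argmin(np.abs(mean) / np.sqrt(var + noise_var)))
    raise ValueError(f"unknown criterion: {criterion}")

if __name__ == "__main__":
    X_train = np.array([[-2.0], [2.0]])
    y_train = np.array([-1.0, 1.0])
    X_pool = np.array([[-2.0], [0.0], [5.0]])
    mean, var = predictive_moments(X_train, y_train, X_pool)
    for name in ("entropy", "distance", "normalized"):
        print(name, select_query(mean, var, criterion=name))
```

In an active learning loop, the selected pool sample would be labeled, moved into the training set, and the moments recomputed before the next query.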
