Semisupervised Hyperspectral Image Segmentation Using Multinomial Logistic Regression With Active Learning

This paper presents a new semi-supervised segmentation algorithm, suited to high dimensional data, of which remotely sensed hyperspectral image data sets are an example. The algorithm implements two main steps: (i) semi-supervised learning of the posterior class distributions, followed by (ii) segmentation, which infers an image of class labels from a posterior distribution built on the learned class distributions, and on a Markov random field (MRF). The posterior class distributions are modeled using multinomial logistic regression (MLR), where the regressors are learned using both labeled and, through a graph-based technique, unlabeled samples. Such unlabeled samples are actively selected based on the entropy of the corresponding class label. The prior on the image of labels is a multi-level logistic (MLL) model, which enforces segmentation results in which neighboring labels belongs to the same class. The maximum a posteriori (MAP) segmentation is computed by the α-Expansion min-cut based integer optimization algorithm. Our experimental results, conducted using synthetic and real hyperspectral image data sets collected by the Airborne Visible Infra-Red Imaging Spectrometer (AVIRIS) system of NASA Jet Propulsion Laboratory over the regions of Indian Pines, Indiana, and Salinas Valley, California, reveal that the proposed approach can provide classification accuracies which are similar or higher than those achieved by other supervised methods for the considered scenes. Our results also indicate that the use of a spatial prior can greatly improve the final results with respect to a case in which only the learned class densities are considered, confirming the importance of jointly considering spatial and spectral information in hyperspectral image segmentation.

[1]  Melba M. Crawford,et al.  Locally consistent graph regularization based active learning for hyperspectral image classification , 2010, 2010 2nd Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing.

[2]  William J. Emery,et al.  Active Learning Methods for Remote Sensing Image Classification , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Antonio J. Plaza,et al.  Multi-Channel Morphological Profiles for Classification of Hyperspectral Images Using Support Vector Machines , 2009, Sensors.

[4]  Johannes R. Sveinsson,et al.  Spectral and spatial classification of hyperspectral data using SVMs and morphological profiles , 2008, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[5]  Gustavo Camps-Valls,et al.  Semi-Supervised Graph-Based Hyperspectral Image Classification , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[6]  José M. Bioucas-Dias,et al.  Evaluation of bayesian hyperspectral image segmentation with a discriminative class learning , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[7]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[8]  Alexander Zien,et al.  A continuation method for semi-supervised SVMs , 2006, ICML.

[9]  Robert D. Nowak,et al.  Minimax-optimal classification with dyadic decision trees , 2006, IEEE Transactions on Information Theory.

[10]  Liangpei Zhang,et al.  An unsupervised artificial immune classifier for multi/hyperspectral remote sensing imagery , 2006, IEEE Trans. Geosci. Remote. Sens..

[11]  Lorenzo Bruzzone,et al.  A Novel Transductive SVM for Semisupervised Classification of Remote-Sensing Images , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Theofanis Sapatinas,et al.  Discriminant Analysis and Statistical Pattern Recognition , 2005 .

[13]  Lawrence Carin,et al.  Sparse multinomial logistic regression: fast algorithms and generalization bounds , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[15]  Antonio J. Plaza,et al.  Dimensionality reduction and classification of hyperspectral image data using sequences of extended morphological transformations , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Lawrence Carin,et al.  Semi-Supervised Classification , 2004, Encyclopedia of Database Systems.

[17]  Christopher K. I. Williams,et al.  Learning With Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2001 .

[18]  David A. Landgrebe,et al.  Signal Theory Methods in Multispectral Remote Sensing , 2003 .

[19]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[21]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[23]  D. Hunter,et al.  Optimization Transfer Using Surrogate Objective Functions , 2000 .

[24]  Trevor J. Hastie,et al.  Discriminative vs Informative Learning , 1997, KDD.

[25]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[26]  N. Merhav,et al.  A Measure of Relative Entropy between Individual Sequences with Application to Universal Classification , 1993, Proceedings. IEEE International Symposium on Information Theory.

[27]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[28]  D. Böhning Multinomial logistic regression algorithm , 1992 .

[29]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[31]  Jon Atli Benediktsson,et al.  Recent Advances in Techniques for Hyperspectral Image Processing , 2009 .

[32]  Bernhard Schölkopf,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[33]  José M Bernardo and Adrian F M Smith,et al.  BAYESIAN THEORY , 2008 .

[34]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[35]  Stan Z. Li,et al.  Markov Random Field Modeling in Image Analysis , 2001, Computer Science Workbench.

[36]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[37]  D. Greig,et al.  Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[38]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[39]  and as an in , 2022 .