Semi-supervised remote sensing image classification via maximum entropy

Remote sensing image segmentation requires multi-category classification, typically with a limited number of labeled training samples. While semi-supervised learning (SSL) has emerged as a sub-field of machine learning to tackle the scarcity of labeled samples, most SSL algorithms to date involve trade-offs in scalability and/or applicability to multi-category data. In this paper, we evaluate semi-supervised logistic regression (SLR), a recent information-theoretic semi-supervised algorithm, for remote sensing image classification problems. SLR is a probabilistic discriminative classifier and a specific instance of the generalized maximum entropy framework with a convex loss function. Moreover, the method is inherently multi-class and easy to implement. These characteristics make SLR a strong alternative to the widely used semi-supervised variants of SVM for the segmentation of remote sensing images. We demonstrate the competitiveness of SLR in multispectral, hyperspectral and radar image classification.
