An Active Learning Approach to Hyperspectral Data Classification

Obtaining training data for land cover classification using remotely sensed data is time consuming and expensive especially for relatively inaccessible locations. Therefore, designing classifiers that use as few labeled data points as possible is highly desirable. Existing approaches typically make use of small-sample techniques and semisupervision to deal with the lack of labeled data. In this paper, we propose an active learning technique that efficiently updates existing classifiers by using fewer labeled data points than semisupervised methods. Further, unlike semisupervised methods, our proposed technique is well suited for learning or adapting classifiers when there is substantial change in the spectral signatures between labeled and unlabeled data. Thus, our active learning approach is also useful for classifying a series of spatially/temporally related images, wherein the spectral signatures vary across the images. Our interleaved semisupervised active learning method was tested on both single and spatially/temporally related hyperspectral data sets. We present empirical results that establish the superior performance of our proposed approach versus other active learning and semisupervised methods.

[1]  P. H. Swain,et al.  Bayesian classification in a time-varying environment , 1978 .

[2]  Kamal Nigamyknigam,et al.  Employing Em in Pool-based Active Learning for Text Classiication , 1998 .

[3]  David J. C. MacKay,et al.  Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.

[4]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[5]  David A. Landgrebe,et al.  The effect of unlabeled samples in reducing the small sample size problem and mitigating the Hughes phenomenon , 1994, IEEE Trans. Geosci. Remote. Sens..

[6]  Matthias Seeger,et al.  Learning from Labeled and Unlabeled Data , 2010, Encyclopedia of Machine Learning.

[7]  Naoki Abe,et al.  Query Learning Strategies Using Boosting and Bagging , 1998, ICML.

[8]  Raymond J. Mooney,et al.  Active Learning for Probability Estimation Using Jensen-Shannon Divergence , 2005, ECML.

[9]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[10]  Thorsten Joachims,et al.  Transductive Inference for Text Classification using Support Vector Machines , 1999, ICML.

[11]  Lorenzo Bruzzone,et al.  An automatic technique for detecting land-cover transitions , 1996, IGARSS '96. 1996 International Geoscience and Remote Sensing Symposium.

[12]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[13]  Joydeep Ghosh,et al.  Best-bases feature extraction algorithms for classification of hyperspectral data , 2001, IEEE Trans. Geosci. Remote. Sens..

[14]  John Shepanski,et al.  Hyperion, a space-based imaging spectrometer , 2003, IEEE Trans. Geosci. Remote. Sens..

[15]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[16]  Foster J. Provost,et al.  Active Learning for Class Probability Estimation and Ranking , 2001, IJCAI.

[17]  Joydeep Ghosh,et al.  Adaptive Feature Spaces For Land Cover Classification With Limited Ground Truth Data , 2004, Int. J. Pattern Recognit. Artif. Intell..

[18]  Joydeep Ghosh,et al.  An Active Learning Approach to Knowledge Transfer for Hyperspectral Data Analysis , 2006, 2006 IEEE International Symposium on Geoscience and Remote Sensing.

[19]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[20]  David A. Landgrebe,et al.  A cost-effective semisupervised classifier approach with kernels , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Trevor J. Hastie,et al.  Discriminative vs Informative Learning , 1997, KDD.

[22]  Gabriele Moser,et al.  Partially Supervised classification of remote sensing images through SVM-based probability density estimation , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[23]  Tong Zhang,et al.  Active learning using adaptive resampling , 2000, KDD '00.

[24]  Craig A. Knoblock,et al.  Active + Semi-supervised Learning = Robust Multi-View Learning , 2002, ICML.

[25]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[26]  Foster J. Provost,et al.  Active Sampling for Class Probability Estimation and Ranking , 2004, Machine Learning.

[27]  Joydeep Ghosh,et al.  Hierarchical Fusion of Multiple Classifiers for Hyperspectral Data Analysis , 2002, Pattern Analysis & Applications.

[28]  Sebastiano B. Serpico,et al.  Foreword to the Special Issue on Advances in Techniques for Analysis of Remotely Sensed Data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[29]  Sankar K. Pal,et al.  Segmentation of multispectral remote sensing images using active support vector machines , 2004, Pattern Recognit. Lett..

[30]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[31]  Lorenzo Bruzzone,et al.  Unsupervised retraining of a maximum likelihood classifier for the analysis of multitemporal remote sensing images , 2001, IEEE Trans. Geosci. Remote. Sens..

[32]  Qiong Jackson,et al.  An adaptive classifier design for high-dimensional data analysis with a limited training data set , 2001, IEEE Trans. Geosci. Remote. Sens..

[33]  Lorenzo Bruzzone,et al.  An approach to unsupervised change detection in multitemporal SAR images based on the Generalized Gaussian distribution , 2004, IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium.

[34]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[35]  David A. Landgrebe,et al.  Partially supervised classification using weighted unsupervised clustering , 1999, IEEE Trans. Geosci. Remote. Sens..

[36]  William A. Gale,et al.  A sequential algorithm for training text classifiers , 1994, SIGIR '94.

[37]  B. Jeon,et al.  Spatio-temporal contextual classification of remotely sensed multispectral data , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[38]  N. Khazenie,et al.  Spatial-temporal Autocorrelated Model For Contextual Classification , 1990 .

[39]  David A. Landgrebe,et al.  Decision fusion approach for multitemporal classification , 1999, IEEE Trans. Geosci. Remote. Sens..

[40]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .