Unbiased query-by-bagging active learning for VHR image classification

A key factor for the success of supervised remote sensing image classification is the definition of an efficient training set. Suboptimality in the selection of the training samples can bring to low classification performance. Active learning algorithms aim at building the training set in a smart and efficient way, by finding the most relevant samples for model improvement and thus iteratively improving the classification performance. In uncertaintybased approaches, a user-defined heuristic ranks the unlabeled samples according to the classifier's uncertainty about their class membership. Finally, the user is asked to define the labels of the pixels scoring maximum uncertainty. In the present work, an unbiased uncertainty scoring function encouraging sampling diversity is investigated. A modified version of the Entropy Query by Bagging (EQB) approach is presented and tested on very high resolution imagery using both SVM and LDA classifiers. Advantages of favoring diversity in the heuristics are discussed. By the diverse sampling it enhances, the unbiased approach proposed leads to higher convergence rates in the first iterations for both the models considered.

[1]  William J. Emery,et al.  Active Learning Methods for Remote Sensing Image Classification , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Andrew McCallum,et al.  Toward Optimal Active Learning through Sampling Estimation of Error Reduction , 2001, ICML.

[3]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[4]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[5]  Naoki Abe,et al.  Query Learning Strategies Using Boosting and Bagging , 1998, ICML.

[6]  Marin Ferecatu,et al.  Interactive Remote-Sensing Image Retrieval Using Active Relevance Feedback , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Greg Schohn,et al.  Less is More: Active Learning with Support Vector Machines , 2000, ICML.

[9]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[10]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Mikhail F. Kanevski,et al.  Advanced active sampling for remote sensing image classification , 2010, 2010 IEEE International Geoscience and Remote Sensing Symposium.

[12]  Lawrence O. Hall,et al.  Active learning to recognize multiple types of plankton , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[13]  Lorenzo Bruzzone,et al.  Kernel methods for remote sensing data analysis , 2009 .

[14]  Jocelyn Chanussot,et al.  Decision Fusion for the Classification of Hyperspectral Data: Outcome of the 2008 GRS-S Data Fusion Contest , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[15]  H. Sebastian Seung,et al.  Query by committee , 1992, COLT '92.

[16]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.