Semisupervised Transfer Component Analysis for Domain Adaptation in Remote Sensing Image Classification

In this paper, we study the problem of feature extraction for knowledge transfer between multiple remotely sensed images in the context of land-cover classification. Several factors such as illumination, atmospheric, and ground conditions cause radiometric differences between images of similar scenes acquired on different geographical areas or over the same scene but at different time instants. Accordingly, a change in the probability distributions of the classes is observed. The purpose of this work is to statistically align in the feature space an image of interest that still has to be classified (the target image) to another image whose ground truth is already available (the source image). Following a specifically designed feature extraction step applied to both images, we show that classifiers trained on the source image can successfully predict the classes of the target image despite the shift that has occurred. In this context, we analyze a recently proposed domain adaptation method aiming at reducing the distance between domains, Transfer Component Analysis, and assess the potential of its unsupervised and semisupervised implementations. In particular, with a dedicated study of its key additional objectives, namely the alignment of the projection with the labels and the preservation of the local data structures, we demonstrate the advantages of Semisupervised Transfer Component Analysis. We compare this approach with other both linear and kernel-based feature extraction techniques. Experiments on multi- and hyperspectral acquisitions show remarkable cross- image classification performances for the considered strategy, thus confirming its suitability when applied to remotely sensed images.

[1]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[2]  Robert H. Fraser,et al.  Signature extension through space for northern landcover classification: A comparison of radiometric correction methods , 2005 .

[3]  Ivor W. Tsang,et al.  Domain Adaptation via Transfer Component Analysis , 2009, IEEE Transactions on Neural Networks.

[4]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[5]  Knut Conradsen,et al.  Multivariate Alteration Detection (MAD) and MAF Postprocessing in Multispectral, Bitemporal Image Data: New Approaches to Change Detection Studies , 1998 .

[6]  Allan Aasbjerg Nielsen,et al.  Kernel Maximum Autocorrelation Factor and Minimum Noise Fraction Transformations , 2011, IEEE Transactions on Image Processing.

[7]  C. Simmer,et al.  Remote Sensing of Angular Characteristics of Canopy Reflectances , 1985, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Lorenzo Bruzzone,et al.  A new search algorithm for feature selection in hyperspectral remote sensing images , 2001, IEEE Trans. Geosci. Remote. Sens..

[9]  William J. Volchok,et al.  Radiometric scene normalization using pseudoinvariant features , 1988 .

[10]  Bernhard Schölkopf,et al.  Remote Sensing Feature Selection by Kernel Dependence Measures , 2010, IEEE Geoscience and Remote Sensing Letters.

[11]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[12]  Francisco Herrera,et al.  A unifying view on dataset shift in classification , 2012, Pattern Recognit..

[13]  Gustavo Camps-Valls,et al.  Semisupervised Manifold Alignment of Multimodal Remote Sensing Images , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Anna Margolis,et al.  A Literature Review of Domain Adaptation with Unlabeled Data , 2011 .

[15]  Bernhard Schölkopf,et al.  Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.

[16]  Ivor W. Tsang,et al.  Domain Transfer Multiple Kernel Learning , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Luis Gómez-Chova,et al.  Graph Matching for Adaptation in Remote Sensing , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[19]  Bernhard Schölkopf,et al.  Measuring Statistical Dependence with Hilbert-Schmidt Norms , 2005, ALT.

[20]  Jesslyn F. Brown,et al.  Measuring phenological variability from satellite imagery , 1994 .

[21]  Jon Atli Benediktsson,et al.  Advances in Hyperspectral Image Classification: Earth Monitoring with Statistical Learning Methods , 2013, IEEE Signal Processing Magazine.

[22]  P. Switzer,et al.  A transformation for ordering multispectral data in terms of image quality with implications for noise removal , 1988 .

[23]  Francesca Bovolo,et al.  Multidimensional Probability Density Function Matching for Preprocessing of Multitemporal Remote Sensing Images , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Curtis E. Woodcock,et al.  Monitoring large areas for forest change using Landsat: Generalization across space, time and Landsat sensors , 2001 .

[25]  Lorenzo Bruzzone,et al.  Unsupervised retraining of a maximum likelihood classifier for the analysis of multitemporal remote sensing images , 2001, IEEE Trans. Geosci. Remote. Sens..

[26]  Bernhard Schölkopf,et al.  Generalized Clustering via Kernel Embeddings , 2009, KI.

[27]  Godfried T. Toussaint,et al.  Some Inequalities Between Distance Measures for Feature Evaluation , 1972, IEEE Transactions on Computers.

[28]  M. Canty,et al.  Automatic radiometric normalization of multitemporal satellite imagery , 2004 .

[29]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[30]  David A. Landgrebe,et al.  The effect of unlabeled samples in reducing the small sample size problem and mitigating the Hughes phenomenon , 1994, IEEE Trans. Geosci. Remote. Sens..

[31]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[32]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[33]  Lorenzo Bruzzone,et al.  A multiple-cascade-classifier system for a robust and partially unsupervised updating of land-cover maps , 2002, IEEE Trans. Geosci. Remote. Sens..

[34]  Luis Gómez-Chova,et al.  Semisupervised Image Classification With Laplacian Support Vector Machines , 2008, IEEE Geoscience and Remote Sensing Letters.

[35]  Kaare Brandt Petersen,et al.  Kernel Multivariate Analysis in Remote Sensing Feature Extraction , 2009 .

[36]  Robert A. Schowengerdt,et al.  Remote sensing, models, and methods for image processing , 1997 .

[37]  Jocelyn Chanussot,et al.  Decision Fusion for the Classification of Hyperspectral Data: Outcome of the 2008 GRS-S Data Fusion Contest , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Lorenzo Bruzzone,et al.  A Novel Approach to the Selection of Spatially Invariant Features for the Classification of Hyperspectral Images With Improved Generalization Capability , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[39]  Lorenzo Bruzzone,et al.  Mean Map Kernel Methods for Semisupervised Cloud Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Bor-Chen Kuo,et al.  Nonparametric weighted feature extraction for classification , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[42]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[43]  Lorenzo Bruzzone,et al.  Domain Adaptation Problems: A DASVM Classification Technique and a Circular Validation Strategy , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[45]  C. Woodcock,et al.  Classification and Change Detection Using Landsat TM Data: When and How to Correct Atmospheric Effects? , 2001 .

[46]  Hanyun Wang,et al.  Learn Multiple-Kernel SVMs for Domain Adaptation in Hyperspectral Data , 2013, IEEE Geoscience and Remote Sensing Letters.

[47]  Hans-Peter Kriegel,et al.  Integrating structured biological data by Kernel Maximum Mean Discrepancy , 2006, ISMB.

[48]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[49]  Conghe Song,et al.  Forest mapping with a generalized classifier and Landsat TM data , 2001 .

[50]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[51]  Neil D. Lawrence,et al.  Dataset Shift in Machine Learning , 2009 .

[52]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[53]  H. Hotelling Analysis of a complex of statistical variables into principal components. , 1933 .

[54]  William J. Emery,et al.  Using active learning to adapt remote sensing image classifiers , 2011 .