Active Transfer Learning Network: A Unified Deep Joint Spectral–Spatial Feature Learning Model for Hyperspectral Image Classification

Deep learning has recently attracted significant attention in the field of hyperspectral images (HSIs) classification. However, the construction of an efficient deep neural network mostly relies on a large number of labeled samples being available. To address this problem, this paper proposes a unified deep network, combined with active transfer learning (TL) that can be well-trained for HSIs classification using only minimally labeled training data. More specifically, deep joint spectral–spatial feature is first extracted through hierarchical stacked sparse autoencoder (SSAE) networks. Active TL is then exploited to transfer the pretrained SSAE network and the limited training samples from the source domain to the target domain, where the SSAE network is subsequently fine-tuned using the limited labeled samples selected from both source and target domains by the corresponding active learning (AL) strategies. The advantages of our proposed method are threefold: 1) the network can be effectively trained using only limited labeled samples with the help of novel AL strategies; 2) the network is flexible and scalable enough to function across various transfer situations, including cross data set and intraimage; and 3) the learned deep joint spectral–spatial feature representation is more generic and robust than many joint spectral–spatial feature representations. Extensive comparative evaluations demonstrate that our proposed method significantly outperforms many state-of-the-art approaches, including both traditional and deep network-based methods, on three popular data sets.

[1]  Liang-pei Zhang,et al.  Spectral-spatial classification of hyperspectral imagery with cooperative game , 2018 .

[2]  Ye Zhang,et al.  Classification of hyperspectral image based on deep belief networks , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[3]  Johannes R. Sveinsson,et al.  Spectral and spatial classification of hyperspectral data using SVMs and morphological profiles , 2008, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[4]  Lorenzo Bruzzone,et al.  Active Learning for Domain Adaptation in the Supervised Classification of Remote Sensing Images , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Shutao Li,et al.  Learning to Diversify Deep Belief Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Antonio J. Plaza,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 1 Spectral–Spatial Classification of Hyperspectral Data Usi , 2022 .

[7]  Simon J. Doran,et al.  Stacked Autoencoders for Unsupervised Feature Learning and Multiple Organ Detection in a Pilot Study Using 4D Patient Data , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[9]  Shihong Du,et al.  Spectral–Spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Jonathan Cheung-Wai Chan,et al.  Learning and Transferring Deep Joint Spectral–Spatial Features for Hyperspectral Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Jason Weston,et al.  Semisupervised Neural Networks for Efficient Hyperspectral Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Uwe Stilla,et al.  Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[15]  Lei Guo,et al.  When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Jie Xu,et al.  Multi-Class Support Vector Machine via Maximizing Multi-Class Margins , 2017, IJCAI.

[17]  Chao Li,et al.  Active multi-kernel domain adaptation for hyperspectral image classification , 2017, Pattern Recognit..

[18]  Lorenzo Bruzzone,et al.  Batch-Mode Active-Learning Methods for the Interactive Classification of Remote Sensing Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Saurabh Prasad,et al.  Limitations of Principal Components Analysis for Hyperspectral Target Recognition , 2008, IEEE Geoscience and Remote Sensing Letters.

[20]  Claudio Persello,et al.  Interactive Domain Adaptation for the Classification of Remote Sensing Images Using Active Learning , 2013, IEEE Geoscience and Remote Sensing Letters.

[21]  Peter J. Mumby,et al.  Remote sensing of the coastal zone: An overview and priorities for future research , 2003 .

[22]  Antonio J. Plaza,et al.  Semi-supervised hyperspectral image classification using a new (soft) sparse multinomial logistic regression model , 2011, WHISPERS.

[23]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[24]  Lorenzo Bruzzone,et al.  A Novel Transductive SVM for Semisupervised Classification of Remote-Sensing Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[25]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Lin Zhao,et al.  Blind spectral unmixing based on sparse component analysis for hyperspectral remote sensing imagery , 2016 .

[27]  Lyle H. Ungar,et al.  Machine Learning manuscript No. (will be inserted by the editor) Active Learning for Logistic Regression: , 2007 .

[28]  Zhang Liangpei,et al.  Spatial-Spectral Unsupervised Convolutional Sparse Auto-Encoder Classifier for Hyperspectral Imagery , 2017 .

[29]  Giles M. Foody,et al.  A relative evaluation of multiclass image classification by support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[31]  Tülay Yildirim,et al.  Unsupervised classification of hyperspectral-image data using fuzzy approaches that spatially exploit membership relations , 2008, IEEE Geoscience and Remote Sensing Letters.

[32]  Yansheng Li,et al.  Unsupervised Spectral–Spatial Feature Learning With Stacked Sparse Autoencoder for Hyperspectral Imagery Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[33]  Antonio J. Plaza,et al.  Semisupervised Hyperspectral Image Classification Using Soft Sparse Multinomial Logistic Regression , 2013, IEEE Geoscience and Remote Sensing Letters.

[34]  F. M. Lacar,et al.  Use of hyperspectral imagery for mapping grape varieties in the Barossa Valley, South Australia , 2001, IGARSS 2001. Scanning the Present and Resolving the Future. Proceedings. IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217).

[35]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[36]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[37]  Xuelong Li,et al.  Large Sparse Cone Non-negative Matrix Factorization for Image Annotation , 2017, ACM Trans. Intell. Syst. Technol..

[38]  Zexuan Zhu,et al.  Computational intelligence in optical remote sensing image processing , 2018, Appl. Soft Comput..

[39]  Jiang Li,et al.  Dimensionality reduction of hyperspectral data using discrete wavelet transform feature extraction , 2002, IEEE Trans. Geosci. Remote. Sens..

[40]  F. Meer Analysis of spectral absorption features in hyperspectral imagery , 2004 .

[41]  Rong Wang,et al.  Robust 2DPCA With Non-greedy $\ell _{1}$ -Norm Maximization for Image Analysis , 2015, IEEE Transactions on Cybernetics.

[42]  Cheng Deng,et al.  Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[43]  Peng Liu,et al.  Ieee Journal of Selected Topics in Applied Earth Observations and Remote Sensing 1 Active Deep Learning for Classification of Hyperspectral Images , 2022 .

[44]  Liangpei Zhang,et al.  Spatial Group Sparsity Regularized Nonnegative Matrix Factorization for Hyperspectral Unmixing , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[45]  Jon Atli Benediktsson,et al.  A Survey on Spectral–Spatial Classification Techniques Based on Attribute Profiles , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[46]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[47]  Jon Atli Benediktsson,et al.  Automatic Framework for Spectral–Spatial Classification Based on Supervised Feature Extraction and Morphological Attribute Profiles , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[48]  Zhenfeng Shao,et al.  Deep feature representations for high-resolution remote sensing scene classification , 2016, 2016 4th International Workshop on Earth Observation and Remote Sensing Applications (EORSA).

[49]  Léon Bottou,et al.  Stochastic Gradient Descent Tricks , 2012, Neural Networks: Tricks of the Trade.

[50]  Rong Wang,et al.  Diagonal principal component analysis with non-greedy ℓ1-norm maximization for face recognition , 2016, Neurocomputing.

[51]  Fuchun Sun,et al.  Building feature space of extreme learning machine with sparse denoising stacked-autoencoder , 2016, Neurocomputing.