Multiscale principle of relevant information for hyperspectral image classification

This paper proposes a novel architecture, termed multiscale principle of relevant information (MPRI), to learn discriminative spectral-spatial features for hyperspectral image (HSI) classification. MPRI inherits the merits of the principle of relevant information (PRI) to effectively extract multiscale information embedded in the given data, and also takes advantage of the multilayer structure to learn representations in a coarse-to-fine manner. Specifically, MPRI performs spectral-spatial pixel characterization (using PRI) and feature dimensionality reduction (using regularized linear discriminant analysis) iteratively and successively. Extensive experiments on three benchmark data sets demonstrate that MPRI outperforms existing state-of-the-art methods (including deep learning based ones) qualitatively and quantitatively, especially in the scenario of limited training samples.

[1]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[3]  J. Benediktsson,et al.  New Frontiers in Spectral-Spatial Hyperspectral Image Classification: The Latest Advances Based on Mathematical Morphology, Markov Random Fields, Segmentation, Sparse Representation, and Deep Learning , 2018, IEEE Geoscience and Remote Sensing Magazine.

[4]  Naoto Yokoya,et al.  Advances in Hyperspectral Image and Signal Processing: A Comprehensive Overview of the State of the Art , 2017, IEEE Geoscience and Remote Sensing Magazine.

[5]  Jon Atli Benediktsson,et al.  Multiple Feature Learning for Hyperspectral Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Xia Xu,et al.  Hierarchical Guidance Filtering-Based Ensemble Classification for Hyperspectral Images , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Xia Xu,et al.  R-VCANet: A New Deep-Learning-Based Hyperspectral Image Classification Method , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Zhiming Luo,et al.  Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[10]  José Carlos Príncipe,et al.  An efficient rank-deficient computation of the Principle of Relevant Information , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  H. Chernoff A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the sum of Observations , 1952 .

[13]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[14]  T. Hastie,et al.  Principal Curves , 2007 .

[15]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[17]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[18]  Bo Du,et al.  Hyperspectral image classification via a random patches network , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[19]  Lorenzo Bruzzone,et al.  Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Chein-I Chang,et al.  An information-theoretic approach to spectral variability, similarity, and discrimination for hyperspectral image analysis , 2000, IEEE Trans. Inf. Theory.

[21]  Sudhir Madhav Rao,et al.  Unsupervised learning: An information theoretic framework , 2008 .

[22]  Jon Atli Benediktsson,et al.  Automatic Framework for Spectral–Spatial Classification Based on Supervised Feature Extraction and Morphological Attribute Profiles , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[23]  Deyu Meng,et al.  Hyperspectral Image Classification With Markov Random Fields and a Convolutional Neural Network , 2017, IEEE Transactions on Image Processing.

[24]  Jie Geng,et al.  Spectral–Spatial Classification of Hyperspectral Image Based on Deep Auto-Encoder , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[25]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[26]  Naftali Tishby,et al.  Opening the Black Box of Deep Neural Networks via Information , 2017, ArXiv.

[27]  Baocai Yin,et al.  Hyperspectral Image Classification Based on Deep Deconvolution Network With Skip Architecture , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Hassan Ghassemian,et al.  Linear Feature Extraction for Hyperspectral Images Based on Information Theoretic Learning , 2013, IEEE Geoscience and Remote Sensing Letters.

[29]  Antonio J. Plaza,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 1 Spectral–Spatial Classification of Hyperspectral Data Usi , 2022 .

[30]  G. Crooks On Measures of Entropy and Information , 2015 .

[31]  Fang Liu,et al.  Mutual-Information-Based Semi-Supervised Hyperspectral Band Selection With High Discrimination, High Information, and Low Redundancy , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Qian Du,et al.  Modified Fisher's Linear Discriminant Analysis for Hyperspectral Imagery , 2007, IEEE Geoscience and Remote Sensing Letters.

[33]  Ning Zhang,et al.  Hyperspectral Image Classification Based on Nonlinear Spectral–Spatial Network , 2016, IEEE Geoscience and Remote Sensing Letters.

[34]  Shutao Li,et al.  PCA-Based Edge-Preserving Features for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Jon Atli Benediktsson,et al.  A Survey on Spectral–Spatial Classification Techniques Based on Attribute Profiles , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Hamid R. Rabiee,et al.  Spatial-Aware Dictionary Learning for Hyperspectral Image Classification , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Naftali Tishby,et al.  The information bottleneck method , 2000, ArXiv.

[38]  Changzhe Jiao,et al.  Discriminative Multiple Instance Hyperspectral Target Characterization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Jon Atli Benediktsson,et al.  Spectral–Spatial Hyperspectral Image Classification With Edge-Preserving Filtering , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[40]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[41]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[42]  Robert Jenssen,et al.  The Cauchy-Schwarz divergence and Parzen windowing: Connections to graph theory and Mercer kernels , 2006, J. Frankl. Inst..

[43]  Jun Li,et al.  Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Jose C. Principe,et al.  Information Theoretic Learning - Renyi's Entropy and Kernel Perspectives , 2010, Information Theoretic Learning.

[45]  Antonio J. Plaza,et al.  Dimensionality reduction and classification of hyperspectral image data using sequences of extended morphological transformations , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[46]  Stefano Soatto,et al.  Information Dropout: Learning Optimal Representations Through Noisy Computation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Weiwei Song,et al.  Deep Hashing Neural Networks for Hyperspectral Image Feature Extraction , 2019, IEEE Geoscience and Remote Sensing Letters.

[48]  Robert Jenssen,et al.  Multivariate Extension of Matrix-based Renyi's α-order Entropy Functional , 2020, IEEE transactions on pattern analysis and machine intelligence.

[49]  Lorenzo Bruzzone,et al.  A Deep Network Architecture for Super-Resolution-Aided Hyperspectral Image Classification With Classwise Loss , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[50]  S LuisGonzalo AN EFFICIENT RANK-DEFICIENT COMPUTATION OF THE PRINCIPLE OF RELEVANT INFORMATION , 2011 .

[51]  Antonio J. Plaza,et al.  A New Spatial–Spectral Feature Extraction Method for Hyperspectral Images Using Local Covariance Matrix Representation , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Raymond Y. K. Lau,et al.  Hyperspectral Image Classification With Deep Learning Models , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[53]  Trac D. Tran,et al.  Hyperspectral Image Classification Using Dictionary-Based Sparse Representation , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[54]  M. F. Baumgardner,et al.  220 Band AVIRIS Hyperspectral Image Data Set: June 12, 1992 Indian Pine Test Site 3 , 2015 .

[55]  Robert Jenssen,et al.  Multivariate Extension of Matrix-Based Rényi's $\alpha$α-Order Entropy Functional , 2018, IEEE Trans. Pattern Anal. Mach. Intell..