Statistical Loss and Analysis for Deep Learning in Hyperspectral Image Classification

Nowadays, deep learning methods, especially the convolutional neural networks (CNNs), have shown impressive performance on extracting abstract and high-level features from the hyperspectral image. However, the general training process of CNNs mainly considers the pixelwise information or the samples’ correlation to formulate the penalization while ignores the statistical properties especially the spectral variability of each class in the hyperspectral image. These sample-based penalizations would lead to the uncertainty of the training process due to the imbalanced and limited number of training samples. To overcome this problem, this article characterizes each class from the hyperspectral image as a statistical distribution and further develops a novel statistical loss with the distributions, not directly with samples for deep learning. Based on the Fisher discrimination criterion, the loss penalizes the sample variance of each class distribution to decrease the intraclass variance of the training samples. Moreover, an additional diversity-promoting condition is added to enlarge the interclass variance between different class distributions, and this could better discriminate samples from different classes in the hyperspectral image. Finally, the statistical estimation form of the statistical loss is developed with the training samples through multivariant statistical analysis. Experiments over the real-world hyperspectral images show the effectiveness of the developed statistical loss for deep learning.

[1]  Junwei Han,et al.  Learning Compact and Discriminative Stacked Autoencoder for Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[2]  J. Habbema,et al.  Selection of Variables in Discriminant Analysis by F-statistic and Error Rate , 1977 .

[3]  Kavita Bala,et al.  Learning visual similarity for product design with convolutional neural networks , 2015, ACM Trans. Graph..

[4]  Qi Wang,et al.  Salient Band Selection for Hyperspectral Image Classification via Manifold Ranking , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[6]  Shuyuan Yang,et al.  Deep Fully Convolutional Network-Based Spatial Distribution Prediction for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[7]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Hong Li,et al.  Hyperspectral Image Classification Using Functional Data Analysis , 2014, IEEE Transactions on Cybernetics.

[9]  Yunsong Li,et al.  Hyperspectral image reconstruction by deep convolutional neural network for classification , 2017, Pattern Recognit..

[10]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[11]  Lei Guo,et al.  Exploring Hierarchical Convolutional Features for Hyperspectral Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Shutao Li,et al.  Hyperspectral Image Classification With Deep Feature Fusion Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Nasser M. Nasrabadi,et al.  Hyperspectral Target Detection : An Overview of Current and Future Challenges , 2014, IEEE Signal Processing Magazine.

[14]  Charles E. Heckler,et al.  Applied Multivariate Statistical Analysis , 2005, Technometrics.

[15]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[16]  Ajmal Mian,et al.  Nonparametric Coupled Bayesian Dictionary and Classifier Learning for Hyperspectral Classification , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Ping Zhong,et al.  A Hybrid DBN and CRF Model for Spectral-Spatial Classification of Hyperspectral Images , 2017 .

[18]  Jian Wang,et al.  Deep Metric Learning with Angular Loss , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19]  Heesung Kwon,et al.  Going Deeper With Contextual CNN for Hyperspectral Image Classification , 2016, IEEE Transactions on Image Processing.

[20]  Ping Zhong,et al.  Multiple Instance Learning for Multiple Diverse Hyperspectral Target Characterizations , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Ping Zhong,et al.  Diversity-Promoting Deep Structural Metric Learning for Remote Sensing Scene Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[22]  R. H. Myers,et al.  Probability and Statistics for Engineers and Scientists , 1978 .

[23]  Hassan Ghassemian,et al.  Nonparametric feature extraction for classification of hyperspectral images with limited training samples , 2016 .

[24]  David Zhang,et al.  Fisher Discrimination Dictionary Learning for sparse representation , 2011, 2011 International Conference on Computer Vision.

[25]  Qingshan Liu,et al.  Cascaded Recurrent Neural Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Shutao Li,et al.  A CNN With Multiscale Convolution and Diversified Metric for Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Jon Atli Benediktsson,et al.  Deep Learning for Hyperspectral Image Classification: An Overview , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Xiao Xiang Zhu,et al.  Deep Recurrent Neural Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Shutao Li,et al.  Feature Extraction With Multiscale Covariance Maps for Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Bo Du,et al.  Hyperspectral Anomaly Detection via a Sparsity Score Estimation Framework , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Zhiming Luo,et al.  Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[34]  Zhenwei Shi,et al.  Hierarchical Suppression Method for Hyperspectral Target Detection , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Qian Du,et al.  Hyperspectral Image Classification Using Deep Pixel-Pair Features , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Gary A. Shaw,et al.  Hyperspectral Image Processing for Automatic Target Detection Applications , 2003 .

[37]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[38]  Shutao Li,et al.  Learning to Diversify Deep Belief Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[39]  Yicong Zhou,et al.  Learning Hierarchical Spectral–Spatial Features for Hyperspectral Image Classification , 2016, IEEE Transactions on Cybernetics.

[40]  Silvio Savarese,et al.  Deep Metric Learning via Lifted Structured Feature Embedding , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Jon Atli Benediktsson,et al.  Set-to-Set Distance-Based Spectral–Spatial Classification of Hyperspectral Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[42]  Antonio Plaza,et al.  A new deep convolutional neural network for fast hyperspectral image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.