SSDANet: Spectral-Spatial Three-Dimensional Convolutional Neural Network for Hyperspectral Image Classification

Recently, the classification of hyperspectral images has made great process. Especially, the classification methods based on three-dimensional convolutional neural network have remarkable performance due to the uniqueness of hyperspectral images. However, the hyperspectral classification still faces great challenges due to a series of problems such as the insufficient extraction of spectral-spatial features, the lack of labeled samples, the large amount of noise, the tendency of overfitting and so on. Therefore, SSDANet is proposed to solve the above problems and promote the further development of hyperspectral classification technology based on deep learning. SSDANet is a spectral-spatial three-dimensional convolutional neural network with a deep and wide structure that can significantly improve classification performance. In SSDANet, the spectral-spatial dense connectivity is put forward to protect the integrity of information. It is made up of the spectral branch and the spatial branch, which can learn and reuse the spectral-spatial features. Besides, the spectral-spatial attention mechanism is proposed to adapt the special structure of hyperspectral images. It can excite important spectral-spatial information and suppress unimportant spectral-spatial information. In addition, a series of optimization methods including data augmentation, batch normalization, dropout, exponential decay learning rate, and L2 regularization are adopted to alleviate the problem of overfitting and improve the classification results. To verify the performance of SSDANet, experiments were implemented on two widely used datasets—Pavia University and Indian Pines. Under the condition of limited labeled samples, the classification evaluation indexes of OA, AA, and Kappa on the two datasets all exceeded 99%, reaching state-of-the-art performance.

[1]  Bor-Chen Kuo,et al.  Feature Mining for Hyperspectral Image Classification , 2013, Proceedings of the IEEE.

[2]  Shutao Li,et al.  Learning to Diversify Deep Belief Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Kunihiko Fukushima,et al.  Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Visual Pattern Recognition , 1982 .

[4]  Trac D. Tran,et al.  Hyperspectral Image Classification Using Dictionary-Based Sparse Representation , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Yunlong Shang,et al.  A Data-Driven Approach With Uncertainty Quantification for Predicting Future Capacities and Remaining Useful Life of Lithium-ion Battery , 2021, IEEE Transactions on Industrial Electronics.

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Lorenzo Bruzzone,et al.  An extension of the Jeffreys-Matusita distance to multiclass cases for feature selection , 1995, IEEE Trans. Geosci. Remote. Sens..

[8]  T. Kailath The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Qian Du,et al.  Hyperspectral Image Classification Using Deep Pixel-Pair Features , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Simon B Eickhoff,et al.  Your evidence? Machine learning algorithms for medical diagnosis and prediction , 2019, Human brain mapping.

[12]  Zhouchen Lin,et al.  Convolutional Neural Networks with Alternately Updated Clique , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[14]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Lorenzo Bruzzone,et al.  Kernel-based methods for hyperspectral image classification , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Yi Li,et al.  Gaussian Process Regression With Automatic Relevance Determination Kernel for Calendar Aging Prediction of Lithium-Ion Batteries , 2020, IEEE Transactions on Industrial Informatics.

[19]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[20]  Allan Aasbjerg Nielsen,et al.  Kernel Maximum Autocorrelation Factor and Minimum Noise Fraction Transformations , 2011, IEEE Transactions on Image Processing.

[21]  Naoto Yokoya,et al.  Potential of Resolution-Enhanced Hyperspectral Data for Mineral Mapping Using Simulated EnMAP and Sentinel-2 Images , 2016, Remote. Sens..

[22]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[23]  Yi Li,et al.  Modified Gaussian Process Regression Models for Cyclic Capacity Prediction of Lithium-Ion Batteries , 2019, IEEE Transactions on Transportation Electrification.

[24]  Tan Hai-fen Research on background depression in hyperspectral image anomaly detection , 2016 .

[25]  Yizhao Ni,et al.  Finding warning markers: Leveraging natural language processing and machine learning technologies to detect risk of school violence , 2020, Int. J. Medical Informatics.

[26]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[27]  Shutao Li,et al.  PCA-Based Edge-Preserving Features for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[28]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[29]  Shihong Du,et al.  Multi-Scale Dense Networks for Hyperspectral Remote Sensing Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[30]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[31]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Fei Gao,et al.  Three-dimensional densely connected convolutional network for hyperspectral remote sensing image classification , 2019, Journal of Applied Remote Sensing.

[33]  Bing Liu,et al.  Spectral–spatial classification of hyperspectral image using three-dimensional convolution network , 2018 .

[34]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Licheng Jiao,et al.  CNN-Based Multilayer Spatial–Spectral Feature Fusion and Sample Augmentation With Local and Nonlocal Constraints for Hyperspectral Image Classification , 2019, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[36]  Bidyut Baran Chaudhuri,et al.  HybridSN: Exploring 3-D–2-D CNN Feature Hierarchy for Hyperspectral Image Classification , 2019, IEEE Geoscience and Remote Sensing Letters.

[37]  Furong Gao,et al.  Model Migration Neural Network for Predicting Battery Aging Trajectories , 2020, IEEE Transactions on Transportation Electrification.

[38]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[39]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[40]  Jing Wang,et al.  Independent component analysis-based dimensionality reduction with applications in hyperspectral image analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[42]  Jonathan Cheung-Wai Chan,et al.  Hyperspectral Images Classification Based on Dense Convolutional Networks with Spectral-Wise Attention Mechanism , 2019, Remote. Sens..

[43]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Johannes R. Sveinsson,et al.  Classification of hyperspectral data from urban areas based on extended morphological profiles , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[45]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.

[46]  Nicola Caporaso,et al.  Non-destructive analysis of sucrose, caffeine and trigonelline on single green coffee beans by hyperspectral imaging , 2018, Food research international.

[47]  Haokui Zhang,et al.  Spectral-spatial classification of hyperspectral imagery using a dual-channel convolutional neural network , 2017 .

[48]  Jonathan Cheung-Wai Chan,et al.  Learning and Transferring Deep Joint Spectral–Spatial Features for Hyperspectral Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[49]  Zhiming Luo,et al.  Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[50]  S. Maruo,et al.  Simple autofocusing method by image processing using transmission images for large-scale two-photon lithography. , 2020, Optics express.

[51]  Farid Melgani,et al.  Nearest Neighbor Classification of Remote Sensing Images With the Maximal Margin Principle , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.

[53]  Bo Du,et al.  Beyond the Patchwise Classification: Spectral-Spatial Fully Convolutional Networks for Hyperspectral Image Classification , 2020, IEEE Transactions on Big Data.

[54]  David A. Landgrebe,et al.  Hyperspectral image data analysis , 2002, IEEE Signal Process. Mag..

[55]  Lin Zhu,et al.  Generative Adversarial Networks for Hyperspectral Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[56]  Antonio J. Plaza,et al.  Deep&Dense Convolutional Neural Network for Hyperspectral Image Classification , 2018, Remote. Sens..

[57]  Jon Atli Benediktsson,et al.  Classification of Hyperspectral Images by Exploiting Spectral–Spatial Information of Superpixel via Multiple Kernels , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[58]  N. Keshava,et al.  Distance metrics and band selection in hyperspectral processing with applications to material identification and spectral libraries , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[59]  Zhihao Qin,et al.  Estimation of Crop LAI using hyperspectral vegetation indices and a hybrid inversion method , 2015 .

[60]  Wenju Wang,et al.  A Fast Dense Spectral-Spatial Convolution Network Framework for Hyperspectral Images Classification , 2018, Remote. Sens..

[61]  Weiwei Sun,et al.  Spatial-Spectral Squeeze-and-Excitation Residual Network for Hyperspectral Image Classification , 2019, Remote. Sens..