Three-dimensional densely connected convolutional network for hyperspectral remote sensing image classification

Abstract. Hyperspectral remote sensing images (HSIs) are rich in spatial and spectral information, thus they help to enhance the ability to distinguish geographic objects. In recent years, great progress have been made in image classification using deep learning (such as 2D-CNN and 3D-CNN). Compared with traditional machine learning methods, deep learning methods can automatically extract the abstract features from low to high levels and convert the images into more easily recognizable features. Most HSI classification tasks focus on spectral information but often ignore the rich spatial structures in HSIs, leading to a low classification accuracy. Moreover, most supervised learning methods use shallow structures in HSI classifications and hence exhibit weak performance in finding sparse geographic objects. We proposed to use the three-dimensional (3-D) structure to extract spectral–spatial information to build a deep neural network for HSI classifications. Based on DenseNet, the 3D densely connected convolutional network was improved to learn spectral-spatial features of HSIs. The densely connected structure can enhance feature transmission, support feature reuse, improve information flow in the network, and make deeper networks easier to train. The 3D-DenseNet has a deeper structure than 3D-CNN, thus it can learn more robust spectral–spatial features from HSIs. In fact, the deeper network structure has a regularized effect, which can effectively reduce overfitting on small sample datasets. The network uses HSIs instead of feature engineering as input data and is trained in an end-to-end manner. The experimental results of this model on the Indian Pines datasets and the Pavia University datasets show that deeper neural networks further improve the classification of complex objects, especially in the areas where geographic objects are sparse. It effectively improves the classification accuracy of HSIs.

[1]  Bo Du,et al.  Spectral–Spatial Unified Networks for Hyperspectral Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[3]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[5]  Kun Tan,et al.  A novel binary tree support vector machine for hyperspectral remote sensing image classification , 2012 .

[6]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Chein-I. Chang Hyperspectral Imaging: Techniques for Spectral Detection and Classification , 2003 .

[8]  Zhiming Luo,et al.  Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Jun Li,et al.  Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[12]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[13]  Shihong Du,et al.  Spectral–Spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[15]  Bo Du,et al.  Hyperspectral image classification via a random patches network , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[16]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[17]  Andrew Y. Ng,et al.  Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[18]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[19]  Shanjun Mao,et al.  Spectral–spatial classification of hyperspectral images using deep convolutional neural networks , 2015 .

[20]  Jia Deng,et al.  A large-scale hierarchical image database , 2009, CVPR 2009.

[21]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[22]  Dongfeng Gu 3D Densely Connected Convolutional Network for the Recognition of Human Shopping Actions , 2017 .

[23]  Shutao Li,et al.  Hyperspectral Image Classification With Deep Feature Fusion Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Nikolaos Doulamis,et al.  Deep supervised learning for hyperspectral data classification through convolutional neural networks , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[25]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[28]  Mark G. Arnold,et al.  Bitstream Efficiency of Field Programmable One-Hot Arrays , 2010, 2010 IEEE Computer Society Annual Symposium on VLSI.