Deep stacking network with coarse features for hyperspectral image classification

Hyperspectral image (HSI) classification attracts increasing attentions in remote sensing community for its academic significance and potential wide applications. Although most of used existing “intelligent” methods extracting the features of original hyperspectral data are based on shallow-layer networks such as neural network (NN) and support vector machine (SVM) for its simplicity in realization, two deep neural networks (DNN), i.e. the deep convolutional network (DCN) and the deep belief network (DBN), have been used for HSI classifications with improved performance, as deep learning recently achieves great success for its ability in deep feature extraction and representations. In this paper, another DNN, a deep stacking network (DSN) based approach with coarse features for hyperspectral image classification is proposed, as its advantages over shadow networks and other deep models in its simplicity and batch-mode learning — not requiring stochastic gradient descent that other DNNs require. In this approach, the inputs to the DSN are coarse features, i.e. coarse spectral features by band reduction, coarse spatial features extracted by PCA from original HSI, or combination of both. The fine deeper features in the hyperspectral data are gradually obtained by employing the nonlinear activation function on the hidden layer nodes of each module, which is different from the current DSN that usually uses linear weights between hidden layer to output layer. A closed limit on its node's input — output is also exerted. The theoretical analysis and experimental results with AVIRIS hyperspectral image have shown that, (1) the proposed DSN approach achieves improved classification performance compared with shadow networks, (2) the new DSN approach with both the coarse spectral feature and coarse spatial feature has higher classification accuracy compared to that with either only, (3) this study has insight of necessity to develop new deep neural network for HSI classification from direct HSI data.

[1]  Dong Yu,et al.  Scalable stacking and learning for building deep architectures , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[3]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[4]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Nicolas Le Roux,et al.  Representational Power of Restricted Boltzmann Machines and Deep Belief Networks , 2008, Neural Computation.

[6]  Mark A. Richardson,et al.  An introduction to hyperspectral imaging and its application for security, surveillance and target acquisition , 2010 .

[7]  Ye Zhang,et al.  Classification of hyperspectral image based on deep belief networks , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[8]  D. Lu,et al.  Remote Sensing Image Classification , 2011 .

[9]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[10]  Yicong Zhou,et al.  Learning Hierarchical Spectral–Spatial Features for Hyperspectral Image Classification , 2016, IEEE Transactions on Cybernetics.

[11]  Nikolaos Doulamis,et al.  Deep supervised learning for hyperspectral data classification through convolutional neural networks , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[12]  Jianfeng Gao,et al.  Deep stacking networks for information retrieval , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Mei Shao-hui Advance in Feature Mining from Hyperspectral Remote Sensing Data , 2013 .

[14]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[15]  Dong Yu,et al.  Deep Convex Net: A Scalable Architecture for Speech Pattern Classification , 2011, INTERSPEECH.