Hyperspectral Image Classification via a Novel Spectral-Spatial 3D ConvLSTM-CNN

In recent years, deep learning-based models have produced encouraging results for hyperspectral image (HSI) classification. Specifically, Convolutional Long Short-Term Memory (ConvLSTM) has shown good performance for learning valuable features and modeling long-term dependencies in spectral data. However, it is less effective for learning spatial features, which is an integral part of hyperspectral images. Alternatively, convolutional neural networks (CNNs) can learn spatial features, but they possess limitations in handling long-term dependencies due to the local feature extraction in these networks. Considering these factors, this paper proposes an end-to-end Spectral-Spatial 3D ConvLSTM-CNN based Residual Network (SSCRN), which combines 3D ConvLSTM and 3D CNN for handling both spectral and spatial information, respectively. The contribution of the proposed network is twofold. Firstly, it addresses the long-term dependencies of spectral dimension using 3D ConvLSTM to capture the information related to various ground materials effectively. Secondly, it learns the discriminative spatial features using 3D CNN by employing the concept of the residual blocks to accelerate the training process and alleviate the overfitting. In addition, SSCRN uses batch normalization and dropout to regularize the network for smooth learning. The proposed framework is evaluated on three benchmark datasets widely used by the research community. The results confirm that SSCRN outperforms state-of-the-art methods with an overall accuracy of 99.17%, 99.67%, and 99.31% over Indian Pines, Salinas, and Pavia University datasets, respectively. Moreover, it is worth mentioning that these excellent results were achieved with comparatively fewer epochs, which also confirms the fast learning capabilities of the SSCRN.

[1]  Qian Du,et al.  Feature Extraction and Classification Based on Spatial-Spectral ConvLSTM Neural Network for Hyperspectral Images , 2019, ArXiv.

[2]  Mary B Stuart,et al.  Hyperspectral Imaging in Environmental Monitoring: A Review of Recent Developments and Technological Advances in Compact Field Deployable Systems , 2019, Sensors.

[3]  Lei Qu,et al.  Triple-Attention-Based Parallel Network for Hyperspectral Image Classification , 2021, Remote. Sens..

[4]  Mohamed Farah,et al.  Hyperspectral imagery classification based on semi-supervised 3-D deep neural network and adaptive band selection , 2019, Expert Syst. Appl..

[5]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[6]  Baojun Zhao,et al.  Spectral–spatial classification of hyperspectral remote sensing image based on capsule network , 2019, The Journal of Engineering.

[7]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[8]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[9]  Bardia Yousefi,et al.  Mineral identification in LWIR hyperspectral imagery applying sparse-based clustering , 2018, Quantitative InfraRed Thermography Journal.

[10]  W. Maes,et al.  Perspectives for Remote Sensing with Unmanned Aerial Vehicles in Precision Agriculture. , 2019, Trends in plant science.

[11]  Qingshan Liu,et al.  Hyperspectral Image Classification Using Spectral-Spatial LSTMs , 2017, CCCV.

[12]  Mustaqeem,et al.  CLSTM: Deep Feature-Based Speech Emotion Recognition Using the Hierarchical ConvLSTM Network , 2020 .

[13]  Jun Huang,et al.  Spectral-Spatial Attention Networks for Hyperspectral Image Classification , 2019, Remote. Sens..

[14]  Yong Xiao,et al.  CSA-MSO3DCNN: Multiscale Octave 3D CNN with Channel and Spatial Attention for Hyperspectral Image Classification , 2020, Remote. Sens..

[15]  Mustaqeem,et al.  1D-CNN: Speech Emotion Recognition System Using a Stacked Network with Dilated CNN Features , 2021 .

[16]  Antonio J. Plaza,et al.  Multi-Channel Morphological Profiles for Classification of Hyperspectral Images Using Support Vector Machines , 2009, Sensors.

[17]  Anthony M. Filippi,et al.  Hyperspectral Image Classification Using Similarity Measurements-Based Deep Recurrent Neural Networks , 2019, Remote. Sens..

[18]  Jonathan Cheung-Wai Chan,et al.  Hyperspectral Images Classification Based on Dense Convolutional Networks with Spectral-Wise Attention Mechanism , 2019, Remote. Sens..

[19]  Feng Liu,et al.  Three-Dimensional ResNeXt Network Using Feature Fusion and Label Smoothing for Hyperspectral Image Classification , 2020, Sensors.

[20]  Hassan Ghassemian,et al.  A probabilistic SVM approach for hyperspectral image classification using spectral and texture features , 2017 .

[21]  Qingshan Liu,et al.  Bidirectional-Convolutional LSTM Based Spectral-Spatial Feature Learning for Hyperspectral Image Classification , 2017, Remote. Sens..

[22]  Bing Tu,et al.  Multiple convolutional layers fusion framework for hyperspectral image classification , 2019, Neurocomputing.

[23]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[24]  Fan Feng,et al.  Learning Deep Hierarchical Spatial–Spectral Features for Hyperspectral Image Classification Based on Residual 3D-2D CNN , 2019, Sensors.