Modifications of the Multi-Layer Perceptron for Hyperspectral Image Classification

Recently, many convolutional neural network (CNN)-based methods have been proposed to tackle the classification task of hyperspectral images (HSI). In fact, CNN has become the de-facto standard for HSI classification. It seems that the traditional neural networks such as multi-layer perceptron (MLP) are not competitive for HSI classification. However, in this study, we try to prove that the MLP can achieve good classification performance of HSI if it is properly designed and improved. The proposed Modified-MLP for HSI classification contains two special parts: spectral–spatial feature mapping and spectral–spatial information mixing. Specifically, for spectral–spatial feature mapping, each input sample of HSI is divided into a sequence of 3D patches with fixed length and then a linear layer is used to map the 3D patches to spectral–spatial features. For spectral–spatial information mixing, all the spectral–spatial features within a single sample are feed into the solely MLP architecture to model the spectral–spatial information across patches for following HSI classification. Furthermore, to obtain the abundant spectral–spatial information with different scales, Multiscale-MLP is proposed to aggregate neighboring patches with multiscale shapes for acquiring abundant spectral–spatial information. In addition, the Soft-MLP is proposed to further enhance the classification performance by applying soft split operation, which flexibly capture the global relations of patches at different positions in the input HSI sample. Finally, label smoothing is introduced to mitigate the overfitting problem in the Soft-MLP (Soft-MLP-L), which greatly improves the classification performance of MLP-based method. The proposed Modified-MLP, Multiscale-MLP, Soft-MLP, and Soft-MLP-L are tested on the three widely used hyperspectral datasets. The proposed Soft-MLP-L leads to the highest OA, which outperforms CNN by 5.76%, 2.55%, and 2.5% on the Salinas, Pavia, and Indian Pines datasets, respectively. The obtained results reveal that the proposed models provide competitive results compared to the state-of-the-art methods, which shows that the MLP-based methods are still competitive for HSI classification.

[1]  C. Giardino,et al.  Measuring freshwater aquatic ecosystems: The need for a hyperspectral global mapping satellite mission , 2015 .

[2]  J. Anthony Gualtieri,et al.  Support vector machines for hyperspectral remote sensing classification , 1999, Other Conferences.

[3]  Rico Sennrich,et al.  Why Self-Attention? A Targeted Evaluation of Neural Machine Translation Architectures , 2018, EMNLP.

[4]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.

[5]  Timo Aila,et al.  Pruning Convolutional Neural Networks for Resource Efficient Inference , 2016, ICLR.

[6]  Mingyou Chen,et al.  3D global mapping of large-scale unstructured orchard integrating eye-in-hand stereo vision and SLAM , 2021, Comput. Electron. Agric..

[7]  Xiangjun Zou,et al.  High-accuracy multi-camera reconstruction enhanced by adaptive point cloud correction algorithm , 2019, Optics and Lasers in Engineering.

[8]  I. El-Magd,et al.  Quantitative hyperspectral analysis for characterization of the coastal water from Damietta to Port Said, Egypt , 2014 .

[9]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[10]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[11]  G.Kalaiarasi,et al.  Frost filtered scale-invariant feature extraction and multilayer perceptron for hyperspectral image classification , 2020 .

[12]  Yuri Sousa Aurelio,et al.  Learning from Imbalanced Data Sets with Weighted Cross-Entropy Function , 2019, Neural Processing Letters.

[13]  Xiangjun Zou,et al.  Recognition and Localization Methods for Vision-Based Fruit Picking Robots: A Review , 2020, Frontiers in Plant Science.

[14]  Yue Wu,et al.  Double-Branch Multi-Attention Mechanism Network for Hyperspectral Image Classification , 2019, Remote. Sens..

[15]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[17]  闫敬文 Yan Jing-wen,et al.  Overview of hyperspectral image classification , 2019 .

[18]  Shunyi Zheng,et al.  Classification of Hyperspectral Image Based on Double-Branch Dual-Attention Mechanism Network , 2020, Remote. Sens..

[19]  Jing Wang,et al.  Sea Ice Detection Based on an Improved Similarity Measurement Method Using Hyperspectral Data , 2017, 2017 IEEE 14th International Conference on Networking, Sensing and Control (ICNSC).

[20]  Kevin Gimpel,et al.  Gaussian Error Linear Units (GELUs) , 2016 .

[21]  Deepak Mishra,et al.  Hyper spectral image classification using multilayer perceptron neural network & functional link ANN , 2017, 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence.