FLOP-Reduction Through Memory Allocations Within CNN for Hyperspectral Image Classification

Convolutional neural networks (CNNs) have proven to be a powerful tool for the classification of hyperspectral images (HSIs). The CNN kernels are able to naturally include spatial information to smooth out the spectral variability and the noise present in HSI data. However, these kernels are composed of a large number of learning parameters that must be correctly adjusted to achieve good performance. This forces the model to consume a large amount of training data, being prone to overfitting when limited labeled samples are available. In addition, the execution of kernels is computationally very expensive, increasing quadratically with respect to the size of the convolution filter. This significantly reduces the performance of the model. To overcome the aforementioned limitations, this work presents a new few-parameter CNN (based on shift operations) for HSI classification that dramatically reduces both the number of parameters and the computational complexity of the model in terms of floating-point operations (FLOPs). The operational module combines a shift kernel (which adjusts the input data in particular directions without involving any parameters nor FLOPs) with pointwise convolutions that perform the feature extraction stage. The newly developed shift-based CNN has been employed to conduct HSI classification over five widely used and challenging data sets, achieving very promising results in terms of computational performance and classification accuracy.

[1]  Junwei Han,et al.  Object detection in remote sensing imagery using a discriminatively trained mixture model , 2013 .

[2]  J. Palous,et al.  Machine Learning and Data Mining , 2002 .

[3]  Surya Ganguli,et al.  On the Expressive Power of Deep Neural Networks , 2016, ICML.

[4]  Mahesh Pal,et al.  Multinomial logistic regression-based feature selection for hyperspectral data , 2012, Int. J. Appl. Earth Obs. Geoinformation.

[5]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[6]  Antonio J. Plaza,et al.  Deep Pyramidal Residual Networks for Spectral–Spatial Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Zhiming Luo,et al.  Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Jun Li,et al.  Hyperspectral Image Classification Using Random Occlusion Data Augmentation , 2019, IEEE Geoscience and Remote Sensing Letters.

[9]  Chi Hau Chen,et al.  Statistical pattern recognition in remote sensing , 2008, Pattern Recognit..

[10]  Bidyut Baran Chaudhuri,et al.  HybridSN: Exploring 3-D–2-D CNN Feature Hierarchy for Hyperspectral Image Classification , 2019, IEEE Geoscience and Remote Sensing Letters.

[11]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Antonio J. Plaza,et al.  Recent Developments in High Performance Computing for Remote Sensing: A Review , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[13]  Javier Plaza,et al.  Neural Ordinary Differential Equations for Hyperspectral Image Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Andreas Stolcke,et al.  A study in machine learning from imbalanced data for sentence boundary detection in speech , 2006, Comput. Speech Lang..

[15]  Amir Hossein Alavi,et al.  Machine learning in geosciences and remote sensing , 2016 .

[16]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[17]  Yuchun Ma,et al.  Shift-based Primitives for Efficient Convolutional Neural Networks , 2018, ArXiv.

[18]  Joydeep Ghosh,et al.  Investigation of the random forest framework for classification of hyperspectral data , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[19]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[20]  Edward R. Dougherty,et al.  The peaking phenomenon in the presence of feature-selection , 2008, Pattern Recognit. Lett..

[21]  G. F. Hughes,et al.  On the mean accuracy of statistical pattern recognizers , 1968, IEEE Trans. Inf. Theory.

[22]  Antonio J. Plaza,et al.  Deep&Dense Convolutional Neural Network for Hyperspectral Image Classification , 2018, Remote. Sens..

[23]  Jun Li,et al.  Active Learning With Convolutional Neural Networks for Hyperspectral Image Classification Using a New Bayesian Approach , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[24]  Yuan Zhang,et al.  All You Need Is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Xiao Xiang Zhu,et al.  Learning to Pay Attention on Spectral Domain: A Spectral Attention Module-Based Convolutional Network for Hyperspectral Image Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[27]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[28]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[29]  Antonio J. Plaza,et al.  Fusion of hyperspectral and LiDAR data using morphological component analysis , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[30]  Junmo Kim,et al.  Constructing Fast Network through Deconstruction of Convolution , 2018, NeurIPS.

[31]  Jan Platos,et al.  Lightweight Spectral–Spatial Squeeze-and- Excitation Residual Bag-of-Features Learning for Hyperspectral Classification , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[32]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Melba M. Crawford,et al.  A Special Issue on Advances in Machine Learning for Remote Sensing and Geosciences [From the Guest Editors] , 2016 .

[34]  Lei Guo,et al.  Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[36]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[37]  Kurt Keutzer,et al.  Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Xudong Kang,et al.  Dual-Path Network-Based Hyperspectral Image Classification , 2019, IEEE Geoscience and Remote Sensing Letters.

[39]  Zhihua Wei,et al.  Mixed Pooling for Convolutional Neural Networks , 2014, RSKT.

[40]  Antonio Plaza,et al.  A new deep convolutional neural network for fast hyperspectral image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[41]  Jian Sun,et al.  Convolutional neural networks at constrained time cost , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  J Anitha,et al.  Machine Learning based Image Processing Techniques for Satellite Image Analysis -A Survey , 2019, 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon).

[43]  S. Macenka,et al.  Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) , 1988 .

[44]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[45]  Javier Plaza,et al.  Neighboring Region Dropout for Hyperspectral Image Classification , 2020, IEEE Geoscience and Remote Sensing Letters.

[46]  Jun Li,et al.  Recent Advances on Spectral–Spatial Hyperspectral Image Classification: An Overview and New Guidelines , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[47]  Dissecting Deep Neural Networks , 2019, ArXiv.

[48]  Xin Huang,et al.  A comparative study of spatial approaches for urban mapping using hyperspectral ROSIS images over Pavia City, northern Italy , 2009 .

[49]  Haihao Lu,et al.  Depth Creates No Bad Local Minima , 2017, ArXiv.

[50]  Timothy A. Warner,et al.  Implementation of machine-learning classification in remote sensing: an applied review , 2018 .

[51]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[52]  Xiao Li,et al.  Machine Learning Paradigms for Speech Recognition: An Overview , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[53]  Jonathan Cheung-Wai Chan,et al.  Multiple Criteria for Evaluating Machine Learning Algorithms for Land Cover Classification from Satellite Data , 2000 .

[54]  Heesung Kwon,et al.  Going Deeper With Contextual CNN for Hyperspectral Image Classification , 2016, IEEE Transactions on Image Processing.

[55]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[56]  Mercedes Eugenia Paoletti,et al.  Deep learning classifiers for hyperspectral imaging: A review , 2019 .

[57]  Nikolaos Doulamis,et al.  Deep Learning for Computer Vision: A Brief Review , 2018, Comput. Intell. Neurosci..

[58]  T. Minor,et al.  QUANTITATIVE COMPARISON OF NEURAL NETWORK AND CONVENTIONAL CLASSIFIERS FOR HYPERSPECTRAL IMAGERY , 1998 .

[59]  Qian Du,et al.  High Performance Computing for Hyperspectral Remote Sensing , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[60]  Stephen K. Babey,et al.  A compact airborne spectrographic imager (CASI) , 1989 .

[61]  Filiberto Pla,et al.  Capsule Networks for Hyperspectral Image Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[62]  Alessandra Russo,et al.  Speech Emotion Classification Using Machine Learning Algorithms , 2008, 2008 IEEE International Conference on Semantic Computing.

[63]  Sen Jia,et al.  Convolutional neural networks for hyperspectral image classification , 2017, Neurocomputing.

[64]  J. V. Van Campenhout On the Peaking of the Hughes Mean Recognition Accuracy: The Resolution of an Apparent Paradox , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[65]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[66]  A F Goetz,et al.  Imaging Spectrometry for Earth Remote Sensing , 1985, Science.

[67]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[68]  Lorenzo Rosasco,et al.  Why and when can deep-but not shallow-networks avoid the curse of dimensionality: A review , 2016, International Journal of Automation and Computing.