Meta-analysis of deep neural networks in remote sensing: A comparative study of mono-temporal classification to support vector machines

Abstract Deep learning methods have recently found widespread adoption for remote sensing tasks, particularly in image or pixel classification. Their flexibility and versatility has enabled researchers to propose many different designs to process remote sensing data in all spectral, spatial, and temporal dimensions. In most of the reported cases they surpass their non-deep rivals in overall classification accuracy. However, there is considerable diversity in implementation details in each case and a systematic quantitative comparison to non-deep classifiers does not exist. In this paper, we look at the major research papers that have studied deep learning image classifiers in recent years and undertake a meta-analysis on their performance compared to the most used non-deep rival, Support Vector Machine (SVM) classifiers. We focus on mono-temporal classification as the time-series image classification did not offer sufficient samples. Our work covered 103 manuscripts and included 92 cases that supported direct accuracy comparisons between deep learners and SVMs. Our general findings are the following: (i) Deep networks have better performance than non-deep spectral SVM implementations, with Convolutional Neural Networks (CNNs) performing better than other deep learners. This advantage, however, diminishes when feeding SVM with richer features extracted from data (e.g. spatial filters). (ii) Transfer learning and fine-tuning on pre-trained CNNs are offering promising results over spectral or enhanced SVM, however these pre-trained networks are currently limited to RGB input data, therefore currently lack applicability in multi/hyperspectral data. (iii) There is no strong relationship between network complexity and accuracy gains over SVM; small to medium networks perform similarly to more complex networks. (iv) Contrary to the popular belief, there are numerous cases of high deep networks performance with training proportions of 10% or less. Our study also indicates that the new generation of classifiers is often overperforming existing benchmark datasets, with accuracies surpassing 99%. There is a clear need for new benchmark dataset collections with diverse spectral, spatial and temporal resolutions and coverage that will enable us to study the design generalizations, challenge these new classifiers, and further advance remote sensing science. Our community could also benefit from a coordinated effort to create a large pre-trained network specifically designed for remote sensing images that users could later fine-tune and adjust to their study specifics.

[1]  Lingfeng Wang,et al.  Semantic Labeling in Very High Resolution Images via a Self-Cascaded Convolutional Neural Network , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[2]  Giorgos Mountrakis,et al.  A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research , 2016 .

[3]  Uwe Stilla,et al.  Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[4]  Ying Li,et al.  Convolutional Neural Networks Based Hyperspectral Image Classification Method with Adaptive Kernels , 2017, Remote. Sens..

[5]  Qian Du,et al.  Hyperspectral Image Classification Using Deep Pixel-Pair Features , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Xin Pan,et al.  An object-based convolutional neural network (OCNN) for urban land use classification , 2018, Remote Sensing of Environment.

[7]  Yan Zhou,et al.  Fast Automatic Airport Detection in Remote Sensing Images Using Convolutional Neural Networks , 2018, Remote. Sens..

[8]  Dawei Zai,et al.  Rotation-and-scale-invariant airplane detection in high-resolution satellite images based on deep-Hough-forests , 2016 .

[9]  Lizhe Wang,et al.  A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[10]  Junyu Dong,et al.  Encoding Spectral and Spatial Context Information for Hyperspectral Image Classification , 2017, IEEE Geoscience and Remote Sensing Letters.

[11]  Ronald Kemker,et al.  Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[12]  Shiyong Cui,et al.  BUILDING EXTRACTION FROM REMOTE SENSING DATA USING FULLY CONVOLUTIONAL NETWORKS , 2017 .

[13]  Lorenzo Bruzzone,et al.  Deep feature representation for hyperspectral image classification , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[14]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[15]  Weihua Su,et al.  Deep Filter Banks for Land-Use Scene Classification , 2016, IEEE Geoscience and Remote Sensing Letters.

[16]  Yong Dou,et al.  Region-based convolutional neural networks for object detection in very high resolution remote sensing images , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[17]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[19]  Mohammed Bennamoun,et al.  Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.

[21]  Shiming Xiang,et al.  Aircraft Detection by Deep Belief Nets , 2013, 2013 2nd IAPR Asian Conference on Pattern Recognition.

[22]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[23]  Xiuwen Liu,et al.  Land Cover Classification from Multi-temporal, Multi-spectral Remotely Sensed Imagery using Patch-Based Recurrent Neural Networks , 2017, Neural Networks.

[24]  Zhenfeng Shao,et al.  High-resolution remote-sensing imagery retrieval using sparse features by auto-encoder , 2015 .

[25]  Xiao Xiang Zhu,et al.  A Self-Improving Convolution Neural Network for the Classification of Hyperspectral Data , 2016, IEEE Geoscience and Remote Sensing Letters.

[26]  Ping Zhong,et al.  An Unsupervised Convolutional Feature Fusion Network for Deep Representation of Remote Sensing Images , 2018, IEEE Geoscience and Remote Sensing Letters.

[27]  Amy Loutfi,et al.  Classification and Segmentation of Satellite Orthoimagery Using Convolutional Neural Networks , 2016, Remote. Sens..

[28]  Giorgos Mountrakis,et al.  Effect of classifier selection, reference sample size, reference class distribution and scene heterogeneity in per-pixel classification accuracy using 26 Landsat sites , 2018 .

[29]  Maoguo Gong,et al.  Superpixel-Based Difference Representation Learning for Change Detection in Multispectral Remote Sensing Images , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Li Deng,et al.  A tutorial survey of architectures, algorithms, and applications for deep learning , 2014, APSIPA Transactions on Signal and Information Processing.

[31]  Xin Pan,et al.  A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[32]  Chunhui Zhao,et al.  Spectral-Spatial Classification of Hyperspectral Imagery Based on Stacked Sparse Autoencoder and Random Forest , 2017 .

[33]  Zhou Guo,et al.  On combining multiscale deep learning features for the classification of hyperspectral remote sensing imagery , 2015 .

[34]  Antonio Plaza,et al.  A new deep convolutional neural network for fast hyperspectral image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[35]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[36]  Emile Ndikumana,et al.  Deep Recurrent Neural Network for Agricultural Classification using multitemporal SAR Sentinel-1 for Camargue, France , 2018, Remote. Sens..

[37]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[38]  M. Körner,et al.  MULTI-TEMPORAL LAND COVER CLASSIFICATION WITH LONG SHORT-TERM MEMORY NEURAL NETWORKS , 2017 .

[39]  Zhenwei Shi,et al.  MugNet: Deep learning for hyperspectral image classification using limited samples , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[40]  Michele Volpi,et al.  Land cover mapping at very high resolution with rotation equivariant CNNs: towards small yet accurate models , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[41]  Patrick Lambert,et al.  3-D Deep Learning Approach for Remote Sensing Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[42]  William J. Emery,et al.  Object-Based Convolutional Neural Network for High-Resolution Imagery Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[43]  Plamen P. Angelov,et al.  A Massively Parallel Deep Rule-Based Ensemble Classifier for Remote Sensing Scenes , 2018, IEEE Geoscience and Remote Sensing Letters.

[44]  Tong Zhang,et al.  Deep Learning Based Feature Selection for Remote Sensing Scene Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[45]  Wenzhong Guo,et al.  Land-Use Classification via Extreme Learning Classifier Based on Deep Convolutional Features , 2017, IEEE Geoscience and Remote Sensing Letters.

[46]  Qingshan Liu,et al.  Cascaded Recurrent Neural Networks for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[47]  Peijun Du,et al.  A review of supervised object-based land-cover image classification , 2017 .

[48]  Angshul Majumdar,et al.  Deep Dictionary Learning vs Deep Belief Network vs Stacked Autoencoder: An Empirical Analysis , 2016, ICONIP.

[49]  Yun Zhang,et al.  Deep Convolutional Neural Network for Complex Wetland Classification Using Optical Remote Sensing Imagery , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[50]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[51]  Naoto Yokoya,et al.  Advanced Multisource Optical Remote Sensing for Urban Land Use and Land Cover Classification [Technical Committees] , 2018 .

[52]  Miaozhong Xu,et al.  DenseNet-Based Depth-Width Double Reinforced Deep Learning Neural Network for High-Resolution Remote Sensing Image Per-Pixel Classification , 2018, Remote. Sens..

[53]  Bo Du,et al.  Scene Classification via a Gradient Boosting Random Convolutional Network Framework , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[54]  Jun Li,et al.  Advanced Spectral Classifiers for Hyperspectral Images: A review , 2017, IEEE Geoscience and Remote Sensing Magazine.

[55]  Shawn D. Newsam,et al.  Learning Low Dimensional Convolutional Neural Networks for High-Resolution Remote Sensing Image Retrieval , 2016, Remote. Sens..

[56]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[57]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[58]  Yansheng Li,et al.  Unsupervised Spectral–Spatial Feature Learning With Stacked Sparse Autoencoder for Hyperspectral Imagery Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[59]  Dino Ienco,et al.  APPLICATION OF DEEP LEARNING OF MULTI-TEMPORAL SENTINEL-1 IMAGES FOR THE CLASSIFICATION OF COASTAL VEGETATION ZONE OF THE DANUBE DELTA , 2018 .

[60]  Xiao Xiang Zhu,et al.  FusioNet: A two-stream convolutional neural network for urban scene classification using PolSAR and hyperspectral data , 2017, 2017 Joint Urban Remote Sensing Event (JURSE).

[61]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.

[62]  Jungho Im,et al.  ISPRS Journal of Photogrammetry and Remote Sensing , 2022 .

[63]  Yanfei Liu,et al.  Scene Classification Based on a Deep Random-Scale Stretched Convolutional Neural Network , 2018, Remote. Sens..

[64]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[65]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[66]  Haokui Zhang,et al.  Spectral-spatial classification of hyperspectral imagery using a dual-channel convolutional neural network , 2017 .

[67]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[68]  Jie Geng,et al.  Hyperspectral image classification via contextual deep learning , 2015, EURASIP Journal on Image and Video Processing.

[69]  Qi Zhou,et al.  Application of a parallel spectral–spatial convolution neural network in object-oriented remote sensing land use classification , 2018 .

[70]  Shanjun Mao,et al.  Spectral–spatial classification of hyperspectral images using deep convolutional neural networks , 2015 .

[71]  Cheng Shi,et al.  Superpixel-based 3D deep neural networks for hyperspectral image classification , 2018, Pattern Recognit..

[72]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[73]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[74]  Brian P. Salmon,et al.  Multiview Deep Learning for Land-Use Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[75]  Xiangwen Liao,et al.  Land-use scene classification based on a CNN using a constrained extreme learning machine , 2018 .

[76]  Baihua Xiao,et al.  Multimodal Ground-Based Cloud Classification Using Joint Fusion Convolutional Neural Network , 2018, Remote. Sens..

[77]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[78]  Carlo Gatta,et al.  Unsupervised Deep Feature Extraction for Remote Sensing Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[79]  Congxin Liu,et al.  Satellite Imagery Classification Based on Deep Convolution Network , 2016 .

[80]  Kim-Kwang Raymond Choo,et al.  SVM or deep learning? A comparative study on remote sensing image classification , 2016, Soft Computing.

[81]  Xiao Xiang Zhu,et al.  Unsupervised Spectral–Spatial Feature Learning via Deep Residual Conv–Deconv Network for Hyperspectral Image Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[82]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[83]  Qian Du,et al.  Multisource Remote Sensing Data Classification Based on Convolutional Neural Network , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[84]  Berrin A. Yanikoglu,et al.  Deep Learning With Attribute Profiles for Hyperspectral Image Classification , 2016, IEEE Geoscience and Remote Sensing Letters.

[85]  Jie Geng,et al.  High-Resolution SAR Image Classification via Deep Convolutional Autoencoders , 2015, IEEE Geoscience and Remote Sensing Letters.

[86]  Hao Wu,et al.  Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification , 2018, IEEE Transactions on Image Processing.

[87]  Jun Wu,et al.  A Hierarchical Oil Tank Detector With Deep Surrounding Features for High-Resolution Optical Satellite Imagery , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[88]  Nikos Komodakis,et al.  Building detection in very high resolution multispectral data with deep learning features , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[89]  Yudong Zhang,et al.  Polarimetric synthetic aperture radar image segmentation by convolutional neural network using graphical processing units , 2017, Journal of Real-Time Image Processing.

[90]  Aamir Saeed Malik,et al.  Scene classification for aerial images based on CNN using sparse coding technique , 2017 .

[91]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[92]  Bo Huang,et al.  Urban land-use mapping using a deep convolutional neural network with high spatial resolution multispectral remote sensing imagery , 2018, Remote Sensing of Environment.

[93]  Shihong Du,et al.  Learning multiscale and deep representations for classifying remotely sensed imagery , 2016 .

[94]  Gang Fu,et al.  Classification for High Resolution Remote Sensing Imagery Using a Fully Convolutional Network , 2017, Remote. Sens..

[95]  Lichao Mou,et al.  Learning a Transferable Change Rule from a Recurrent Neural Network for Land Cover Change Detection , 2016, Remote. Sens..

[96]  Yuanyuan Liu,et al.  Deep Salient Feature Based Anti-Noise Transfer Network for Scene Classification of Remote Sensing Imagery , 2018, Remote. Sens..

[97]  Michele Volpi,et al.  Dense Semantic Labeling of Subdecimeter Resolution Images With Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[98]  Shiming Xiang,et al.  Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks , 2014, IEEE Geoscience and Remote Sensing Letters.

[99]  Jon Atli Benediktsson,et al.  Advances in Hyperspectral Image Classification: Earth Monitoring with Statistical Learning Methods , 2013, IEEE Signal Processing Magazine.

[100]  Yun Shi,et al.  3D Convolutional Neural Networks for Crop Classification with Multi-Temporal Remote Sensing Images , 2018, Remote. Sens..

[101]  Lin Lei,et al.  Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining , 2017, Sensors.

[102]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[103]  Jun Li,et al.  Active Learning With Convolutional Neural Networks for Hyperspectral Image Classification Using a New Bayesian Approach , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[104]  Xin Pan,et al.  VPRS-Based Regional Decision Fusion of CNN and MRF Classifications for Very Fine Resolution Remotely Sensed Images , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[105]  Xing Chen,et al.  Stacked Denoise Autoencoder Based Feature Extraction and Classification for Hyperspectral Images , 2016, J. Sensors.

[106]  Peijun Du,et al.  Novel segmented stacked autoencoder for effective dimensionality reduction and feature extraction in hyperspectral imaging , 2016, Neurocomputing.

[107]  Lei Guo,et al.  Remote Sensing Image Scene Classification Using Bag of Convolutional Features , 2017, IEEE Geoscience and Remote Sensing Letters.

[108]  Jefersson Alex dos Santos,et al.  Towards better exploiting convolutional neural networks for remote sensing scene classification , 2016, Pattern Recognit..

[109]  Harish Bhaskar,et al.  Supervised remote sensing image segmentation using boosted convolutional neural networks , 2016, Knowl. Based Syst..

[110]  Supratik Mukhopadhyay,et al.  DeepSat: a learning framework for satellite imagery , 2015, SIGSPATIAL/GIS.

[111]  Yurong Liu,et al.  A survey of deep neural network architectures and their applications , 2017, Neurocomputing.

[112]  Jun Wang,et al.  Road network extraction: a neural-dynamic framework based on deep learning and a finite state machine , 2015 .

[113]  Yoshihiko Mochizuki,et al.  Surface object recognition with CNN and SVM in Landsat 8 images , 2015, 2015 14th IAPR International Conference on Machine Vision Applications (MVA).

[114]  Bertrand Le Saux,et al.  Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks , 2016, ACCV.

[115]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.