Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art

Deep-learning (DL) algorithms, which learn the representative and discriminative features in a hierarchical manner from the data, have recently become a hotspot in the machine-learning area and have been introduced into the geoscience and remote sensing (RS) community for RS big data analysis. Considering the low-level features (e.g., spectral and texture) as the bottom level, the output feature representation from the top level of the network can be directly fed into a subsequent classifier for pixel-based classification. As a matter of fact, by carefully addressing the practical demands in RS applications and designing the input?output levels of the whole network, we have found that DL is actually everywhere in RS data analysis: from the traditional topics of image preprocessing, pixel-based classification, and target recognition, to the recent challenging tasks of high-level semantic feature extraction and RS scene understanding.In this technical tutorial, a general framework of DL for RS data is provided, and the state-of-the-art DL methods in RS are regarded as special cases of input-output data combined with various deep networks and tuning tricks. Although extensive experimental results confirm the excellent performance of the DL-based algorithms in RS big data analysis, even more exciting prospects can be expected for DL in RS. Key bottlenecks and potential directions are also indicated in this article, guiding further research into DL for RS data.

[1]  Zhou Guo,et al.  On combining multiscale deep learning features for the classification of hyperspectral remote sensing imagery , 2015 .

[2]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.

[3]  Gui-Song Xia,et al.  Dirichlet-Derived Multiple Topic Scene Classification Model for High Spatial Resolution Remote Sensing Imagery , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Brian P. Salmon,et al.  Multiview Deep Learning for Land-Use Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[5]  Gang Wang,et al.  Spectral-spatial classification of hyperspectral image using autoencoders , 2013, 2013 9th International Conference on Information, Communications & Signal Processing.

[6]  Bei Zhao,et al.  Scene classification via latent Dirichlet allocation using a hybrid generative/discriminative strategy for high spatial resolution remote sensing imagery , 2013 .

[7]  Lorenzo Bruzzone,et al.  Classification of hyperspectral remote sensing images with support vector machines , 2004, IEEE Transactions on Geoscience and Remote Sensing.

[8]  Xiaoqiang Lu,et al.  Unsupervised feature learning for scene classification of high resolution remote sensing image , 2015, 2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP).

[9]  Andrea Garzelli,et al.  Context-driven fusion of high spatial and spectral resolution images based on oversampled multiresolution analysis , 2002, IEEE Trans. Geosci. Remote. Sens..

[10]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[11]  John R. Jensen,et al.  Introductory Digital Image Processing: A Remote Sensing Perspective , 1986 .

[12]  Luca Maria Gambardella,et al.  Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition , 2010, ArXiv.

[13]  Xing Zhao,et al.  Spectral–Spatial Classification of Hyperspectral Data Based on Deep Belief Network , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[14]  Gang Liu,et al.  Co-segmentation of aircrafts from high-resolution satellite images , 2012, 2012 IEEE 11th International Conference on Signal Processing.

[15]  Johannes R. Sveinsson,et al.  Classification of hyperspectral data from urban areas based on extended morphological profiles , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Jefersson Alex dos Santos,et al.  Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[17]  Gui-Song Xia,et al.  Learning High-level Features for Satellite Image Classification With Limited Labeled Samples , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[18]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Tong Zhang,et al.  Deep Learning Based Feature Selection for Remote Sensing Scene Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[20]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[21]  Jie Li,et al.  Rethinking big data: A review on the data quality and usage issues , 2016 .

[22]  Jason Weston,et al.  A unified architecture for natural language processing: deep neural networks with multitask learning , 2008, ICML '08.

[23]  Shihong Du,et al.  Learning multiscale and deep representations for classifying remotely sensed imagery , 2016 .

[24]  Avik Bhattacharya,et al.  Urban classification using PolSAR data and deep learning , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[25]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.

[26]  Liangpei Zhang,et al.  Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[27]  Shiming Xiang,et al.  Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks , 2014, IEEE Geoscience and Remote Sensing Letters.

[28]  Qian Du,et al.  A joint band prioritization and band-decorrelation approach to band selection for hyperspectral image classification , 1999, IEEE Trans. Geosci. Remote. Sens..

[29]  Mihai Datcu,et al.  Latent Dirichlet Allocation for Spatial Analysis of Satellite Images , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[30]  Shiming Xiang,et al.  Vehicle Detection in Satellite Images by Parallel Deep Convolutional Neural Networks , 2013, 2013 2nd IAPR Asian Conference on Pattern Recognition.

[31]  M. W Gardner,et al.  Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences , 1998 .

[32]  David Haussler,et al.  Unsupervised learning of distributions on binary vectors using two layer networks , 1991, NIPS 1991.

[33]  Hong Sun,et al.  Unsupervised Feature Learning Via Spectral Clustering of Multidimensional Patches for Remotely Sensed Scene Classification , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[34]  Liangpei Zhang,et al.  A pixel shape index coupled with spectral information for classification of high spatial resolution remotely sensed imagery , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[35]  Jon Atli Benediktsson,et al.  Advances in Hyperspectral Image Classification: Earth Monitoring with Statistical Learning Methods , 2013, IEEE Signal Processing Magazine.

[36]  Haipeng Wang,et al.  Application of deep-learning algorithms to MSTAR data , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[37]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[38]  S. P. Hoogendoorn,et al.  Microscopic Traffic Data Collection by Remote Sensing , 2003 .

[39]  Liangpei Zhang,et al.  Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Bor-Chen Kuo,et al.  Feature Mining for Hyperspectral Image Classification , 2013, Proceedings of the IEEE.

[41]  Geoffrey E. Hinton Learning multiple layers of representation , 2007, Trends in Cognitive Sciences.

[42]  Anil M. Cheriyadat,et al.  Unsupervised Feature Learning for Aerial Scene Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[43]  Jon Atli Benediktsson,et al.  Spectral–Spatial Hyperspectral Image Classification With Edge-Preserving Filtering , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[44]  U. Benz,et al.  Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information , 2004 .

[45]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[46]  Nikolaos Doulamis,et al.  Deep supervised learning for hyperspectral data classification through convolutional neural networks , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[47]  Hong Sun,et al.  A comparative study of sampling analysis in scene classification of high-resolution remote sensing imagery , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[48]  Liangpei Zhang,et al.  On Combining Multiple Features for Hyperspectral Remote Sensing Image Classification , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[49]  Liangpei Zhang,et al.  Tensor Discriminative Locality Alignment for Hyperspectral Image Spectral–Spatial Feature Extraction , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[50]  Haiyan Guan,et al.  Rotation-Invariant Object Detection in High-Resolution Satellite Imagery Using Superpixel-Based Deep Hough Forests , 2015, IEEE Geoscience and Remote Sensing Letters.

[51]  Antonio J. Plaza,et al.  Dimensionality reduction and classification of hyperspectral image data using sequences of extended morphological transformations , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[52]  Jonathan Li,et al.  Learning Hierarchical Features for Automated Extraction of Road Markings From 3-D Mobile LiDAR Point Clouds , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[53]  Shuang Wang,et al.  Multilayer feature learning for polarimetric synthetic radar data classification , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[54]  Alexandre Boulch,et al.  Benchmarking classification of earth-observation data: From learning explicit features to convolutional networks , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[55]  Hongyi Liu,et al.  A New Pan-Sharpening Method With Deep Neural Networks , 2015, IEEE Geoscience and Remote Sensing Letters.

[56]  Yong Dou,et al.  Classification of land cover based on deep belief networks using polarimetric RADARSAT-2 data , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[57]  W. Cohen,et al.  Lidar Remote Sensing for Ecosystem Studies , 2002 .

[58]  Uwe Stilla,et al.  Deep Learning Earth Observation Classification Using ImageNet Pretrained Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[59]  Nikos Komodakis,et al.  Building detection in very high resolution multispectral data with deep learning features , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[60]  Hong Sun,et al.  Unsupervised feature coding on local patch manifold for satellite image scene classification , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[61]  Yicong Zhou,et al.  Learning Hierarchical Spectral–Spatial Features for Hyperspectral Image Classification , 2016, IEEE Transactions on Cybernetics.

[62]  J. Chanussot,et al.  Hyperspectral Remote Sensing Data Analysis and Future Challenges , 2013, IEEE Geoscience and Remote Sensing Magazine.

[63]  Carlo Gatta,et al.  Unsupervised Deep Feature Extraction for Remote Sensing Image Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[64]  Shanjun Mao,et al.  Spectral–spatial classification of hyperspectral images using deep convolutional neural networks , 2015 .

[65]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[66]  Ping Zhong,et al.  $L_{1/2}$-Regularized Deconvolution Network for the Representation and Restoration of Optical Remote Sensing Images , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[67]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[68]  T. Herring,et al.  GPS Meteorology: Remote Sensing of Atmospheric Water Vapor Using the Global Positioning System , 1992 .

[69]  Bo Du,et al.  Scene Classification via a Gradient Boosting Random Convolutional Network Framework , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[70]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[71]  Menglong Yan,et al.  Object recognition in remote sensing images using sparse deep belief networks , 2015 .

[72]  Baojun Zhao,et al.  Compressed-Domain Ship Detection on Spaceborne Optical Image Using Deep Neural Network and Extreme Learning Machine , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[73]  Dawei Zai,et al.  Rotation-and-scale-invariant airplane detection in high-resolution satellite images based on deep-Hough-forests , 2016 .

[74]  Domenico Velotto,et al.  Target classification in oceanographic SAR images with deep neural networks: Architecture and initial results , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[75]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[76]  Naif Alajlan,et al.  A hierarchical learning paradigm for semi-supervised classification of remote sensing images , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[77]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[78]  Bernt Schiele,et al.  What Makes for Effective Detection Proposals? , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[79]  Michalis Zervakis,et al.  Deep learning for multi-label land cover classification , 2015, SPIE Remote Sensing.

[80]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[81]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[82]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[83]  Jon Atli Benediktsson,et al.  Very High-Resolution Remote Sensing: Challenges and Opportunities [Point of View] , 2012, Proc. IEEE.

[84]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[85]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[86]  Yihong Gong,et al.  Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[87]  Thomas Serre,et al.  A quantitative theory of immediate visual recognition. , 2007, Progress in brain research.

[88]  Fan Zhang,et al.  Hierarchical feature learning with dropout k-means for hyperspectral image classification , 2016, Neurocomputing.

[89]  Luca Maria Gambardella,et al.  Deep, Big, Simple Neural Nets for Handwritten Digit Recognition , 2010, Neural Computation.

[90]  Bo Du,et al.  Ensemble manifold regularized sparse low-rank approximation for multiview feature embedding , 2015, Pattern Recognit..

[91]  Enhong Chen,et al.  Image Denoising and Inpainting with Deep Neural Networks , 2012, NIPS.

[92]  Naif Alajlan,et al.  A deep learning approach for unsupervised domain adaptation in multitemporal remote sensing images , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[93]  M. Siegel,et al.  Hyperspectral classification via deep networks and superpixel segmentation , 2015 .

[94]  Jie Geng,et al.  Hyperspectral image classification via contextual deep learning , 2015, EURASIP Journal on Image and Video Processing.

[95]  Jon Atli Benediktsson,et al.  Classification and feature extraction for remote sensing images from urban areas based on morphological transformations , 2003, IEEE Trans. Geosci. Remote. Sens..

[96]  Shiming Xiang,et al.  Aircraft Detection by Deep Belief Nets , 2013, 2013 2nd IAPR Asian Conference on Pattern Recognition.

[97]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[98]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[99]  Tara N. Sainath,et al.  Deep Belief Networks using discriminative features for phone recognition , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[100]  Lorenzo Bruzzone,et al.  Deep feature representation for hyperspectral image classification , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[101]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[102]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[103]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[104]  Xin Huang,et al.  A multi-index learning approach for classification of high-resolution remotely sensed images over urban areas , 2014 .

[105]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[106]  Ye Zhang,et al.  Classification of hyperspectral image based on deep belief networks , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[107]  Ryan S. Miller,et al.  Identifying populations potentially exposed to agricultural pesticides using remote sensing and a Geographic Information System. , 1999, Environmental health perspectives.

[108]  Jun Wang,et al.  Road network extraction: a neural-dynamic framework based on deep learning and a finite state machine , 2015 .

[109]  Jie Geng,et al.  High-Resolution SAR Image Classification via Deep Convolutional Autoencoders , 2015, IEEE Geoscience and Remote Sensing Letters.

[110]  Jun Wu,et al.  A Hierarchical Oil Tank Detector With Deep Surrounding Features for High-Resolution Optical Satellite Imagery , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[111]  Vladimir Risojevic,et al.  Unsupervised learning of quaternion features for image classification , 2013, 2013 11th International Conference on Telecommunications in Modern Satellite, Cable and Broadcasting Services (TELSIKS).

[112]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[113]  Jun Huang,et al.  A Novel Bioinspired Multiobjective Optimization Algorithm for Designing Wireless Sensor Networks in the Internet of Things , 2015, J. Sensors.

[114]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[115]  Mihai Datcu,et al.  Semantic Annotation of Satellite Images Using Latent Dirichlet Allocation , 2010, IEEE Geoscience and Remote Sensing Letters.

[116]  Nicolas Courty,et al.  Multiclass feature learning for hyperspectral image classification: sparse and hierarchical solutions , 2015, ArXiv.

[117]  Peijun Du,et al.  Novel segmented stacked autoencoder for effective dimensionality reduction and feature extraction in hyperspectral imaging , 2016, Neurocomputing.

[118]  Jianhua Wang,et al.  Deep hierarchical representation and segmentation of high resolution remote sensing images , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[119]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[120]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[121]  James E. Fowler,et al.  Nearest Regularized Subspace for Hyperspectral Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[122]  Mihai Datcu,et al.  Deep learning in very high resolution remote sensing image information mining communication concept , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[123]  Thomas Hofmann,et al.  Greedy Layer-Wise Training of Deep Networks , 2007 .

[124]  Zhenfeng Shao,et al.  High-resolution remote-sensing imagery retrieval using sparse features by auto-encoder , 2015 .

[125]  Lei Guo,et al.  Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning , 2015, IEEE Transactions on Geoscience and Remote Sensing.