A review of deep learning methods for semantic segmentation of remote sensing imagery

Abstract Semantic segmentation of remote sensing imagery has been employed in many applications and is a key research topic for decades. With the success of deep learning methods in the field of computer vision, researchers have made a great effort to transfer their superior performance to the field of remote sensing image analysis. This paper starts with a summary of the fundamental deep neural network architectures and reviews the most recent developments of deep learning methods for semantic segmentation of remote sensing imagery including non-conventional data such as hyperspectral images and point clouds. In our review of the literature, we identified three major challenges faced by researchers and summarize the innovative development to address them. As tremendous efforts have been devoted to advancing pixel-level accuracy, the emerged deep learning methods demonstrated much-improved performance on several public data sets. As to handling the non-conventional, unstructured point cloud and rich spectral imagery, the performance of the state-of-the-art methods is, on average, inferior to that of the satellite imagery. Such a performance gap also exists in learning from small data sets. In particular, the limited non-conventional remote sensing data sets with labels is an obstacle to developing and evaluating new deep learning methods.

[1]  Nataliia Kussul,et al.  Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[2]  Lianru Gao,et al.  High-Resolution Aerial Imagery Semantic Labeling with Dense Pyramid Network , 2018, Sensors.

[3]  Bertrand Le Saux,et al.  Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[4]  Jiangyun Li,et al.  Efficient Patch-Wise Semantic Segmentation for Large-Scale Remote Sensing Images , 2018, Sensors.

[5]  Lingfeng Wang,et al.  Semantic Labeling in Very High Resolution Images via a Self-Cascaded Convolutional Neural Network , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[6]  Ronald Kemker,et al.  Self-Taught Feature Learning for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Bertrand Le Saux,et al.  Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks , 2016, ACCV.

[8]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[9]  Xiao Xiang Zhu,et al.  A Self-Improving Convolution Neural Network for the Classification of Hyperspectral Data , 2016, IEEE Geoscience and Remote Sensing Letters.

[10]  Shihong Du,et al.  Learning multiscale and deep representations for classifying remotely sensed imagery , 2016 .

[11]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[12]  Yu Liu,et al.  A review of semantic segmentation using deep neural networks , 2017, International Journal of Multimedia Information Retrieval.

[13]  Geoffrey E. Hinton,et al.  Machine Learning for Aerial Image Labeling , 2013 .

[14]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[15]  Jian Sun,et al.  Guided Image Filtering , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Biao Hou,et al.  Patch-Sorted Deep Feature Learning for High Resolution SAR Image Classification , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[17]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[18]  Markus Gerke,et al.  The ISPRS benchmark on urban object classification and 3D building reconstruction , 2012 .

[19]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Wei Lee Woon,et al.  Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks , 2017 .

[21]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[22]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[23]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[24]  Menglong Yan,et al.  Semantic Segmentation of Aerial Images With Shuffling Convolutional Neural Networks , 2018, IEEE Geoscience and Remote Sensing Letters.

[25]  François Chollet,et al.  Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Xiaorui Ma,et al.  Semisupervised classification for hyperspectral image based on multi-decision labeling and deep feature learning , 2016 .

[27]  Wei Li,et al.  DeepUNet: A Deep Fully Convolutional Network for Pixel-Level Sea-Land Segmentation , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[28]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[29]  Zhenwei Shi,et al.  MugNet: Deep learning for hyperspectral image classification using limited samples , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[30]  Wei Yuan,et al.  Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks , 2018, Remote. Sens..

[31]  Gang Fu,et al.  Classification for High Resolution Remote Sensing Imagery Using a Fully Convolutional Network , 2017, Remote. Sens..

[32]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Li Wang,et al.  Fusion of images and point clouds for the semantic segmentation of large-scale 3D scenes based on deep learning , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[34]  Saad Rehman,et al.  Convolutional neural network based image segmentation: a review , 2018, Defense + Security.

[35]  Peter Caccetta,et al.  ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[36]  Xiao Xiang Zhu,et al.  Deep learning in remote sensing: a review , 2017, ArXiv.

[37]  Jie Gong,et al.  Automated residential building detection from airborne LiDAR data with deep neural networks , 2018, Adv. Eng. Informatics.

[38]  Yongyang Xu,et al.  Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters , 2018, Remote. Sens..

[39]  Xiaohui Yuan,et al.  Automatic Urban Water-Body Detection and Segmentation From Sparse ALSM Data via Spatially Constrained Model-Driven Clustering , 2011, IEEE Geoscience and Remote Sensing Letters.

[40]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[41]  Rameen Abdal,et al.  UFCN: a fully convolutional neural network for road extraction in RGB imagery acquired by remote sensing from an unmanned aerial vehicle , 2018 .

[42]  Reda ElHakim,et al.  A Deep CNN-Based Framework For Enhanced Aerial Imagery Registration with Applications to UAV Geolocalization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[43]  Licheng Jiao,et al.  Divide-and-Conquer Dual-Architecture Convolutional Neural Network for Classification of Hyperspectral Images , 2019, Remote. Sens..

[44]  Ronald Kemker,et al.  Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[45]  George Papandreou,et al.  Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[46]  Alexandre Boulch,et al.  Distance transform regression for spatially-aware deep semantic segmentation , 2019, Comput. Vis. Image Underst..

[47]  Simon Hessner,et al.  Image Style Transfer using Convolutional Neural Networks , 2018 .

[48]  Yuantao Jiang Research on road extraction of remote sensing image based on convolutional neural network , 2019, EURASIP J. Image Video Process..

[49]  M. F. Baumgardner,et al.  220 Band AVIRIS Hyperspectral Image Data Set: June 12, 1992 Indian Pine Test Site 3 , 2015 .

[50]  Mohamed Farah,et al.  Hyperspectral imagery classification based on semi-supervised 3-D deep neural network and adaptive band selection , 2019, Expert Syst. Appl..

[51]  Hasan Asy'ari Arief,et al.  Land Cover Segmentation of Airborne LiDAR Data Using Stochastic Atrous Network , 2018, Remote. Sens..

[52]  Jie Geng,et al.  SAR Image Classification via Deep Recurrent Encoding Neural Networks , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[53]  Bo Du,et al.  Hyperspectral image classification via a random patches network , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[54]  Naif Alajlan,et al.  Siamese-GAN: Learning Invariant Representations for Aerial Vehicle Image Categorization , 2018, Remote. Sens..

[55]  Clément Mallet,et al.  SEMANTIC SEGMENTATION OF FOREST STANDS OF PURE SPECIES AS A GLOBAL OPTIMIZATION PROBLEM , 2017 .

[56]  Ronald Kemker,et al.  Low-Shot Learning for the Semantic Segmentation of Remote Sensing Imagery , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[57]  R. P. Singh,et al.  Automatic semantic segmentation and classification of remote sensing data for agriculture , 2018, Mathematical Models in Engineering.

[58]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Thorsten Hoeser,et al.  Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review-Part I: Evolution and Recent Trends , 2020, Remote. Sens..

[60]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  Seyed Majid Azimi,et al.  Road Segmentation in SAR Satellite Images With Deep Fully Convolutional Neural Networks , 2018, IEEE Geoscience and Remote Sensing Letters.

[62]  Lijun Xie,et al.  A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data , 2018, Pattern Recognit..

[63]  Leon A. Gatys,et al.  Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64]  Robert Roschlaub,et al.  Towards HD Maps from Aerial Imagery: Robust Lane Marking Segmentation Using Country-Scale Imagery , 2018, ISPRS Int. J. Geo Inf..

[65]  Xiaohui Yuan,et al.  Weakly Supervised Deep Learning for Objects Detection from Images , 2020 .

[66]  Zhidong Deng,et al.  Recent progress in semantic image segmentation , 2018, Artificial Intelligence Review.

[67]  Pierre Alliez,et al.  High-Resolution Aerial Image Labeling With Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[68]  Michael Kampffmeyer,et al.  Semantic Segmentation of Small Objects and Modeling of Uncertainty in Urban Remote Sensing Images Using Deep Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[69]  Ying Wang,et al.  Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images , 2017, Remote. Sens..

[70]  Wenzhuo Song,et al.  A deep residual learning serial segmentation network for extracting buildings from remote sensing imagery , 2020, International Journal of Remote Sensing.

[71]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[72]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[73]  Derek T. Anderson,et al.  Comprehensive survey of deep learning in remote sensing: theories, tools, and challenges for the community , 2017 .

[74]  Xueliang Zhang,et al.  Deep learning in remote sensing applications: A meta-analysis and review , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[75]  Xinchang Zhang,et al.  Developing a multi-filter convolutional neural network for semantic segmentation using high-resolution aerial imagery and LiDAR data , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[76]  Wei Sun,et al.  Methods and datasets on semantic segmentation: A review , 2018, Neurocomputing.

[77]  Ian D. Reid,et al.  RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[78]  Bin Yang,et al.  PIXOR: Real-time 3D Object Detection from Point Clouds , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[79]  Yixin Liu,et al.  A Novel Method of Hyperspectral Data Classification Based on Transfer Learning and Deep Belief Network , 2019 .

[80]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[81]  Jonathan Cheung-Wai Chan,et al.  Hyperspectral Images Classification Based on Dense Convolutional Networks with Spectral-Wise Attention Mechanism , 2019, Remote. Sens..

[82]  Michele Volpi,et al.  Semantic segmentation of urban scenes by learning local class interactions , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[83]  Xiaohui Yuan,et al.  A Unified Coherent-Incoherent Target Decomposition Method for Polarimetric SAR , 2020 .

[84]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[85]  Michele Volpi,et al.  Deep multi-task learning for a geographically-regularized semantic segmentation of aerial images , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[86]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87]  Weiwei Sun,et al.  Fully Convolutional Networks for Semantic Segmentation of Very High Resolution Remotely Sensed Images Combined With DSM , 2018, IEEE Geoscience and Remote Sensing Letters.

[88]  Chenming Li,et al.  A Hyperspectral Image Classification Method Based on Multi-Discriminator Generative Adversarial Networks , 2019, Sensors.

[89]  Uwe Stilla,et al.  SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS , 2016 .

[90]  Amy Loutfi,et al.  Classification and Segmentation of Satellite Orthoimagery Using Convolutional Neural Networks , 2016, Remote. Sens..

[91]  Bing Liu,et al.  Deep convolutional recurrent neural network with transfer learning for hyperspectral image classification , 2018, Journal of Applied Remote Sensing.

[92]  Lu Wang,et al.  Urban Land-Use Classification From Photographs , 2018, IEEE Geoscience and Remote Sensing Letters.

[93]  Kun Zhu,et al.  Symmetrical Dense-Shortcut Deep Fully Convolutional Networks for Semantic Segmentation of Very-High-Resolution Remote Sensing Images , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[94]  Carl Salvaggio,et al.  A Fully Convolutional Network for Semantic Labeling of 3D Point Clouds , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[95]  Sergio Benini,et al.  Deep Learning Meets Hyperspectral Image Analysis: A Multidisciplinary Review , 2019, J. Imaging.

[96]  Uwe Stilla,et al.  Classification With an Edge: Improving Semantic Image Segmentation with Boundary Detection , 2016, ISPRS Journal of Photogrammetry and Remote Sensing.

[97]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[98]  Hexin Zhang,et al.  Improvements in sparse non-negative matrix factorization for hyperspectral unmixing algorithms , 2018, Journal of Applied Remote Sensing.

[99]  Qingjie Liu,et al.  Road Extraction by Deep Residual U-Net , 2017, IEEE Geoscience and Remote Sensing Letters.

[100]  Gaofeng Meng,et al.  FusionNet: Edge Aware Deep Convolutional Networks for Semantic Segmentation of Remote Sensing Harbor Images , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[101]  Sen Jia,et al.  Convolutional neural networks for hyperspectral image classification , 2017, Neurocomputing.

[102]  J. Strobl,et al.  Object-Oriented Image Processing in an Integrated GIS/Remote Sensing Environment and Perspectives for Environmental Applications , 2000 .

[103]  Xiaohui Yuan,et al.  Adaptive Coherency Matrix Estimation for Polarimetric SAR Imagery Based on Local Heterogeneity Coefficients , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[104]  Jie Geng,et al.  High-Resolution SAR Image Classification via Deep Convolutional Autoencoders , 2015, IEEE Geoscience and Remote Sensing Letters.

[105]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[106]  Ying Li,et al.  Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network , 2017, Remote. Sens..

[107]  Jefersson Alex dos Santos,et al.  Do deep features generalize from everyday objects to remote sensing and aerial scenes domains? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[108]  Yu Liu,et al.  Hourglass-ShapeNetwork Based Semantic Segmentation for High Resolution Aerial Imagery , 2017, Remote. Sens..

[109]  José García Rodríguez,et al.  A survey on deep learning techniques for image and video semantic segmentation , 2018, Appl. Soft Comput..

[110]  Xiangyu Zhang,et al.  ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[111]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[112]  Ronan Collobert,et al.  Learning to Refine Object Segments , 2016, ECCV.

[113]  Pierre Alliez,et al.  Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark , 2017, 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[114]  Shihong Du,et al.  Spectral–Spatial Feature Extraction for Hyperspectral Image Classification: A Dimension Reduction and Deep Learning Approach , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[115]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[116]  Antonio Plaza,et al.  A new deep convolutional neural network for fast hyperspectral image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[117]  Vibhav Vineet,et al.  Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[118]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[119]  Sebastian Ruder,et al.  An Overview of Multi-Task Learning in Deep Neural Networks , 2017, ArXiv.