Simplified object-based deep neural network for very high resolution remote sensing image classification

Abstract For the object-based classification of high resolution remote sensing images, many people expect that introducing deep learning methods can improve then classification accuracy. Unfortunately, the input shape for deep neural networks (DNNs) is usually rectangular, whereas the shapes of the segments output by segmentation methods are usually according to the corresponding ground objects; this inconsistency can lead to confusion among different types of heterogeneous content when a DNN processes a segment. Currently, most object-based methods utilizing convolutional neural networks (CNNs) adopt additional models to overcome the detrimental influence of such heterogeneous content; however, these heterogeneity suppression mechanisms introduce additional complexity into the whole classification process, and these methods are usually unstable and difficult to use in real applications. To address the above problems, this paper proposes a simplified object-based deep neural network (SO-DNN) for very high resolution remote sensing image classification. In SO-DNN, a new segment category label inference method is introduced, in which a deep semantic segmentation neural network (DSSNN) is used as the classification model instead of a traditional CNN. Since the DSSNN can obtain a category label for each pixel in the input image patch, different types of content are not mixed together; therefore, SO-DNN does not require an additional heterogeneity suppression mechanism. Moreover, SO-DNN includes a sample information optimization method that allows the DSSNN model to be trained using only pixel-based training samples. Because only a single model is used and only a pixel-based training set is needed, the whole classification process of SO-DNN is relatively simple and direct. In experiments, we use very high-resolution aerial images from Vaihingen and Potsdam from the ISPRS WG II/4 dataset as test data and compare SO-DNN with 6 traditional methods: O-MLP, O+CNN, OHSF-CNN, 2-CNN, JDL and U-Net. Compared with the best-performing method among these traditional methods, the classification accuracy of SO-DNN is improved by up to 7.71% and 10.78% for single images from Vaihingen and Potsdam, respectively, and the average classification accuracy is improved by 2.46% and 2.91% for the Vaihingen and Potsdam images, respectively. SO-DNN relies on fewer models and easier-to-obtain samples than traditional methods, and its stable performance makes SO-DNN more valuable for practical applications.

[1]  Mathieu Salzmann,et al.  Geometry-Aware Deep Recurrent Neural Networks for Hyperspectral Image Classification , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Markus Gerke,et al.  The ISPRS benchmark on urban object classification and 3D building reconstruction , 2012 .

[3]  Yao Zhao,et al.  Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Jie Jiang,et al.  RWSNet: a semantic segmentation network based on SegNet combined with random walk for remote sensing , 2020, International Journal of Remote Sensing.

[5]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Mingsheng Liao,et al.  Lifting Scheme-Based Deep Neural Network for Remote Sensing Scene Classification , 2019, Remote. Sens..

[7]  Huimin Ma,et al.  Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Bolei Zhou,et al.  Learning Deep Features for Discriminative Localization , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Xin Pan,et al.  Joint Deep Learning for land cover and land use classification , 2019, Remote Sensing of Environment.

[10]  Yang Chen,et al.  Object-based multi-modal convolution neural networks for building extraction using panchromatic and multispectral imagery , 2020, Neurocomputing.

[11]  William J. Emery,et al.  Object-Based Convolutional Neural Network for High-Resolution Imagery Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Brian K. Gelder,et al.  Exploring multiscale object-based convolutional neural network (multi-OCNN) for remote sensing image classification at high spatial resolution , 2020 .

[13]  Geoffrey J. Hay,et al.  Object-based change detection , 2012 .

[14]  Fang Qiu,et al.  Developing an Object-based Hyperspatial Image Classifier with a Case Study Using WorldView-2 Data , 2013 .

[15]  Daniela O Medley,et al.  Deep Active Shape Model for Robust Object Fitting , 2020, IEEE Transactions on Image Processing.

[16]  Peng Yue,et al.  A multi-level context-guided classification method with object-based convolutional neural network for land cover classification using very high resolution remote sensing images , 2020, Int. J. Appl. Earth Obs. Geoinformation.

[17]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[18]  Liang Xiao,et al.  Optimizing multiscale segmentation with local spectral heterogeneity measure for high resolution remote sensing images , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[19]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[20]  Xin Pan,et al.  An object-based convolutional neural network (OCNN) for urban land use classification , 2018, Remote Sensing of Environment.

[21]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Harri Valpola,et al.  Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[23]  J. A. de Jesús Osuna-Coutiño,et al.  Structure extraction in urbanized aerial images from a single view using a CNN-based approach , 2020, International Journal of Remote Sensing.

[24]  Wenfeng Luo,et al.  Weakly-supervised semantic segmentation with saliency and incremental supervision updating , 2021, Pattern Recognit..

[25]  Yang Wang,et al.  Deep Discriminative Representation Learning with Attention Map for Scene Classification , 2019, Remote. Sens..

[26]  Yang Shao,et al.  Assessing Deep Convolutional Neural Networks and Assisted Machine Perception for Urban Mapping , 2021, Remote. Sens..

[27]  Xiao Li,et al.  Integrating spectral variability and spatial distribution for object-based image analysis using curve matching approaches , 2020 .

[28]  Xin Pan,et al.  A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[29]  Min Wang,et al.  Very high resolution remote sensing image classification with SEEDS-CNN and scale effect analysis for superpixel CNN classification , 2018, International Journal of Remote Sensing.

[30]  Huimin Yan,et al.  A Deep Convolution Neural Network Method for Land Cover Mapping: A Case Study of Qinhuangdao, China , 2018, Remote. Sens..

[31]  Peter M. Atkinson,et al.  Scale Sequence Joint Deep Learning (SS-JDL) for land use and land cover classification , 2020, Remote Sensing of Environment.

[32]  Raymond Y. K. Lau,et al.  Road Detection and Centerline Extraction Via Deep Recurrent Convolutional Neural Network U-Net , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Yansheng Li,et al.  Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation , 2021 .

[35]  Qingming Zhan,et al.  Urban land use extraction from Very High Resolution remote sensing imagery using a Bayesian network , 2016 .

[36]  Jianjun Zhu,et al.  A New Crop Classification Method Based on the Time-Varying Feature Curves of Time Series Dual-Polarization Sentinel-1 Data Sets , 2020, IEEE Geoscience and Remote Sensing Letters.

[37]  Xiaolei Zhao,et al.  Residual Dense Network Based on Channel-Spatial Attention for the Scene Classification of a High-Resolution Remote Sensing Image , 2020, Remote. Sens..

[38]  Xin Pan,et al.  An object-based and heterogeneous segment filter convolutional neural network for high-resolution remote sensing image classification , 2019, International Journal of Remote Sensing.

[39]  Zhe Zhu,et al.  Understanding an urbanizing planet: Strategic directions for remote sensing , 2019, Remote Sensing of Environment.

[40]  Hongfeng You,et al.  Pixel-Level Remote Sensing Image Recognition Based on Bidirectional Word Vectors , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Claudio Persello,et al.  Deep Fully Convolutional Networks for Cadastral Boundary Detection from UAV Images , 2019, Remote. Sens..

[42]  Wei Lee Woon,et al.  Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks , 2017 .

[43]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[44]  Ana Cristina Murillo,et al.  Coral-Segmentation: Training Dense Labeling Models with Sparse Ground Truth , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[45]  Dongmei Chen,et al.  Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[46]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[47]  Peijun Du,et al.  A review of supervised object-based land-cover image classification , 2017 .

[48]  Onkar Dikshit,et al.  Feature extraction for hyperspectral image classification: a review , 2020, International Journal of Remote Sensing.

[49]  Fei-Fei Li,et al.  What's the Point: Semantic Segmentation with Point Supervision , 2015, ECCV.

[50]  Xiao Xiang Zhu,et al.  Semantic Segmentation of Remote Sensing Images With Sparse Annotations , 2021, IEEE Geoscience and Remote Sensing Letters.

[51]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Lizhe Wang,et al.  A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[53]  Xiangtao Zheng,et al.  Joint Dictionary Learning for Multispectral Change Detection , 2017, IEEE Transactions on Cybernetics.