Exploring multiscale object-based convolutional neural network (multi-OCNN) for remote sensing image classification at high spatial resolution

Abstract Convolutional Neural Network (CNN) has been increasingly used for land cover mapping of remotely sensed imagery. However, large-area classification using traditional CNN is computationally expensive and produces coarse maps using a sliding window approach. To address this problem, object-based CNN (OCNN) becomes an alternative solution to improve classification performance. However, previous studies were mainly focused on urban areas or small scenes, and implementation of OCNN method is still needed for large-area classification over heterogeneous landscape. Additionally, the massive labeling of segmented objects requires a practical approach for less computation, including object analysis and multiple CNNs. This study presents a new multiscale OCNN (multi-OCNN) framework for large-scale land cover classification at 1-m resolution over 145,740 km2. Our approach consists of three main steps: (i) image segmentation, (ii) object analysis with skeleton-based algorithm, and (iii) application of multiple CNNs for final classification. Also, we developed a large benchmark dataset, called IowaNet, with 1 million labeled images and 10 classes. In our approach, multiscale CNNs were trained to capture the best contextual information during the semantic labeling of objects. Meanwhile, skeletonization algorithm provided morphological representation (“medial axis”) of objects to support the selection of convolutional locations for CNN predictions. In general, proposed multi-OCNN presented better classification accuracy (overall accuracy ~87.2%) compared to traditional patch-based CNN (81.6%) and fixed-input OCNN (82%). In addition, the results showed that this framework is 8.1 and 111.5 times faster than traditional pixel-wise CNN16 or CNN256, respectively. Multiple CNNs and object analysis have proved to be essential for accurate and fast classification. While multi-OCNN produced a high-level of spatial details in the land cover product, misclassification was observed for some classes, such as road versus buildings or shadow versus lake. Despite these minor drawbacks, our results also demonstrated the benefits of IowaNet training dataset in the model performance; overfitting process reduces as the number of samples increases. The limitations of multi-OCNN are partially explained by segmentation quality and limited number of spectral bands in the aerial data. With the advance of deep learning methods, this study supports the claim of multi-OCNN benefits for operational large-scale land cover product at 1-m resolution.

[1]  Mariana Belgiu,et al.  Random forest in remote sensing: A review of applications and future directions , 2016 .

[2]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[3]  Liang Fan,et al.  Multiscale and Multifeature Segmentation of High-Spatial Resolution Remote Sensing Images Using Superpixels with Mutual Optimal Strategy , 2018, Remote. Sens..

[4]  Suming Jin,et al.  Completion of the 2011 National Land Cover Database for the Conterminous United States – Representing a Decade of Land Cover Change Information , 2015 .

[5]  Willem Bouten,et al.  Segmentation optimization and stratified object-based analysis for semi-automated geomorphological mapping , 2011 .

[6]  Xiuwen Liu,et al.  A patch-based convolutional neural network for remote sensing image classification , 2017, Neural Networks.

[7]  Sabine Vanhuysse,et al.  Fully Convolutional Networks and Geographic Object-Based Image Analysis for the Classification of VHR Imagery , 2019, Remote. Sens..

[8]  Lin Lei,et al.  Multi-scale object detection in remote sensing imagery with convolutional neural networks , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[9]  Xia Li,et al.  Integration of Convolutional Neural Networks and Object-Based Post-Classification Refinement for Land Use and Land Cover Mapping with Optical and SAR Data , 2019, Remote. Sens..

[10]  Yoshua Bengio,et al.  The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[11]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[13]  Qihao Weng,et al.  An Automated Method to Parameterize Segmentation Scale by Enhancing Intrasegment Homogeneity and Intersegment Heterogeneity , 2015, IEEE Geoscience and Remote Sensing Letters.

[14]  Yongyang Xu,et al.  Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters , 2018, Remote. Sens..

[15]  Bo Huang,et al.  Urban land-use mapping using a deep convolutional neural network with high spatial resolution multispectral remote sensing imagery , 2018, Remote Sensing of Environment.

[16]  Dongping Ming,et al.  Superpixel based land cover classification of VHR satellite image combining multi-scale CNN and scale parameter estimation , 2019, Earth Science Informatics.

[17]  Shihong Du,et al.  Learning multiscale and deep representations for classifying remotely sensed imagery , 2016 .

[18]  Gang Fu,et al.  Classification for High Resolution Remote Sensing Imagery Using a Fully Convolutional Network , 2017, Remote. Sens..

[19]  Dirk Tiede,et al.  ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data , 2010, Int. J. Geogr. Inf. Sci..

[20]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Supratik Mukhopadhyay,et al.  A Semiautomated Probabilistic Framework for Tree-Cover Delineation From 1-m NAIP Imagery Using a High-Performance Computing Architecture , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[22]  Qingshan Liu,et al.  Learning Multiscale Deep Features for High-Resolution Satellite Image Scene Classification , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[23]  Dongmei Chen,et al.  Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[24]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  Xueliang Zhang,et al.  Hybrid region merging method for segmentation of high-resolution remote sensing images , 2014 .

[26]  Hongyu Li,et al.  Image segmentation using mean shift for extracting croplands from high-resolution remote sensing imagery , 2015 .

[27]  Ching Y. Suen,et al.  A fast parallel algorithm for thinning digital patterns , 1984, CACM.

[28]  Uwe Stilla,et al.  SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS , 2016 .

[29]  Thomas Blaschke,et al.  A comparison of three image-object methods for the multiscale analysis of landscape structure , 2003 .

[30]  Hang Zhou,et al.  Deep learning based multi-temporal crop classification , 2019, Remote Sensing of Environment.

[31]  Zahra Dabiri,et al.  Scale matters: a survey of the concepts of scale used in spatial disciplines , 2019, European journal of remote sensing.

[32]  Antonio Plaza,et al.  A new deep convolutional neural network for fast hyperspectral image classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[33]  Aaron E. Maxwell,et al.  Comparison of NAIP orthophotography and RapidEye satellite imagery for mapping of mining and mine reclamation , 2014 .

[34]  Song Wang,et al.  New benchmark for image segmentation evaluation , 2007, J. Electronic Imaging.

[35]  O. Csillik,et al.  Automated parameterisation for multi-scale image segmentation on multiple layers , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[36]  Hong Zhang,et al.  An evaluation metric for image segmentation of multiple objects , 2009, Image Vis. Comput..

[37]  Yun Zhang,et al.  Very Deep Convolutional Neural Networks for Complex Land Cover Mapping Using Multispectral Remote Sensing Imagery , 2018, Remote. Sens..

[38]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[39]  Zhenfeng Shao,et al.  PatternNet: A Benchmark Dataset for Performance Evaluation of Remote Sensing Image Retrieval , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[40]  Xiaoxiao Li,et al.  Object-based land-cover classification for metropolitan Phoenix, Arizona, using aerial photography , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[41]  Hankui K. Zhang,et al.  Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data , 2013 .

[42]  Shihua Li,et al.  Object-Oriented Method Combined with Deep Convolutional Neural Networks for Land-Use-Type Classification of Remote Sensing Images , 2019, Journal of the Indian Society of Remote Sensing.

[43]  William J. Emery,et al.  Object-Based Convolutional Neural Network for High-Resolution Imagery Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[44]  Suming Jin,et al.  A comprehensive change detection method for updating the National Land Cover Database to circa 2011 , 2013 .

[45]  Qihao Weng,et al.  A survey of image classification methods and techniques for improving classification performance , 2007 .

[46]  Huimin Yan,et al.  A Deep Convolution Neural Network Method for Land Cover Mapping: A Case Study of Qinhuangdao, China , 2018, Remote. Sens..

[47]  Timothy A. Warner,et al.  Land Cover Classification and Feature Extraction from National Agriculture Imagery Program (NAIP) Orthoimagery: A Review , 2017 .

[48]  Xianhong Xie,et al.  Land cover classification of finer resolution remote sensing data integrating temporal features from time series coarser resolution data , 2014 .

[49]  M. Mahdianpari,et al.  Random forest wetland classification using ALOS-2 L-band, RADARSAT-2 C-band, and TerraSAR-X imagery , 2017 .

[50]  Xiuping Jia,et al.  Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[51]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[52]  C. Woodcock,et al.  Continuous change detection and classification of land cover using all available Landsat data , 2014 .

[53]  Tao Liu,et al.  Comparing fully convolutional networks, random forest, support vector machine, and patch-based deep convolutional neural networks for object-based wetland mapping using images from small unmanned aircraft system , 2018 .

[54]  Yun Zhang,et al.  Constructing Hierarchical Segmentation Tree for Feature Extraction and Land Cover Classification of High Resolution MS Imagery , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[55]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[56]  Feng Li,et al.  Fusion of Multiscale Convolutional Neural Networks for Building Extraction in Very High-Resolution Images , 2019, Remote. Sens..

[57]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[58]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[59]  Jungho Im,et al.  ISPRS Journal of Photogrammetry and Remote Sensing , 2022 .

[60]  Takayoshi Yamashita,et al.  Multiple Object Extraction from Aerial Imagery with Convolutional Neural Networks , 2016, IRIACV.

[61]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  Amy Loutfi,et al.  Classification and Segmentation of Satellite Orthoimagery Using Convolutional Neural Networks , 2016, Remote. Sens..

[63]  Peng Gong,et al.  Global land cover mapping using Earth observation satellite data: Recent progresses and challenges , 2015 .

[64]  Fei Yuan,et al.  High-resolution Land Cover and Impervious Surface Classifications in the Twin Cities Metropolitan Area with NAIP Imagery , 2016 .

[65]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[66]  Nataliia Kussul,et al.  Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[67]  Jamie Sherrah,et al.  Semantic Labeling of Aerial and Satellite Imagery , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[68]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69]  Giles M. Foody,et al.  Key issues in rigorous accuracy assessment of land cover products , 2019, Remote Sensing of Environment.

[70]  Jahan Kariyeva,et al.  Comparing Deep Learning and Shallow Learning for Large-Scale Wetland Classification in Alberta, Canada , 2019, Remote. Sens..

[71]  Limin Yang,et al.  A new generation of the United States National Land Cover Database: Requirements, research priorities, design, and implementation strategies , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[72]  Weijia Li,et al.  Large-Scale Oil Palm Tree Detection from High-Resolution Satellite Images Using Two-Stage Convolutional Neural Networks , 2018, Remote. Sens..

[73]  Min Wang,et al.  A New Method for Region-Based Majority Voting CNNs for Very High Resolution Image Classification , 2018, Remote. Sens..

[74]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[75]  Wei Lee Woon,et al.  Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks , 2017 .

[76]  Xin Pan,et al.  An object-based convolutional neural network (OCNN) for urban land use classification , 2018, Remote Sensing of Environment.

[77]  Jefersson Alex dos Santos,et al.  Towards better exploiting convolutional neural networks for remote sensing scene classification , 2016, Pattern Recognit..

[78]  Dongping Ming,et al.  Semivariogram-Based Spatial Bandwidth Selection for Remote Sensing Image Segmentation With Mean-Shift Algorithm , 2012, IEEE Geoscience and Remote Sensing Letters.

[79]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[80]  Bertrand Le Saux,et al.  Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[81]  Peijun Du,et al.  A review of supervised object-based land-cover image classification , 2017 .

[82]  Yun Zhang,et al.  Deep Convolutional Neural Network for Complex Wetland Classification Using Optical Remote Sensing Imagery , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[83]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[84]  Leiguang Wang,et al.  Adaptive regional feature extraction for very high spatial resolution image classification , 2012 .

[85]  Yuhan Rao,et al.  Land cover change detection by integrating object-based data blending model of Landsat and MODIS , 2016 .

[86]  Stephen V. Stehman,et al.  Sampling designs for accuracy assessment of land cover , 2009 .

[87]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).