Building Extraction from Very-High-Resolution Remote Sensing Images Using Semi-Supervised Semantic Edge Detection

The automated detection of buildings in remote sensing images enables understanding the distribution information of buildings, which is indispensable for many geographic and social applications, such as urban planning, change monitoring and population estimation. The performance of deep learning in images often depends on a large number of manually labeled samples, the production of which is time-consuming and expensive. Thus, this study focuses on reducing the number of labeled samples used and proposing a semi-supervised deep learning approach based on an edge detection network (SDLED), which is the first to introduce semi-supervised learning to the edge detection neural network for extracting building roof boundaries from high-resolution remote sensing images. This approach uses a small number of labeled samples and abundant unlabeled images for joint training. An expert-level semantic edge segmentation model is trained based on labeled samples, which guides unlabeled images to generate pseudo-labels automatically. The inaccurate label sets and manually labeled samples are used to update the semantic edge model together. Particularly, we modified the semantic segmentation network D-LinkNet to obtain high-quality pseudo-labels. Specifically, the main network architecture of D-LinkNet is retained while the multi-scale fusion is added in its second half to improve its performance on edge detection. The SDLED was tested on high-spatial-resolution remote sensing images taken from Google Earth. Results show that the SDLED performs better than the fully supervised method. Moreover, when the trained models were used to predict buildings in the neighboring counties, our approach was superior to the supervised way, with line IoU improvement of at least 6.47% and F1 score improvement of at least 7.49%.

[1]  Bo Du,et al.  Saliency-Guided Unsupervised Feature Learning for Scene Classification , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Ming Yang,et al.  Bi-Directional Cascade Network for Perceptual Edge Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Michal Kedzierski,et al.  Detection, Classification and Boundary Regularization of Buildings in Satellite Imagery Using Faster Edge Region Convolutional Neural Networks , 2020, Remote. Sens..

[4]  Ehsan Harirchian,et al.  A Review on Application of Soft Computing Techniques for the Rapid Visual Safety Evaluation and Damage Classification of Existing Buildings , 2021 .

[5]  Xiang Bai,et al.  Richer Convolutional Features for Edge Detection , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jonathan Cheung-Wai Chan,et al.  Semi-Supervised Deep Learning Classification for Hyperspectral Image Based on Dual-Strategy Sample Selection , 2018, Remote. Sens..

[7]  Shunichi Koshimura,et al.  Pyramid Pooling Module-Based Semi-Siamese Network: A Benchmark Model for Assessing Building Damage from xBD Satellite Imagery Datasets , 2020, Remote. Sens..

[8]  Chun Liu,et al.  Automatic extraction of built-up area from ZY3 multi-view satellite imagery: Analysis of 45 global cities , 2019, Remote Sensing of Environment.

[9]  Dongmei Chen,et al.  Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[10]  Jorma Laaksonen,et al.  Multi-Hazard and Spatial Transferability of a CNN for Automated Building Damage Assessment , 2020, Remote. Sens..

[11]  Youqiang Dong,et al.  Extraction of Buildings from Multiple-View Aerial Images Using a Feature-Level-Fusion Strategy , 2018, Remote Sensing.

[12]  Yuan Hu,et al.  Dynamic Feature Fusion for Semantic Edge Detection , 2019, IJCAI.

[13]  Ming Wu,et al.  D-LinkNet: LinkNet with Pretrained Encoder and Dilated Convolution for High Resolution Satellite Imagery Road Extraction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[14]  Reshma Rastogi,et al.  Semi-supervised Weighted Ternary Decision Structure for Multi-category Classification , 2020, Neural Processing Letters.

[15]  Tingting Lv,et al.  Detecting Building Edges from High Spatial Resolution Remote Sensing Imagery Using Richer Convolution Features Network , 2018, Remote. Sens..

[16]  Lizhe Wang,et al.  A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[17]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[18]  Andreas Dengel,et al.  Multi-Task Learning for Segmentation of Building Footprints with Deep Neural Networks , 2017, 2019 IEEE International Conference on Image Processing (ICIP).

[19]  László Bertalan,et al.  Building Extraction Using Orthophotos and Dense Point Cloud Derived from Visual Band Aerial Imagery Based on Machine Learning and Segmentation , 2020, Remote. Sens..

[20]  Wei Wu,et al.  Refined extraction of buildings with the semantic edge-assisted approach from very high-resolution remotely sensed imagery , 2020 .

[21]  Weisheng Wang,et al.  A Study for Texture Feature Extraction of High-Resolution Satellite Images Based on a Direction Measure and Gray Level Co-Occurrence Matrix Fusion Algorithm , 2017, Sensors.

[22]  Xin Huang,et al.  An automatic change detection method for monitoring newly constructed building areas using time-series multi-view high-resolution optical satellite images , 2020 .

[23]  Nazzareno Pierdicca,et al.  Earthquake damage mapping: An overall assessment of ground surveys and VHR image change detection after L'Aquila 2009 earthquake , 2018, Remote Sensing of Environment.

[24]  Dong-Hyun Lee,et al.  Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[25]  Susu Xu,et al.  Knowledge Transfer between Buildings for Seismic Damage Diagnosis through Adversarial Learning , 2020, 2002.09513.

[26]  Hao Wu,et al.  Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification , 2018, IEEE Transactions on Image Processing.

[27]  Zhongsheng Hua,et al.  Semi-supervised learning based on nearest neighbor rule and cut edges , 2010, Knowl. Based Syst..

[28]  Meng Lu,et al.  A scale robust convolutional neural network for automatic building extraction from aerial and satellite imagery , 2018, International Journal of Remote Sensing.

[29]  Francisco Herrera,et al.  Self-labeled techniques for semi-supervised learning: taxonomy, software and empirical study , 2015, Knowledge and Information Systems.

[30]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Kai Ma,et al.  Self-Loop Uncertainty: A Novel Pseudo-Label for Semi-Supervised Medical Image Segmentation , 2020, MICCAI.

[32]  Zhi-Hua Zhou,et al.  Improve Computer-Aided Diagnosis With Machine Learning Techniques Using Undiagnosed Samples , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[33]  Yiwen Hu,et al.  Building Extraction Using Mask Scoring R-CNN Network , 2019, CSAE 2019.

[34]  Jiangye Yuan,et al.  Learning Building Extraction in Aerial Scenes with Convolutional Networks , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.