Automatic building extraction from high-resolution aerial images and LiDAR data using gated residual refinement network

Abstract Automated extraction of buildings from remotely sensed data is important for a wide range of applications but challenging due to difficulties in extracting semantic features from complex scenes like urban areas. The recently developed fully convolutional neural networks (FCNs) have shown to perform well on urban object extraction because of the outstanding feature learning and end-to-end pixel labeling abilities. The commonly used feature fusion or skip-connection refine modules of FCNs often overlook the problem of feature selection and could reduce the learning efficiency of the networks. In this paper, we develop an end-to-end trainable gated residual refinement network (GRRNet) that fuses high-resolution aerial images and LiDAR point clouds for building extraction. The modified residual learning network is applied as the encoder part of GRRNet to learn multi-level features from the fusion data and a gated feature labeling (GFL) unit is introduced to reduce unnecessary feature transmission and refine classification results. The proposed model - GRRNet is tested in a publicly available dataset with urban and suburban scenes. Comparison results illustrated that GRRNet has competitive building extraction performance in comparison with other approaches. The source code of the developed GRRNet is made publicly available for studies.

[1]  Domen Mongus,et al.  Ground and building extraction from LiDAR data based on differential morphological profiles and locally fitted surfaces , 2014 .

[2]  Thomas Hofmann,et al.  Learning Aerial Image Segmentation From Online Maps , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[4]  Lingfeng Wang,et al.  Semantic Labeling in Very High Resolution Images via a Self-Cascaded Convolutional Neural Network , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[5]  Jamie Sherrah,et al.  Semantic Labeling of Aerial and Satellite Imagery , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[6]  Jiangye Yuan,et al.  Learning Building Extraction in Aerial Scenes with Convolutional Networks , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8]  Jie Shan,et al.  Segmentation and Reconstruction of Polyhedral Building Roofs From Aerial Lidar Point Clouds , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Rob Fergus,et al.  Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-scale Convolutional Architecture , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Liangpei Zhang,et al.  Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[11]  Wai Yeung Yan,et al.  Urban land cover classification using airborne LiDAR data: A review , 2015 .

[12]  Claus Brenner,et al.  Extraction of buildings and trees in urban environments , 1999 .

[13]  Uwe Stilla,et al.  Classification With an Edge: Improving Semantic Image Segmentation with Boundary Detection , 2016, ISPRS Journal of Photogrammetry and Remote Sensing.

[14]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  C. Fraser,et al.  Automatic Detection of Residential Buildings Using LIDAR Data and Multispectral Imagery , 2010 .

[16]  Jixian Zhang Multi-source remote sensing data fusion: status and trends , 2010 .

[17]  Payman Moallem,et al.  Building extraction from high-resolution satellite images in urban areas: recent methods and strategies against significant challenges , 2016 .

[18]  Yongyang Xu,et al.  Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters , 2018, Remote. Sens..

[19]  Xinchang Zhang,et al.  Developing a multi-filter convolutional neural network for semantic segmentation using high-resolution aerial imagery and LiDAR data , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[20]  Jaewook Jung,et al.  Results of the ISPRS benchmark on urban object detection and 3D building reconstruction , 2014 .

[21]  Wei Lee Woon,et al.  Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks , 2017 .

[22]  William J. Emery,et al.  Contextually guided very-high-resolution imagery classification with semantic segments , 2017 .

[23]  Ali Mohammadzadeh,et al.  A Novel Building and Tree Detection Method From LiDAR Data and Aerial Images , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[24]  Bisheng Yang,et al.  Automatic registration of UAV-borne sequent images and LiDAR data , 2015 .

[25]  P. Zingaretti,et al.  Performance evaluation of automated approaches to building detection in multi-source aerial data , 2010 .

[26]  Wei Yuan,et al.  Automatic Building Segmentation of Aerial Imagery Using Multi-Constraint Fully Convolutional Networks , 2018, Remote. Sens..

[27]  J. Niemeyer,et al.  Contextual classification of lidar data and building object detection in urban areas , 2014 .

[28]  Ying Wang,et al.  Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images , 2017, Remote. Sens..

[29]  Kyoung Mu Lee,et al.  Fusion of Lidar and Imagery for Reliable Building Extraction , 2008 .

[30]  Guojun Lu,et al.  An Automatic Building Extraction and Regularisation Technique Using LiDAR Point Cloud Data and Orthoimage , 2016, Remote. Sens..

[31]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[32]  Geoffrey E. Hinton,et al.  Learning to Detect Roads in High-Resolution Aerial Images , 2010, ECCV.

[33]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[34]  Xiaojun Yang,et al.  Detect Residential Buildings from Lidar and Aerial Photographs through Object-Oriented Land-Use Classification , 2012 .

[35]  Bertrand Le Saux,et al.  Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[36]  Siyang Chen,et al.  Automatic building extraction from LiDAR data fusion of point and grid-based features , 2017 .

[37]  Jian Wang,et al.  Automatic extraction of building boundaries using aerial LiDAR data , 2016 .

[38]  Xin Huang,et al.  A Multidirectional and Multiscale Morphological Index for Automatic Building Extraction from Multispectral GeoEye-1 Imagery , 2011 .

[39]  Gang Fu,et al.  Classification for High Resolution Remote Sensing Imagery Using a Fully Convolutional Network , 2017, Remote. Sens..

[40]  Shiming Xiang,et al.  Automatic Road Detection and Centerline Extraction via Cascaded End-to-End Convolutional Neural Network , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Michele Volpi,et al.  Dense Semantic Labeling of Subdecimeter Resolution Images With Convolutional Neural Networks , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[42]  John Trinder,et al.  Using the Dempster-Shafer method for the fusion of LIDAR data and multi-spectral images for building detection , 2005, Inf. Fusion.

[43]  Luis Ángel Ruiz Fernández,et al.  Evaluation of Automatic Building Detection Approaches Combining High Resolution Images and LiDAR Data , 2011, Remote. Sens..

[44]  Ramakant Nevatia,et al.  Detection and Modeling of Buildings from Multiple Aerial Images , 2001, IEEE Trans. Pattern Anal. Mach. Intell..