Deep Learning Guided Building Reconstruction from Satellite Imagery-derived Point Clouds

3D urban reconstruction of buildings from remotely sensed imagery has drawn significant attention during the past two decades. While aerial imagery and LiDAR provide higher resolution, satellite imagery is cheaper and more efficient to acquire for large scale need. However, the high, orbital altitude of satellite observation brings intrinsic challenges, like unpredictable atmospheric effect, multi view angles, significant radiometric differences due to the necessary multiple views, diverse land covers and urban structures in a scene, small base-height ratio or narrow field of view, all of which may degrade 3D reconstruction quality. To address these major challenges, we present a reliable and effective approach for building model reconstruction from the point clouds generated from multi-view satellite images. We utilize multiple types of primitive shapes to fit the input point cloud. Specifically, a deep-learning approach is adopted to distinguish the shape of building roofs in complex and yet noisy scenes. For points that belong to the same roof shape, a multi-cue, hierarchical RANSAC approach is proposed for efficient and reliable segmenting and reconstructing the building point cloud. Experimental results over four selected urban areas (0.34 to 2.04 sq km in size) demonstrate the proposed method can generate detailed roof structures under noisy data environments. The average successful rate for building shape recognition is 83.0%, while the overall completeness and correctness are over 70% with reference to ground truth created from airborne lidar. As the first effort to address the public need of large scale city model generation, the development is deployed as open source software.

[1]  Daniel G. Aliaga,et al.  Automatic Extraction of Manhattan-World Building Masses from 3D Laser Range Scans , 2012, IEEE Transactions on Visualization and Computer Graphics.

[2]  Vivek Verma,et al.  3D Building Detection and Modeling from Aerial LIDAR Data , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Scott Almes,et al.  Large-scale public lidar and satellite image data set for urban semantic labeling , 2018, Defense + Security.

[4]  Myron Brown,et al.  METRIC EVALUATION PIPELINE FOR 3D MODELING OF URBAN SCENES , 2017 .

[5]  George Vosselman,et al.  Building Reconstruction by Target Based Graph Matching on Incomplete Laser Data: Analysis and Limitations , 2009, Sensors.

[6]  Wuming Zhang,et al.  An Easy-to-Use Airborne LiDAR Data Filtering Method Based on Cloth Simulation , 2016, Remote. Sens..

[7]  Jie Shan,et al.  Segmentation and Reconstruction of Polyhedral Building Roofs From Aerial Lidar Point Clouds , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[8]  J. Shan,et al.  Building boundary tracing and regularization from airborne lidar point clouds , 2007 .

[9]  Subhransu Maji,et al.  CSGNet: Neural Shape Parser for Constructive Solid Geometry , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[10]  M. Jancosek,et al.  Flexible building primitives for 3D building modeling , 2015 .

[11]  Jiaye Wu,et al.  Neural Procedural Reconstruction for Residential Buildings , 2018, ECCV.

[12]  Josiane Zerubia,et al.  Structural Approach for Building Reconstruction from a Single DSM , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Norbert Haala,et al.  An update on automatic 3D building reconstruction , 2010 .

[15]  Florent Lafarge,et al.  LOD Generation for Urban Scenes , 2015, ACM Trans. Graph..

[16]  Hans-Gerd Maas,et al.  Cycle graph analysis for 3D roof structure modelling: Concepts and performance , 2014 .

[17]  George Vosselman,et al.  Airborne and terrestrial laser scanning , 2011, Int. J. Digit. Earth.

[18]  Bo Xu,et al.  Urban Semantic 3D Reconstruction From Multiview Satellite Imagery , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[19]  Yin Zhou,et al.  VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20]  Marc Alexa,et al.  Computing and Rendering Point Set Surfaces , 2003, IEEE Trans. Vis. Comput. Graph..

[21]  Wanshou Jiang,et al.  HRTT: A Hierarchical Roof Topology Structure for Robust Building Roof Reconstruction from Point Clouds , 2017, Remote. Sens..

[22]  Daniel G. Aliaga,et al.  A Survey of Urban Reconstruction , 2013, Comput. Graph. Forum.

[23]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[25]  Florent Lafarge,et al.  Towards Large-Scale City Reconstruction from Satellites , 2016, ECCV.

[26]  Yongjun Zhang,et al.  3D building roof reconstruction from airborne LiDAR point clouds: a framework based on a spatial database , 2017, Int. J. Geogr. Inf. Sci..

[27]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[28]  Hai Huang,et al.  Towards Automatic Large-Scale 3D Building Reconstruction: Primitive Decomposition and Assembly , 2017, AGILE Conf..

[29]  Dong Chen,et al.  LiDAR Point Clouds to 3-D Urban Models$:$ A Review , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[30]  Lutz Plümer,et al.  Model driven reconstruction of roofs from sparse LIDAR point clouds , 2013 .

[31]  Jing Zhang,et al.  Investigation on the Weighted RANSAC Approaches for Building Roof Plane Segmentation from LiDAR Point Clouds , 2015, Remote. Sens..

[32]  Florent Lafarge,et al.  Building large urban environments from unstructured point data , 2011, 2011 International Conference on Computer Vision.