3D building modeling using images and LiDAR: a review

3D modeling from images and LiDAR (Light Detection And Ranging) has been an active research area in the photogrammetry, computer vision, and computer graphics communities. In terms of literature review, a comprehensive survey on 3D building modeling that contains methods from all these fields will be beneficial. This article attempts to survey the state-of-the-art 3D building modeling methods in the areas of photogrammetry, computer vision, and computer graphics. The existing methods are grouped into three categories: 3D reconstruction from images, 3D modeling using range data, and 3D modeling using images and range data. The use of both data for 3D modeling is a sensor fusion approach, in which methods of image-to-LiDAR registration, upsampling, and image-guided segmentation are reviewed. For each category, the key problems are identified and solutions are addressed.

[1]  Jie Chen,et al.  Architectural Modeling from Sparsely Scanned Range Data , 2008, International Journal of Computer Vision.

[2]  Lukas Zebedin,et al.  Towards 3D map generation from digital aerial images , 2006 .

[3]  Gabriele Guidi,et al.  3D digitization of a large model of imperial Rome , 2005, Fifth International Conference on 3-D Digital Imaging and Modeling (3DIM'05).

[4]  F. Ackermann Airborne laser scanning : present status and future expectations , 1999 .

[5]  Christian Früh,et al.  Data Processing Algorithms for Generating Textured 3D Building Facade Meshes from Laser Scans and Camera Images , 2005, International Journal of Computer Vision.

[6]  George Vosselman,et al.  Knowledge based reconstruction of building models from terrestrial laser scanning data , 2009 .

[7]  Fabio Remondino,et al.  Detailed 3D Modelling of Castles , 2007 .

[8]  Deren Li Remotely sensed images and GIS data fusion for automatic change detection , 2010 .

[9]  Wenyi Zhao,et al.  Alignment of continuous video onto 3D point clouds , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Li Zhang,et al.  Single view modeling of free-form scenes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  O.R.P. Bellon,et al.  New improvements to range image segmentation by edge detection , 2002, IEEE Signal Processing Letters.

[12]  Eyal Ben-Dor,et al.  Modelling and monitoring urban built environment via multi-source integrated and fused remote sensing data , 2013 .

[13]  Stephen J. Maybank,et al.  A Method for Interactive 3D Reconstruction of Piecewise Planar Objects from Single Images , 1999, BMVC.

[14]  Andrew Zisserman,et al.  Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[15]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1997, International Journal of Computer Vision.

[16]  Christian Früh,et al.  Constructing 3D city models by merging ground-based and airborne views , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  Qian-Yi Zhou,et al.  Fast and extensible building modeling from airborne LiDAR data , 2008, GIS '08.

[18]  Suya You,et al.  Urban Site Modeling from LiDAR , 2003, ICCSA.

[19]  Ahmed F. Elaksher,et al.  Reconstructing 3 D Buildings from Lidar Data , 2002 .

[20]  Sagi Filin,et al.  SEGMENTATION OF TERRESTRIAL LASER SCANNING DATA BY INTEGRATING RANGE AND IMAGE CONTENT , 2008 .

[21]  David Nist Automatic Passive Recovery of 3D from Images and Video , 2004 .

[22]  Horst Bunke,et al.  Fast range image segmentation using high-level segmentation primitives , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[23]  K. Kraus,et al.  Determination of terrain models in wooded areas with airborne laser scanner data , 1998 .

[24]  George Vosselman,et al.  Building extraction from oblique airborne imagery based on robust façade detection , 2012 .

[25]  Pietro Zanuttigh,et al.  A Novel Interpolation Scheme for Range Data with Side Information , 2009, 2009 Conference for Visual Media Production.

[26]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[27]  Philip H. S. Torr,et al.  VideoTrace: rapid interactive scene modelling from video , 2007, SIGGRAPH 2007.

[28]  Allen R. Hanson,et al.  Generalized parallel-perspective stereo mosaics from airborne video , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Mahmoud Melkemi,et al.  Range-Image segmentation and model reconstruction based on a fit-and-merge strategy , 2002, SMA '02.

[30]  Luc Van Gool,et al.  Procedural modeling of buildings , 2006, SIGGRAPH 2006.

[31]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[32]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[33]  Bernhard Hohmann,et al.  Cityfit - High-Quality Urban Reconstruction by Fitting Shape Grammars to Image and derived Textured Point Clouds , 2008 .

[34]  Ramesh C. Jain,et al.  Segmentation through Variable-Order Surface Fitting , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Frank P. Ferrie,et al.  Camera localization and building reconstruction from single monocular images , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[36]  Oszkár Józsa,et al.  Analysis of 3D Dynamic Urban Scenes Based on LiDAR Point Cloud Sequences , 2013 .

[37]  Paulo Dias,et al.  Registration and fusion of intensity and range data for 3D modelling of real world scenes , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[38]  Robert L. Stevenson,et al.  Super-resolution from image sequences-a review , 1998, 1998 Midwest Symposium on Circuits and Systems (Cat. No. 98CB36268).

[39]  N. Mitra,et al.  Non-local scan consolidation for 3D urban scenes , 2010, ACM Trans. Graph..

[40]  James R. Williamson,et al.  Dimensional Analysis Through Perspective: A Reference Manual , 1990 .

[41]  Cecilia Chao Chen Range segmentation and registration for 3D modeling of large scale urban scenes , 2007 .

[42]  C. Briese,et al.  A NEW METHOD FOR BUILDING EXTRACTION IN URBAN AREAS FROM HIGH-RESOLUTION LIDAR DATA , 2002 .

[43]  Jitendra Malik,et al.  Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[44]  Tao Ju,et al.  Dual contouring of hermite data , 2002, ACM Trans. Graph..

[45]  Andrew W. Fitzgibbon,et al.  Automatic 3D model acquisition and generation of new images from video sequences , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[46]  Camillo J. Taylor,et al.  Reconstruction of Linearly Parameterized Models from Single Images with a Camera of Unknown Focal Length , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Lu Wang,et al.  A robust approach for automatic registration of aerial images with untextured aerial LiDAR data , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Rudolph Triebel,et al.  Non-Iterative Vision-Based Interpolation of 3D Laser Scans , 2007 .

[49]  Ruigang Yang,et al.  Spatial-Depth Super Resolution for Range Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Antonio Criminisi,et al.  Creating Architectural Models from Images , 1999, Comput. Graph. Forum.

[51]  Supun Samarasekera,et al.  Building segmentation for densely built urban regions using aerial LIDAR data , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Luc Van Gool,et al.  Image-based procedural modeling of facades , 2007, ACM Trans. Graph..

[53]  Avideh Zakhor,et al.  Constructing a Multivalued Representation for View Synthesis , 2001, International Journal of Computer Vision.

[54]  David Nistér Automatic passive recovery of 3D from images and video , 2004, Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004..

[55]  Luc Van Gool,et al.  Towards mass-produced building models , 2007 .

[56]  Naokazu Yokoya,et al.  A Hybrid Approach to Range Image Segmentation Based on Differential Geometry , 1992 .

[57]  Suya You,et al.  Automatic reconstruction of cities from remote sensor data , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  George Vosselman,et al.  3D BUILDING MODEL RECONSTRUCTION FROM POINT CLOUDS AND GROUND PLANS , 2001 .

[59]  Noah Snavely,et al.  Scene Reconstruction and Visualization from Internet Photo Collections: A Survey , 2011, IPSJ Trans. Comput. Vis. Appl..

[60]  Ulrich Neumann,et al.  2.5D Dual Contouring: A Robust Approach to Creating Building Models from Aerial LiDAR Point Clouds , 2010, ECCV.

[61]  Luc Van Gool,et al.  Image-based procedural modeling of facades , 2007, SIGGRAPH 2007.

[62]  Michael Elad,et al.  Fast and Robust Multi-Frame Super-Resolution , 2004, IEEE Transactions on Image Processing.

[63]  Gregory Dudek,et al.  Reconstruction of 3D Models from Intensity Images and Partial Depth , 2004, AAAI.

[64]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[65]  Sebastian Thrun,et al.  LidarBoost: Depth superresolution for ToF 3D shape scanning , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[66]  Ioannis Stamos,et al.  Automatic 3D to 2D registration for the photorealistic rendering of urban scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[67]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[68]  Ryosuke Shibasaki,et al.  Reconstructing urban 3D model using vehicle-borne laser range scanners , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[69]  S. Finsterwalder Die geometrischen Grundlagen der Photogrammetrie. , 1897 .

[70]  Hangbin Wu,et al.  Extraction of Urban 3D Features from Lidar Data Fused with Aerial Images Using an Improved Mean Shift Algorithm , 2011 .

[71]  G. Vosselman BUILDING RECONSTRUCTION USING PLANAR FACES IN VERY HIGH DENSITY HEIGHT DATA , 1999 .

[72]  Fabio Remondino,et al.  Image‐based 3D Modelling: A Review , 2006 .

[73]  Sebastian Thrun,et al.  Upsampling range data in dynamic environments , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[74]  Armin B. Cremers,et al.  Extracting Buildings from Aerial Images Using Hierarchical Aggregation in 2D and 3D , 1998, Comput. Vis. Image Underst..

[75]  Naokazu Yokoya,et al.  Range Image Segmentation Based on Differential Geometry: A Hybrid Approach , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[76]  Michael Goesele,et al.  Multi-View Stereo for Community Photo Collections , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[77]  Z. Pizlo Perception viewed as an inverse problem , 2001, Vision Research.

[78]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[79]  Frédo Durand,et al.  A gentle introduction to bilateral filtering and its applications , 2007, SIGGRAPH Courses.

[80]  B. Caprile,et al.  Using vanishing points for camera calibration , 1990, International Journal of Computer Vision.

[81]  Jana Kosecka,et al.  Piecewise planar city 3D modeling from street view panoramic sequences , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[82]  Samia Boukir,et al.  Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests , 2011 .

[83]  Ulrich Neumann,et al.  2.5D building modeling with topology control , 2011, CVPR 2011.

[84]  Michal Irani,et al.  Improving resolution by image registration , 1991, CVGIP Graph. Model. Image Process..

[85]  John W. Fisher,et al.  Automatic registration of LIDAR and optical images of urban scenes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[86]  Ioannis Stamos,et al.  Integrating Automated Range Registration with Multiview Geometry for the Photorealistic Modeling of Large-Scale Scenes , 2008, International Journal of Computer Vision.

[87]  Frank P. Ferrie,et al.  Automatic registration of mobile LiDAR and spherical panoramas , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[88]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[89]  Feng Han,et al.  Bayesian reconstruction of 3D shapes and scenes from a single image , 2003, First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003..

[90]  Jianxiong Xiao,et al.  Image-based façade modeling , 2008, SIGGRAPH 2008.

[91]  Wolfgang Förstner,et al.  Towards automatic building extraction from high-resolution digital elevation models , 1995 .

[92]  Ioannis Stamos,et al.  Geometry and Texture Recovery of Scenes of Large Scale , 2002, Comput. Vis. Image Underst..

[93]  Richard A. Volz,et al.  Range image segmentation and surface parameter extraction for 3-D object recognition of industrial parts , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[94]  Ioannis Stamos,et al.  Range Image Segmentation for Modeling and Object Detection in Urban Scenes , 2007, Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007).

[95]  Hongdong Li,et al.  Five-Point Motion Estimation Made Easy , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[96]  Alexei A. Efros,et al.  Automatic photo pop-up , 2005, SIGGRAPH 2005.

[97]  S. El-Hakim,et al.  SURFACE RECONSTRUCTION OF LARGE COMPLEX STRUCTURES FROM MIXED RANGE DATA – THE ERECHTHEION EXPERIENCE , 2008 .

[98]  Sebastian Thrun,et al.  An Application of Markov Random Fields to Range Sensing , 2005, NIPS.

[99]  Roland Wahl,et al.  Ransac Based Out-of-Core Point-Cloud Shape Detection for City-Modeling , 2007 .

[100]  Richard Szeliski,et al.  Interactive 3D architectural modeling from unordered photo collections , 2008, SIGGRAPH Asia '08.

[101]  D. Cohen-Or,et al.  SmartBoxes for interactive urban reconstruction , 2010, ACM Trans. Graph..

[102]  Alexei A. Efros,et al.  Automatic photo pop-up , 2005, ACM Trans. Graph..

[103]  Roberto Cipolla,et al.  PhotoBuilder-3D models of architectural scenes from uncalibrated images , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[104]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[105]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[106]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[107]  Diego González-Aguilera,et al.  An automatic procedure for co-registration of terrestrial laser scanners and digital cameras , 2009 .

[108]  Jianxiong Xiao,et al.  Image-based façade modeling , 2008, ACM Trans. Graph..

[109]  Suchendra M. Bhandarkar,et al.  Integrating edge and surface information for range image segmentation , 1992, Pattern Recognit..

[110]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[111]  Ulrich Neumann,et al.  A streaming framework for seamless building reconstruction from large-scale aerial LiDAR data , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[112]  Andrew W. Fitzgibbon,et al.  An Experimental Comparison of Range Image Segmentation Algorithms , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[113]  F. A. Heuvel,et al.  TRENDS IN CAD-BASED PHOTOGRAMMETRIC MEASUREMENT , 2000 .

[114]  Richard Szeliski,et al.  Towards Internet-scale multi-view stereo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[115]  Rudolph Triebel,et al.  Vision based interpolation of 3D laser scans , 2006 .

[116]  Armin Gruen,et al.  CC-MODELER : A TOPOLOGY GENERATOR FOR 3-D CITY MODELS , 1998 .

[117]  David J. Kriegman,et al.  Structure and Motion from Line Segments in Multiple Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[118]  George Vosselman,et al.  Quality analysis on 3D building models reconstructed from airborne laser scanning data , 2011 .

[119]  Ioannis Stamos,et al.  A systematic approach for 2D-image to 3D-range registration in urban environments , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[120]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[121]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[122]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[123]  A. P. Nyaruhuma,et al.  Verification of 2D building outlines using oblique airborne images , 2012 .

[124]  Jianxiong Xiao,et al.  Image-based street-side city modeling , 2009, SIGGRAPH 2009.

[125]  Zhenguo Li,et al.  A Closed-form Solution to 3D Reconstruction of Piecewise Planar Objects from Single Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[126]  George Vosselman,et al.  Two algorithms for extracting building models from raw laser altimetry data , 1999 .

[127]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[128]  Nina Amenta,et al.  Laser Scanner Super-resolution , 2006, PBG@SIGGRAPH.

[129]  Frank P. Ferrie,et al.  Window detection from mobile LiDAR data , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[130]  Reinhard Klein,et al.  Efficient RANSAC for Point‐Cloud Shape Detection , 2007, Comput. Graph. Forum.

[131]  Michael Elad,et al.  Fast and robust multiframe super resolution , 2004, IEEE Transactions on Image Processing.

[132]  N. Haala,et al.  COMBINED FEATURE EXTRACTION FOR FAÇADE RECONSTRUCTION , 2007 .

[133]  Daniel G. Aliaga,et al.  Modelling the Appearance and Behaviour of Urban Spaces , 2010, Comput. Graph. Forum.

[134]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[135]  Vivek Verma,et al.  3D Building Detection and Modeling from Aerial LIDAR Data , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[136]  Philip H. S. Torr,et al.  VideoTrace: rapid interactive scene modelling from video , 2007, ACM Trans. Graph..

[137]  C. Brenner Building reconstruction from images and laser scanning , 2005 .

[138]  F. A. Heuvel VANISHING POINT DETECTION FOR ARCHITECTURAL PHOTOGRAMMETRY , 1998 .

[139]  G. Vosselman,et al.  Map based building reconstruction from laser data and images , 2001 .

[140]  Franz Rottensteiner,et al.  Automatic Generation of High-Quality Building Models from Lidar Data , 2003, IEEE Computer Graphics and Applications.

[141]  Avideh Zakhor,et al.  Automatic registration of aerial imagery with untextured 3D LiDAR models , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[142]  Bill Cowan,et al.  Simple 3D Reconstruction of Single Indoor Image with Perspective Cues , 2009, 2009 Canadian Conference on Computer and Robot Vision.

[143]  Nicolas Paparoditis,et al.  STRATEGY FOR THE EXTRACTION OF 3D ARCHITECTURAL OBJECTS FROM LASER AND IMAGE DATA ACQUIRED FROM THE SAME VIEWPOINT , 2005 .

[144]  Ken-ichi Anjyo,et al.  Tour into the picture: using a spidery mesh interface to make animation from a single image , 1997, SIGGRAPH.

[145]  Jianxiong Xiao,et al.  Image-based street-side city modeling , 2009, ACM Trans. Graph..

[146]  Jixian Zhang Multi-source remote sensing data fusion: status and trends , 2010 .

[147]  Ashutosh Saxena,et al.  Make3D: Learning 3D Scene Structure from a Single Still Image , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.