CONTEXT-BASED URBAN TERRAIN RECONSTRUCTION FROM IMAGES AND VIDEOS

Abstract. Detection of buildings and vegetation, and even more reconstruction of urban terrain from sequences of aerial images and videos is known to be a challenging task. It has been established that those methods that have as input a high-quality Digital Surface Model (DSM), are more straight-forward and produce more robust and reliable results than those image-based methods that require matching line segments or even whole regions. This motivated us to develop a new dense matching technique for DSM generation that is capable of simultaneous integration of multiple images in the reconstruction process. The DSMs generated by this new multi-image matching technique can be used for urban object extraction. In the first contribution of this paper, two examples of external sources of information added to the reconstruction pipeline will be shown. The GIS layers are used for recognition of streets and suppressing false alarms in the depth maps that were caused by moving vehicles while the near infrared channel is applied for separating vegetation from buildings. Three examples of data sets including both UAV-borne video sequences with a relatively high number of frames and high-resolution (10 cm ground sample distance) data sets consisting of (few spatial-temporarily diverse) images from large-format aerial frame cameras, will be presented. By an extensive quantitative evaluation of the Vaihingen block from the ISPRS benchmark on urban object detection, it will become clear that our procedure allows a straight-forward, efficient, and reliable instantiation of 3D city models.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  J. Lavery,et al.  Reconstruction and Texturing of 3D Urban Terrain from Uncalibrated Monocular Images Using L 1 Splines , 2010 .

[3]  Andrew Zisserman,et al.  A PLANE-SWEEP STRATEGY FOR THE 3D RECONSTRUCTION OF BUILDINGS FROM MULTIPLE IMAGES , 2000 .

[4]  George Vosselman,et al.  3D BUILDING MODEL RECONSTRUCTION FROM POINT CLOUDS AND GROUND PLANS , 2001 .

[5]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[6]  Horst Bischof,et al.  Fusion of Feature- and Area-Based Information for Urban Buildings Modeling from Aerial Imagery , 2008, ECCV.

[7]  J. B. Burns,et al.  Extracting straight lines , 1987 .

[8]  C. Heipke,et al.  Context-based urban terrain reconstruction from uav-videos for geoinformation applications , 2012 .

[9]  C. Heipke,et al.  Multi-view dense matching supported by triangular meshes , 2011 .

[10]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Armin B. Cremers,et al.  Extracting Buildings from Aerial Images Using Hierarchical Aggregation in 2D and 3D , 1998, Comput. Vis. Image Underst..

[12]  Franz Rottensteiner,et al.  Roof plane segmentation by combining multiple images and point clouds , 2010 .

[13]  Norbert Pfeifer,et al.  A Comparison of Evaluation Techniques for Building Extraction From Airborne Laser Scanning , 2009, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[14]  Volker Spreckels,et al.  DGPF Project: Evaluation of Digital Photogrammetric Camera Systems - Stereoplotting , 2010 .

[15]  Michael Cramer,et al.  The DGPF-Test on Digital Airborne Camera Evaluation - Over- view and Test Design , 2010 .

[16]  Dimitri Bulatov,et al.  Towards Euclidean Reconstruction From Video Sequences , 2008, VISAPP.

[17]  H. Gross,et al.  3D-MODELING OF URBAN STRUCTURES , 2005 .