Fast and Robust Generation of Semantic Urban Terrain Models from UAV Video Streams

We present an algorithm for extracting Level of Detail 2 (LOD2) building models from video streams captured by Unmanned Aerial Vehicles (UAVs). Typically, such imagery is of limited radiometric quality, but the surface is captured with large redundancy. The first contribution of this paper is a novel algorithm that exploits this redundancy for precise depth computation. This is realized by fusing consistent depth estimates across single stereo models and generating a 2.5D elevation map from the resulting point clouds. Disparity maps are derived by a coarse-to-fine Semi-Global Matching (SGM) method that performs well on noisy imagery. The second contribution concerns a challenging step of context-based urban terrain modeling: the extraction of dominant planes for building reconstruction. Because of noisy data and complicated roof structures, both the dominant plane parameters and the initial values for the support sets of the planes are obtained by the J-Linkage algorithm. An improved point-to-plane labeling is presented that encourages the assignment of proximate points to the same plane. This is accomplished by non-local, Markov Random Field (MRF)-based optimization and segmentation of color information. The potential and the limitations of the proposed methods are shown using a UAV video sequence of limited radiometric quality.
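
The idea of fusing consistent depth estimates from redundant stereo models can be illustrated with a minimal sketch. The abstract does not state the fusion rule, so everything below is an assumption made for illustration: the depth maps are taken to be already registered to a common reference view, an estimate counts as consistent if it lies within a tolerance `tol` of the per-pixel median, and a pixel is kept only if at least `min_support` estimates agree. The function names and parameters are hypothetical.

```python
from statistics import median


def fuse_depths(estimates, tol=0.5, min_support=2):
    """Fuse redundant depth estimates for a single pixel.

    estimates: depth values from several stereo models (None = no estimate).
    Returns the mean of all estimates within `tol` of the median, or None
    if fewer than `min_support` estimates are mutually consistent.
    NOTE: this median-based rule is an illustrative assumption, not the
    rule used in the paper.
    """
    valid = [d for d in estimates if d is not None]
    if len(valid) < min_support:
        return None
    m = median(valid)
    inliers = [d for d in valid if abs(d - m) <= tol]
    if len(inliers) < min_support:
        return None
    return sum(inliers) / len(inliers)


def fuse_depth_maps(depth_maps, tol=0.5, min_support=2):
    """Apply fuse_depths pixel by pixel to a list of equally sized depth maps.

    Each depth map is a list of rows; all maps are assumed to be registered
    to the same reference view.
    """
    rows, cols = len(depth_maps[0]), len(depth_maps[0][0])
    return [
        [fuse_depths([dm[r][c] for dm in depth_maps], tol, min_support)
         for c in range(cols)]
        for r in range(rows)
    ]
```

For example, `fuse_depths([10.0, 10.2, 9.8, 25.0])` discards the outlier 25.0 and averages the three agreeing values, while a pixel observed by only one stereo model is rejected. Requiring agreement across several models is one simple way to trade the large redundancy of video data against the noise of individual disparity maps.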
