Aerial Reconstructions via Probabilistic Data Fusion

We propose an integrated probabilistic model for multi-modal fusion of aerial imagery, LiDAR data, and (optional) GPS measurements. The model allows for analysis and dense reconstruction (in terms of both geometry and appearance) of large 3D scenes. An advantage of the approach is that it explicitly models uncertainty and allows for missing data. As compared with image-based methods, dense reconstructions of complex urban scenes are feasible with fewer observations. Moreover, the proposed model allows one to estimate absolute scale and orientation and reason about other aspects of the scene, e.g., detection of moving objects. As formulated, the model lends itself to massively-parallel computing. We exploit this in an efficient inference scheme that utilizes both general purpose and domain-specific hardware components. We demonstrate results on large-scale reconstruction of urban terrain from LiDAR and aerial photography data.

[1]  A. Baumgartner,et al.  Fusion of LIDAR data and aerial imagery for automatic reconstruction of building surfaces , 2003, 2003 2nd GRSS/ISPRS Joint Workshop on Remote Sensing and Data Fusion over Urban Areas.

[2]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[3]  Richard Szeliski,et al.  Geometrically Constrained Structure from Motion: Points on Planes , 1998, SMILE.

[4]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[5]  Pascal Fua,et al.  Efficient large-scale multi-view stereo for ultra high-resolution image sets , 2011, Machine Vision and Applications.

[6]  John P. Kerekes,et al.  3D Scene Reconstruction through a Fusion of Passive Video and Lidar Imagery , 2007, 36th Applied Imagery Pattern Recognition Workshop (aipr 2007).

[7]  Tee-Ann Teo,et al.  Fusion of lidar data and optical imagery for building modeling , 2004 .

[8]  T. Arikawa,et al.  Reconstruction of 3D urban model using range image and aerial image , 2001, IGARSS 2001. Scanning the Present and Resolving the Future. Proceedings. IEEE 2001 International Geoscience and Remote Sensing Symposium (Cat. No.01CH37217).

[9]  Olivier Faugeras,et al.  Motion and Structure from Motion in a piecewise Planar Environment , 1988, Int. J. Pattern Recognit. Artif. Intell..

[10]  Harry Shum,et al.  A linear algorithm for camera self-calibration, motion and structure recovery for multi-planar scenes from two perspective images , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[11]  Anna Gerber,et al.  Opengl Programming Guide The Official Guide To Learning Opengl Versions 3 0 And 3 1 , 2016 .

[12]  Ulrich Neumann,et al.  2.5D Dual Contouring: A Robust Approach to Creating Building Models from Aerial LiDAR Point Clouds , 2010, ECCV.

[13]  Rama Chellappa,et al.  Structure from Motion Using Sequential Monte Carlo Methods , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14]  Fadi Dornaika,et al.  A Featureless Approach to 3D Polyhedral Building Modeling from Aerial Images , 2010, Sensors.

[15]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[16]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[17]  Sunil Arya,et al.  An optimal algorithm for approximate nearest neighbor searching fixed dimensions , 1998, JACM.

[18]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[19]  Paul A. Beardsley,et al.  Sequential Updating of Projective and Affine Structure from Motion , 1997, International Journal of Computer Vision.

[20]  Gang Zhao,et al.  Rapid reconstruction of 3D building models from aerial images and LiDAR data , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[21]  Horst Steuer Height snakes: 3D building reconstruction from aerial image and laser scanner data , 2011, 2011 Joint Urban Remote Sensing Event.

[22]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[23]  Manolis I. A. Lourakis,et al.  SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[24]  Michael M. Kazhdan,et al.  Screened poisson surface reconstruction , 2013, TOGS.

[25]  Andrew Zisserman,et al.  Automatic reconstruction of piecewise planar models from multiple views , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[26]  Adrien Bartoli,et al.  Constrained Structure and Motion From Multiple Uncalibrated Views of a Piecewise Planar Scene , 2003, International Journal of Computer Vision.

[27]  Peter F. Sturm,et al.  A Factorization Based Algorithm for Multi-Image Projective Structure and Motion , 1996, ECCV.

[28]  O. D. Faugeras,et al.  Camera Self-Calibration: Theory and Experiments , 1992, ECCV.

[29]  Thomas S. Huang,et al.  Motion and structure from feature correspondences: a review , 1994, Proc. IEEE.

[30]  Radu Horaud,et al.  Projective Structure and Motion from Two Views of a Piecewise Planar Scene , 2001, ICCV.

[31]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Fuan Tsai,et al.  Construction and visualization of photo-realistic three-dimensional digital city , 2009, 2009 Joint Urban Remote Sensing Event.

[33]  George Vosselman,et al.  Reconstruction of 3D building models from aerial images and maps , 2003 .

[34]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[35]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[36]  Andrew W. Fitzgibbon,et al.  Automatic Camera Recovery for Closed or Open Image Sequences , 1998, ECCV.

[37]  Luc Van Gool,et al.  Semi-automatic modelling of urban buildings from high resolution aerial imagery , 1998, Proceedings. Computer Graphics International (Cat. No.98EX149).

[38]  Jean Ponce,et al.  Computer Vision: A Modern Approach , 2002 .

[39]  Yongjun Zhang,et al.  3D Building Modelling with Digital Map, Lidar Data and Video Image Sequences , 2005 .

[40]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[41]  Richard Szeliski,et al.  Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[42]  Jean-Philippe Pons,et al.  High Accuracy and Visibility-Consistent Dense Multiview Stereo , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Rama Chellappa,et al.  Structure from Motion Using Sequential Monte Carlo Methods , 2004, International Journal of Computer Vision.

[44]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[45]  Stan Sclaroff,et al.  Recursive estimation of motion and planar structure , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[46]  Yirong Wu,et al.  Building Recognition and Reconstruction from Aerial Imagery and LIDAR Data , 2006, 2006 CIE International Conference on Radar.

[47]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[48]  Armin B. Cremers,et al.  Extracting Buildings from Aerial Images Using Hierarchical Aggregation in 2D and 3D , 1998, Comput. Vis. Image Underst..

[49]  Frank Dellaert,et al.  Structure from motion without correspondence , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[50]  Tee-Ann Teo,et al.  Building reconstruction from LIDAR data and aerial imagery , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[51]  Daniel Cremers,et al.  Superresolution texture maps for multiview reconstruction , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[52]  David A. Forsyth,et al.  Bayesian structure from motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[53]  Jan-Michael Frahm,et al.  Piecewise planar and non-planar stereo for urban scene reconstruction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[54]  Alexandru Tupan,et al.  Triangulation , 1997, Comput. Vis. Image Underst..

[55]  Roberto Cipolla,et al.  Combining Single View Recognition and Multiple View Stereo for Architectural Scenes , 2001, ICCV.

[56]  C. Ellum INTEGRATION OF RAW GPS MEASUREMENTS INTO A BUNDLE ADJUSTMENT , 2000 .

[57]  Michael J. Black,et al.  Lie Bodies: A Manifold Representation of 3D Human Shape , 2012, ECCV.

[58]  John W. Fisher,et al.  Automatic registration of LIDAR and optical images of urban scenes , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[59]  Suya You,et al.  Integrating LiDAR, Aerial Image and Ground Images for Complete Urban Building Modeling , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[60]  M. Hodgson,et al.  Accuracy of Airborne Lidar-Derived Elevation: Empirical Assessment and Error Budget , 2004 .

[61]  Richard Szeliski,et al.  Piecewise planar stereo for image-based rendering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[62]  Gérard G. Medioni,et al.  Aerial 3D reconstruction with line-constrained dynamic programming , 2011, 2011 International Conference on Computer Vision.

[63]  Richard Szeliski,et al.  Computer Vision - Algorithms and Applications , 2011, Texts in Computer Science.