Automated 3D object modeling from aerial video imagery

Research in physically accurate 3D modeling of a scene is gaining momentum because of its far reaching applications in civilian and defense sectors. The modeled 3D scene must conform both geometrically and spectrally to the real world for all the applications. Geometric modeling of a scene can be achieved in many ways of which the two most popular methods are a) using multiple 2D passive images of the scene also called as stereo vision and b) using 3D point clouds like Lidar (Light detection and ranging) data. In this research work, we derive the 3D models of objects in a scene using passive aerial video imagery. At present, this geometric modeling requires a lot of manual intervention due to a variety of factors like sensor noise, low contrast conditions during image capture, etc. Hence long time periods, in the order of weeks and months, are required to model even a small scene. This thesis focuses on automating the process of geometric modeling of objects in a scene from passive aerial video imagery. The aerial video frames are stitched into stereo mosaics. These stereo mosaics not only provide the elevation information of a scene but also act as good 3D visualization tools. The 3D information obtained from the stereo mosaics is used to identify the various 3D objects, especially man-made buildings using probabilistic inference provided by Bayesian Networks. The initial 3D building models are further optimized by projecting them on to the individual video frames. The limitations of the state-of-art technology in attaining these goals are presented along with the techniques to overcome them. The improvement that can be achieved in the accuracy of the 3D models when Lidar data is fused with aerial video during the object identification process is also examined.

[1]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[2]  Yael Pritch,et al.  Omnistereo: Panoramic Stereo Imaging , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  P. Wolf,et al.  Elements of Photogrammetry(with Applications in GIS) , 2000 .

[4]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[5]  Eli Saber,et al.  Automatic color image segmentation by dynamic region growth and multimodal merging of color and texture information , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[7]  Chung-Lin Huang,et al.  A new motion compensation method for image sequence coding using hierarchical grid interpolation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[8]  Stuart J. Russell,et al.  Adaptive Probabilistic Networks with Hidden Variables , 1997, Machine Learning.

[9]  Ellery Eells,et al.  Choices: An Introduction to Decision Theory. , 1990 .

[10]  Prudhvi Gurram,et al.  Extraction of digital elevation map from parallel-perspective stereo mosaics , 2008, Electronic Imaging.

[11]  Jan P. Allebach,et al.  Sequential linear interpolation of multidimensional functions , 1997, IEEE Trans. Image Process..

[12]  George Vosselman,et al.  Reconstruction of 3D building models from aerial images and maps , 2003 .

[13]  Kiyoharu Aizawa,et al.  Model-based image coding advanced video coding techniques for very low bit-rate applications , 1995, Proc. IEEE.

[14]  Andrew Zisserman,et al.  Automatic reconstruction of piecewise planar models from multiple views , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[15]  Scott D. Brown,et al.  Advances in wide-area hyperspectral image simulation , 2003, SPIE Defense + Commercial Sensing.

[16]  Rajiv Gupta,et al.  Linear Pushbroom Cameras , 1994, ECCV.

[17]  A. Murat Tekalp,et al.  Hierarchical 2-D mesh representation, tracking, and compression for object-based video , 1999, IEEE Trans. Circuits Syst. Video Technol..

[18]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  R. Okafor Maximum likelihood estimation from incomplete data , 1987 .

[20]  Shmuel Peleg,et al.  Panoramic mosaics by manifold projection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Stephen R. Lach,et al.  Semi-Automated DIRSIG scene modeling from 3D lidar and passive imagery , 2008 .

[22]  Harry N. Gross,et al.  An Advanced Synthetic Image Generation Model and its Application to Multi/Hyperspectral Algorithm Development , 1999 .

[23]  Allen R. Hanson,et al.  Ascender II, a Visual Framework for 3D Reconstruction , 1999, ICVS.

[24]  A. Murat Tekalp,et al.  Fusion of color and edge information for improved segmentation and edge linking , 1997, Image Vis. Comput..

[25]  Jie Cheng,et al.  An Algorithm for Bayesian Belief Network Construction from Data , 2004 .

[26]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[27]  Ramakant Nevatia,et al.  Expandable Bayesian Networks for 3D Object Description from Multiple Views and Multiple Mode Inputs , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Allen R. Hanson,et al.  Generalized parallel-perspective stereo mosaics from airborne video , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[30]  Bernd Hamann,et al.  An Edge-Preserving, Data-Dependent Triangulation Scheme for Hierarchical Rendering , 1997, Scientific Visualization Conference (dagstuhl '97).

[31]  Takeo Kanade,et al.  Image-consistent surface triangulation , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[32]  Allen R. Hanson,et al.  Decision Making and Uncertainty Management in a 3D Reconstruction System , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Shmuel Peleg,et al.  Stereo panorama with a single camera , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[34]  Eli Saber,et al.  Unsupervised color image segmentation using a dynamic color gradient thresholding algorithm , 2008, Electronic Imaging.

[35]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[36]  Armin B. Cremers,et al.  Extracting Buildings from Aerial Images Using Hierarchical Aggregation in 2D and 3D , 1998, Comput. Vis. Image Underst..

[37]  A. Murat Tekalp,et al.  Closed-form connectivity-preserving solutions for motion compensation using 2-D meshes , 1997, IEEE Trans. Image Process..

[38]  Ziya Arnavut,et al.  Fault diagnosis for airplane engines using Bayesian networks and distributed particle swarm optimization , 2007, Parallel Comput..

[39]  Hiroshi Harashima,et al.  Motion compensation based on spatial transformations , 1994, IEEE Trans. Circuits Syst. Video Technol..

[40]  Richard Szeliski,et al.  Stereo reconstruction from multiperspective panoramas , 2004 .

[41]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[42]  E. Baltsavias,et al.  3-D Building Reconstruction with ARUBA: A Qualitative and Quantitative Evaluation , 1997 .

[43]  John P. Kerekes,et al.  3D Scene Reconstruction through a Fusion of Passive Video and Lidar Imagery , 2007, 36th Applied Imagery Pattern Recognition Workshop (aipr 2007).

[44]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[45]  Yao Wang,et al.  Active mesh-a feature seeking and tracking image sequence representation scheme , 1994, IEEE Trans. Image Process..

[46]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[47]  C. Brenner Building reconstruction from images and laser scanning , 2005 .

[48]  Jan Flusser,et al.  An adaptive method for image registration , 1992, Pattern Recognit..