3D Modeling from Multiple Images

Although the visual perception of 3D shape from 2D images is a basic capability of human beings, it remains challenging to computers Hence, one goal of vision research is to computationally understand and model the latent 3D scene from the captured images, and provide human-like visual system for machines In this paper, we present a method that is capable of building a realistic 3D model for the latent scene from multiple images taken at different viewpoints Specifically, the reconstruction proceeds in two steps First, generate dense depth map for each input image by a Bayesian-based inference model Second, build a complete 3D model for the latent scene by integrating all reliable 3D information embedded in the depth maps Experiments are conducted to demonstrate the effectiveness of the proposed approach.

[1]  Marc Pollefeys,et al.  Interactive 3D architectural modeling from unordered photo collections , 2008, SIGGRAPH 2008.

[2]  Andrew W. Fitzgibbon,et al.  Image-Based Rendering Using Image-Based Priors , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3]  Long Quan,et al.  Image-based tree modeling , 2007, SIGGRAPH 2007.

[4]  William E. Lorensen,et al.  Marching cubes: a high resolution 3D surface construction algorithm , 1996 .

[5]  Takeo Kanade,et al.  Constructing virtual worlds using dense stereo , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[6]  Richard Szeliski,et al.  A multi-view approach to motion and stereo , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[7]  Pau Gargallo,et al.  Bayesian 3D modeling from images using multiple depth maps , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Andrew W. Fitzgibbon,et al.  Automatic 3D Model Construction for Turn-Table Sequences , 1998, SMILE.

[9]  Reinhard Koch,et al.  3D Structure from Multiple Images of Large-Scale Environments , 1998, Lecture Notes in Computer Science.

[10]  Michael Goesele,et al.  Multi-View Stereo Revisited , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Adrian Hilton,et al.  Geometric fusion for a hand-held 3D sensor , 2000 .

[12]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[13]  Ashutosh Saxena,et al.  Make3D: Learning 3D Scene Structure from a Single Still Image , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  C. Strecha,et al.  Wide-baseline stereo from multiple views: A probabilistic account , 2004, CVPR 2004.