Automatic 3D model building from video sequences

We describe progress on automating the construction of 3D geometric models from video sequences. The system we are developing allows a person to walk around a room with a video camera, and from the video sequence alone extract a 3D model of the room. The room can then be graphically rendered together with possible arrangements of previously modelled objects, e.g. CAD models of furniture. The system is designed to ease model acquisition for virtual reality and telepresence applications, however the approach also has application in model based image coding. The approach is based on recent work in computer vision where image primitives are extracted and matched through an image sequence. For example, the simultaneous estimation of epipolar geometry and image corner correspondences. The matching techniques are both robust (detecting and discarding mismatches) and fully automatic. No knowledge of the camera (e.g. its focal length) or motion is required. 3D structure is computed from the matched image primitives (corners, lines and texture). The structure is then triangulated and texture is mapped onto the triangles from intensities in the images. Experimental results are provided for a variety of scenes, including isolated objects acquired with a standard CCD camera, and outdoor scenes acquired with a hand-held camcorder.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  Andrew P. Witkin,et al.  Analyzing Oriented Patterns , 1985, IJCAI.

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Richard S. Weiss,et al.  Reconstruction of Surfaces from Profiles , 1987, ICCV 1987.

[5]  Christopher G. Harris,et al.  Determination of Ego-Motion from Matched Points , 1987, Alvey Vision Conference.

[6]  Christopher G. Harris,et al.  3D positional integration from image sequences , 1988, Image Vis. Comput..

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  Régis Vaillant Using Occluding Contours for 3D Object Modeling , 1990, ECCV.

[9]  Richard I. Hartley,et al.  Estimation of Relative Camera Positions for Uncalibrated Cameras , 1992, ECCV.

[10]  Olivier Faugeras,et al.  3D Dynamic Scene Analysis , 1992 .

[11]  Olivier D. Faugeras,et al.  What can be seen in three dimensions with an uncalibrated stereo rig , 1992, ECCV.

[12]  Long Quan,et al.  Relative 3D Reconstruction Using Multiple Uncalibrated Images , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Reinhard Koch,et al.  Dynamic 3-D Scene Analysis Through Synthesis Feedback Control , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Luc Van Gool,et al.  Affine Reconstruction from Perspective Image Pairs Obtained by a Translating Camera , 1993, Applications of Invariance in Computer Vision.

[15]  Luc Van Gool,et al.  Determination of Optical Flow and its Discontinuities using Non-Linear Diffusion , 1994, ECCV.

[16]  Paul A. Beardsley,et al.  Euclidean Structure from Uncalibrated Images , 1994, BMVC.

[17]  Paul A. Beardsley,et al.  Navigation using Affine Structure from Motion , 1994, ECCV.

[18]  Amnon Shashua,et al.  Trilinearity in Visual Recognition by Alignment , 1994, ECCV.

[19]  Richard I. Hartley,et al.  A linear method for reconstruction from lines and points , 1995, Proceedings of IEEE International Conference on Computer Vision.

[20]  Paul A. Beardsley,et al.  3D Model Acquisition from Extended Image Sequences , 1996, ECCV.

[21]  Luc Van Gool,et al.  Euclidean 3D Reconstruction from Image Sequences with Variable Focal Lenghts , 1996, ECCV.