Single View Corridor Reconstruction

We present an autonomous algorithm for 3d reconstruction fr om a single image of an indoor scene. Most work on 3d reconstruction from vision uses multiple ima ges (for example, binocular vision/stereopsis), and recovers depths using triangulation. However, the rang e t which binocular vision is accurate is limited by the “baseline” distance between the two cameras. In c ontrast, humans are often able to look at a monocular (single-camera) image, and guess how far away obj ects were from this camera. Humans do this by bringing to bear on the problem significant prior know ledge about the world, which is used to resolve the ambiguities intrinsic to monocular vision. Foc using on the problem of 3d reconstruction in indoor scenes, in this report we present an algorithm for dep th recovery based on precise segmentation of the floor in an image. Our approach encodes prior knowledge about the environment such that the floor is flat and that walls are vertical, and is based on recogn izing the floor and the wall regions in the image. We also present experimental resulting showing our a lgorithm giving good 3d reconstruction performance using monocular images of indoor scenes.

[1]  Jaime López-Krahe,et al.  Contribution to the Determination of Vanishing Points Using Hough Transform , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[3]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[4]  Roberto Cipolla,et al.  Camera Calibration from Vanishing Points in Image of Architectural Scenes , 1999, BMVC.

[5]  Jitendra Malik,et al.  Learning to Detect Natural Image Boundaries Using Brightness and Texture , 2002, NIPS.

[6]  Ping-Sing Tsai,et al.  Shape from Shading: A Survey , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[8]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[9]  Mei Han,et al.  Interactive construction of 3D models from panoramic mosaics , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[10]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[11]  Paul Debevec,et al.  Modeling and Rendering Architecture from Photographs , 1996, SIGGRAPH 1996.

[12]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[13]  H.M. Wechsler,et al.  Digital image processing, 2nd ed. , 1981, Proceedings of the IEEE.