Fast Automatic Single-View 3-d Reconstruction of Urban Scenes

We consider the problem of estimating 3-d structure from a single still image of an outdoor urban scene. Our goal is to efficiently create 3-d models which are visually pleasant. We chose an appropriate 3-d model structure and formulate the task of 3-d reconstruction as model fitting problem. Our 3-d models are composed of a number of vertical walls and a ground plane, where ground-vertical boundary is a continuous polyline. We achieve computational efficiency by special preprocessing together with stepwise search of 3-d model parameters dividing the problem into two smaller sub-problems on chain graphs. The use of Conditional Random Field models for both problems allows to various cues. We infer orientation of vertical walls of 3-d model vanishing points.

[1]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[2]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Ashutosh Saxena,et al.  3-D Depth Reconstruction from a Single Still Image , 2007, International Journal of Computer Vision.

[4]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Honglak Lee,et al.  A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7]  Anton Konushin,et al.  INTERACTIVE IMAGE-BASED URBAN MODELLING , 2007 .

[8]  Wei Zhang,et al.  Video Compass , 2002, ECCV.

[9]  Rich Caruana,et al.  Obtaining Calibrated Probabilities from Boosting , 2005, UAI.

[10]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  L. Wasserman All of Nonparametric Statistics , 2005 .

[12]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[13]  Vladimir Vezhnevets,et al.  Learning class specific edges for vanishing point estimation , 2007 .

[14]  Ashutosh Saxena,et al.  Make3D: Learning 3D Scene Structure from a Single Still Image , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Ashutosh Saxena,et al.  Learning Depth from Single Monocular Images , 2005, NIPS.

[16]  Taghi M. Khoshgoftaar,et al.  Experimental perspectives on learning from imbalanced data , 2007, ICML '07.

[17]  Andrew McCallum,et al.  Piecewise Training for Undirected Models , 2005, UAI.

[18]  Ken-ichi Anjyo,et al.  Tour Into the Picture using a Vanishing Line and its Extension to Panoramic Images , 2001, Comput. Graph. Forum.

[19]  Alexei A. Efros,et al.  Automatic photo pop-up , 2005, ACM Trans. Graph..

[20]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[21]  Honglak Lee,et al.  Automatic Single-Image 3d Reconstructions of Indoor Manhattan World Scenes , 2007, ISRR.

[22]  Jiri Matas,et al.  Locally Optimized RANSAC , 2003, DAGM-Symposium.

[23]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[24]  Alexei A. Efros,et al.  Recovering Occlusion Boundaries from a Single Image , 2007, 2007 IEEE 11th International Conference on Computer Vision.