论文信息 - Geometric context from a single image

Geometric context from a single image

Many computer vision algorithms limit their performance by ignoring the underlying 3D geometric structure in the image. We show that we can estimate the coarse geometric properties of a scene by learning appearance-based models of geometric classes, even in cluttered natural scenes. Geometric classes describe the 3D orientation of an image region with respect to the camera. We provide a multiple-hypothesis framework for robustly estimating scene structure from a single image and obtaining confidences for each geometric label. These confidences can then be used to improve the performance of many other applications. We provide a thorough quantitative evaluation of our algorithm on a set of outdoor images and demonstrate its usefulness in two applications: object detection and automatic single-view reconstruction.

Alexei A. Efros | Derek Hoiem | Martial Hebert | M. Hebert | Derek Hoiem

[1] David G. Stork,et al. Pattern Classification , 1973 .

[2] 大田友一,et al. Knowledge-based interpretation of outdoor natural color scenes , 1985 .

[3] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4] Barry T. Thomas,et al. Head-Mounted Mobility Aid for Low Vision Using Scene Classification Techniques , 1998, Int. J. Virtual Real..

[5] Reinhard Koch,et al. Self-Calibration and Metric Reconstruction Inspite of Varying and Unknown Intrinsic Camera Parameters , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[6] Antonio Criminisi,et al. Creating Architectural Models from Images , 1999, Comput. Graph. Forum.

[7] J. Friedman. Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[8] Alan L. Yuille,et al. Statistical cues for domain specific image segmentation with performance analysis , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[9] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[10] Wei Zhang,et al. Video Compass , 2002, ECCV.

[11] Jiebo Luo,et al. Probabilistic spatial context models for scene content understanding , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[12] Andrew Zisserman,et al. Multiple View Geometry in Computer Vision (2nd ed) , 2003 .

[13] Feng Han,et al. Bayesian reconstruction of 3D shapes and scenes from a single image , 2003, First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003..

[14] Jitendra Malik,et al. Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15] Antonio Torralba,et al. Graphical Model For Recognizing Scenes and Objects. , 2003, NIPS 2003.

[16] Paul A. Viola,et al. Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17] Martial Hebert,et al. Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[18] R. Zemel,et al. Multiscale conditional random fields for image labeling , 2004, CVPR 2004.

[19] Henry Schneiderman,et al. Learning a restricted Bayesian network for object detection , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[20] Miguel Á. Carreira-Perpiñán,et al. Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[21] Pietro Perona,et al. Is bottom-up attention useful for object recognition? , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[22] Nando de Freitas,et al. A Statistical Model for General Contextual Object Recognition , 2004, ECCV.

[23] Ian D. Reid,et al. Single View Metrology , 2000, International Journal of Computer Vision.

[24] Antonio Torralba,et al. Contextual Models for Object Detection Using Boosted Random Fields , 2004, NIPS.

[25] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[26] W. Grimson,et al. Improving object classification in far-field video , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[27] Yoram Singer,et al. Logistic Regression, AdaBoost and Bregman Distances , 2000, Machine Learning.

[28] Antonio Torralba,et al. Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[29] Allen Y. Yang,et al. On Symmetry and Multiple-View Geometry: Structure, Pose, and Calibration from a Single Image , 2004, International Journal of Computer Vision.

[30] Cordelia Schmid,et al. Human Detection Based on a Probabilistic Assembly of Robust Part Detectors , 2004, ECCV.

[31] Paul A. Viola,et al. Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[32] Alexei A. Efros,et al. Automatic photo pop-up , 2005, ACM Trans. Graph..

[33] Alexei A. Efros,et al. Recovering Surface Layout from an Image , 2007, International Journal of Computer Vision.