Estimating Planar Structure in Single Images by Learning from Examples

Outdoor urban scenes typically contain many planar surfaces, which are useful for tasks such as scene reconstruction, object recognition, and navigation, especially when only a single image is available. In such situations the lack of 3D information makes finding planes difficult; but motivated by how humans use their prior knowledge to interpret new scenes with ease, we develop a method which learns from a set of training examples, in order to identify planar image regions and estimate their orientation. Because it does not rely explicitly on rectangular structures or the assumption of a ‘Manhattan world’, our method can generalise to a variety of outdoor environments. From only one image, our method reliably distinguishes planes from non-planes, and estimates their orientation accurately; this is fast and efficient, with application to a real-time

[1]  Alan F. Smeaton,et al.  An Improved Spatiogram Similarity Measure for Robust Object Localisation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[2]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[3]  Somkiat Wangsiripitak,et al.  Reducing mismatching under time-pressure by reasoning about visibility and occlusion , 2010, BMVC.

[4]  KeeChang Lee,et al.  Fast Automatic Single-View 3-d Reconstruction of Urban Scenes , 2008, ECCV.

[5]  Ashutosh Saxena,et al.  High speed obstacle avoidance using monocular vision and reinforcement learning , 2005, ICML.

[6]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7]  Jana Kosecka,et al.  Detection and matching of rectilinear structures , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[9]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[10]  Ian D. Reid,et al.  Single View Metrology , 2000, International Journal of Computer Vision.

[11]  Dale Purves,et al.  Comparison of Bayesian and empirical ranking approaches to visual perception. , 2006, Journal of theoretical biology.

[12]  Tao Xiong,et al.  A combined SVM and LDA approach for classification , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[13]  Stephen J. Maybank,et al.  A Method for Interactive 3D Reconstruction of Piecewise Planar Objects from Single Images , 1999, BMVC.

[14]  Walterio W. Mayol-Cuevas,et al.  Discovering Higher Level Structure in Visual SLAM , 2008, IEEE Transactions on Robotics.

[15]  Adrien Bartoli,et al.  A random sampling strategy for piecewise planar scene segmentation , 2007, Comput. Vis. Image Underst..

[16]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[17]  Stanley T. Birchfield,et al.  Spatiograms versus histograms for region-based tracking , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18]  Alexei A. Efros,et al.  Recovering Surface Layout from an Image , 2007, International Journal of Computer Vision.

[19]  Antonio Torralba,et al.  Depth Estimation from Image Structure , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Seungjin Choi,et al.  Algorithms for orthogonal nonnegative matrix factorization , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[21]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[22]  Andrew Calway,et al.  Unifying Planar and Point Mapping in Monocular SLAM , 2010, BMVC.

[23]  Pietro Perona,et al.  A sparse object category model for efficient learning and exhaustive recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[24]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, CVPR.

[25]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26]  Jana Kosecka,et al.  Extraction, matching and pose recovery based on dominant rectangular structures , 2003, HLK.