A Framework for Pencil-of-Points Structure-from-Motion

Our goal is to match contour lines between images and to recover structure and motion from those. The main difficulty is that pairs of lines from two images do not induce direct geometric constraint on camera motion. Previous work uses geometric attributes | orientation, length, etc. | for single or groups of lines. Our approach is based on using Pencil-of-Points (points on line) or pops for short. There are many advantages to using pops for structure-from-motion. The most important one is that, contrarily to pairs of lines, pairs of pops may constrain camera motion. We give a complete theoretical and practical framework for automatic structure-from-motion using pops | detection, matching, robust motion estimation, triangulation and bundle adjustment. For wide baseline matching, it has been shown that cross-correlation scores computed on neighbouring patches to the lines gives reliable results, given 2D homographic transformations to compensate for the pose of the patches. When cameras are known, this transformation has a 1-dimensional ambiguity. We show that when cameras are unknown, using pops lead to a 3-dimensional ambiguity, from which it is still possible to reliably compute cross-correlation. We propose linear and non-linear algorithms for estimating the fundamental matrix and for the multiple-view triangulation of pops. Experimental results are provided for simulated and real data.

[1]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[2]  Richard I. Hartley,et al.  Lines and Points in Three Views and the Trifocal Tensor , 1997, International Journal of Computer Vision.

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Ramakant Nevatia,et al.  Segment-based stereo matching , 1985, Comput. Vis. Graph. Image Process..

[5]  R. Hartley Triangulation, Computer Vision and Image Understanding , 1997 .

[6]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[7]  Zhengyou Zhang,et al.  Determining the Epipolar Geometry and its Uncertainty: A Review , 1998, International Journal of Computer Vision.

[8]  Bill Triggs,et al.  Linear Projective Reconstruction from Matching Tensors , 1996, BMVC.

[9]  Cordelia Schmid,et al.  Automatic line matching across views , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Alexandru Tupan,et al.  Triangulation , 1997, Comput. Vis. Image Underst..

[11]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[12]  Mads Nielsen,et al.  Computer Vision — ECCV 2002 , 2002, Lecture Notes in Computer Science.

[13]  Roberto Cipolla,et al.  Computer Vision — ECCV '96 , 1996, Lecture Notes in Computer Science.

[14]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[15]  Pascal Fua,et al.  Automatic Extraction of Generic House Roofs from High Resolution Aerial Imagery , 1996, ECCV.