Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion

Multi-view structure from motion (SfM) estimates the position and orientation of pictures in a common 3D coordinate frame. When views are treated incrementally, this external calibration can be subject to drift, contrary to global methods that distribute residual errors evenly. We propose a new global calibration approach based on the fusion of relative motions between image pairs. We improve an existing method for robustly computing global rotations. We present an efficient a contrario trifocal tensor estimation method, from which stable and precise translation directions can be extracted. We also define an efficient translation registration method that recovers accurate camera positions. These components are combined into an original SfM pipeline. Our experiments show that, on most datasets, it outperforms in accuracy other existing incremental and global pipelines. It also achieves strikingly good running times: it is about 20 times faster than the other global method we could compare to, and as fast as the best incremental method. More importantly, it features better scalability properties.

[1]  M. Gugat A Fast Algorithm for a Class of Generalized Fractional Programs , 1996 .

[2]  Venu Madhav Govindu,et al.  Combining two-view constraints for motion estimation , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[3]  Sang Wook Lee,et al.  Toward multiview registration in frame space , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[4]  Lionel Moisan,et al.  A Probabilistic Criterion to Detect Rigid Point Matches Between Two Images and Estimate the Fundamental Matrix , 2004, International Journal of Computer Vision.

[5]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[7]  Richard I. Hartley,et al.  L-8Minimization in Geometric Reconstruction Problems , 2004, CVPR.

[8]  Venu Madhav Govindu,et al.  Lie-algebraic averaging for globally consistent motion estimation , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[9]  R. Hartley,et al.  L/sub /spl infin// minimization in geometric reconstruction problems , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[10]  Venu Madhav Govindu,et al.  Robustness in Motion Averaging , 2006, ACCV.

[11]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[13]  Richard I. Hartley,et al.  Recovering Camera Motion Using L\infty Minimization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Yongduek Seo,et al.  A fast method to minimize L∞ error norm for geometric vision problems , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[15]  Tomás Pajdla,et al.  Robust Rotation and Translation Estimation in Multiview Reconstruction , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  R. Hartley,et al.  Multiple-View Geometry under the L 1-Norm , 2007 .

[17]  Richard I. Hartley,et al.  Multiple-View Geometry Under the {$L_\infty$}-Norm , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Steven M. Seitz,et al.  Fast algorithms for L∞ problems in multiview geometry , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Pascal Fua,et al.  On benchmarking camera calibration and multi-view stereo for high resolution imagery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Richard Szeliski,et al.  Skeletal graphs for efficient structure from motion , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Andrea Fusiello,et al.  Improving the efficiency of hierarchical structure-and-motion , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Marc Pollefeys,et al.  Practical Methods for Convex Multi-view Reconstruction , 2010, ECCV.

[23]  Richard Szeliski,et al.  A Multi-stage Linear Approach to Structure from Motion , 2010, ECCV Workshops.

[24]  Marc Pollefeys,et al.  Disambiguating visual relations using loop constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Arnak S. Dalalyan,et al.  Robust Estimation for an Inverse Problem Arising in Multiview Geometry , 2011, Journal of Mathematical Imaging and Vision.

[26]  Carl Olsson,et al.  Non-sequential structure from motion , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[27]  Peter F. Sturm,et al.  On Camera Calibration with Linear Programming and Loop Constraint Linearization , 2012, International Journal of Computer Vision.

[28]  Steven M. Seitz,et al.  Multicore bundle adjustment , 2011, CVPR 2011.

[29]  Andrew Owens,et al.  Discrete-continuous optimization for large-scale structure from motion , 2011, CVPR 2011.

[30]  Pedro E. López-de-Teruel,et al.  Reduced epipolar cost for accelerated incremental SfM , 2011, CVPR 2011.

[31]  Carl Olsson,et al.  Stable Structure from Motion for Unordered Image Collections , 2011, SCIA.

[32]  Pascal Monasse,et al.  UNORDERED FEATURE TRACKING MADE FAST AND EASY , 2011 .

[33]  Ira Kemelmacher-Shlizerman,et al.  Global Motion Estimation from Point Matches , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[34]  Jean-Philippe Pons,et al.  High Accuracy and Visibility-Consistent Dense Multiview Stereo , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Johan Fredriksson,et al.  Simultaneous Multiple Rotation Averaging Using Lagrangian Duality , 2012, ACCV.

[36]  Hongdong Li,et al.  Rotation Averaging , 2013, International Journal of Computer Vision.

[37]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.