Large-Scale, Metric Structure From Motion for Unordered Light Fields

This paper presents a large scale, metric Structure from Motion (SfM) pipeline for generalised cameras with overlapping fields-of-view, and demonstrates it using Light Field (LF) images. We build on recent developments in algorithms for absolute and relative pose recovery for generalised cameras and couple them with multi-view triangulation in a robust framework that advances the state-of-the-art on 3D reconstruction from LFs in several ways. First, our framework can recover the scale of a scene. Second, it is concerned with unordered sets of LF images, meticulously determining the order in which images should be considered. Third, it can scale to datasets with hundreds of LF images. Finally, it recovers 3D scene structure while abstaining from triangulating using very small baselines. Our approach outperforms the state-of-the-art, as demonstrated by real-world experiments with variable size datasets.

[1]  Manolis I. A. Lourakis,et al.  Accurate Scale Factor Estimation in 3D Reconstruction , 2013, CAIP.

[2]  Marc Pollefeys,et al.  Minimal solutions for the multi-camera pose estimation problem , 2015, Int. J. Robotics Res..

[3]  Robert Pless,et al.  Using many cameras as one , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[4]  David Nistér,et al.  A Minimal Solution to the Generalised 3-Point Pose Problem , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5]  P. Hanrahan,et al.  Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[6]  Bastian Goldlücke,et al.  Layered Scene Reconstruction from Multiple Light Field Camera Views , 2016, ACCV.

[7]  Xiaowei Zhou,et al.  Fast Multi-image Matching via Density-Based Clustering , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[8]  Philip H. S. Torr An assessment of information criteria for motion model selection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  In-So Kweon,et al.  Accurate depth map estimation from a lenslet light field camera , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Wei Yang,et al.  Ray Space Features for Plenoptic Structure-from-Motion , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11]  Andrea Fusiello,et al.  Hierarchical structure-and-motion recovery from uncalibrated images , 2015, Comput. Vis. Image Underst..

[12]  Viktor Larsson,et al.  Efficient Solvers for Minimal Problems by Syzygy-Based Reduction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  E. Adelson,et al.  The Plenoptic Function and the Elements of Early Vision , 1991 .

[14]  Torsten Sattler,et al.  Minimal Solvers for Generalized Pose and Scale Estimation from Two Rays and One Point , 2016, ECCV.

[15]  Shree K. Nayar,et al.  The Raxel Imaging Model and Ray-Based Calibration , 2005, International Journal of Computer Vision.

[16]  Andrew Zisserman,et al.  Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  Marc Levoy,et al.  Light Fields and Computational Imaging , 2006, Computer.

[19]  Roland Siegwart,et al.  Using multi-camera systems in robotics: Efficient solutions to the NPnP problem , 2013, 2013 IEEE International Conference on Robotics and Automation.

[20]  Stefan B. Williams,et al.  Decoding, Calibration and Rectification for Lenselet-Based Plenoptic Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[22]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[23]  In-So Kweon,et al.  Geometric Calibration of Micro-Lens-Based Light Field Cameras Using Line Features , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Andrea Fusiello,et al.  Practical and Efficient Multi-view Matching , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[25]  SnavelyNoah,et al.  Modeling the World from Internet Photo Collections , 2008 .

[26]  Hongdong Li,et al.  Efficient Computation of Relative Pose for Multi-camera Systems , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Torsten Sattler,et al.  Large-scale outdoor 3D reconstruction on a mobile device , 2017, Comput. Vis. Image Underst..

[28]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[29]  Jan-Michael Frahm,et al.  Reconstructing the world* in six days , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[31]  Bastian Goldlücke,et al.  On Linear Structure from Motion for Light Field Cameras , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  Peter F. Sturm,et al.  A generic structure-from-motion framework , 2006, Comput. Vis. Image Underst..

[33]  Anders Heyden,et al.  Covariance Propagation and Next Best View Planning for 3D Reconstruction , 2012, ECCV.

[34]  Manolis I. A. Lourakis,et al.  Modified Rodrigues Parameters: An Efficient Representation of Orientation in 3D Vision and Graphics , 2018, Journal of Mathematical Imaging and Vision.

[35]  Michael Goesele,et al.  Scene Reconstruction and Visualization From Community Photo Collections , 2010, Proceedings of the IEEE.

[36]  Tobias Höllerer,et al.  Model Estimation and Selection towards Unconstrained Real-Time Tracking and Mapping. , 2013, IEEE transactions on visualization and computer graphics.

[37]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[38]  Hongdong Li,et al.  A linear approach to motion estimation using generalized camera models , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Karl Johan Åström,et al.  Solutions to Minimal Generalized Relative Pose Problems , 2005 .

[40]  Thorsten Thormählen,et al.  Keyframe Selection for Camera Motion and Structure Estimation from Multiple Views , 2004, ECCV.

[41]  Lennart Wietzke,et al.  Single lens 3D-camera with extended depth-of-field , 2012, Electronic Imaging.

[42]  Manolis I. A. Lourakis,et al.  SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[43]  Qionghai Dai,et al.  Light Field Image Processing: An Overview , 2017, IEEE Journal of Selected Topics in Signal Processing.

[44]  Michel Dhome,et al.  Real Time Localization and 3D Reconstruction , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[45]  Andrew Zisserman,et al.  Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[46]  Laurent Kneip,et al.  The generalized relative pose and scale problem: View-graph fusion via 2D-2D registration , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[47]  Hongdong Li,et al.  UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability , 2014, ECCV.

[48]  Kiriakos N. Kutulakos,et al.  What does an aberrated photo tell us about the lens and the scene? , 2013, IEEE International Conference on Computational Photography (ICCP).

[49]  David C. Hoaglin,et al.  Volume 16: How to Detect and Handle Outliers , 2013 .

[50]  Sébastien Ourselin,et al.  Corner-Based Geometric Calibration of Multi-focus Plenoptic Cameras , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[51]  Laurent Kneip,et al.  OpenGV: A unified and generalized approach to real-time calibrated geometric vision , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[52]  Tobias Höllerer,et al.  Model Estimation and Selection towardsUnconstrained Real-Time Tracking and Mapping , 2014, IEEE Transactions on Visualization and Computer Graphics.