Globally-Optimal Inlier Set Maximisation for Camera Pose and Correspondence Estimation

Estimating the 6-DoF pose of a camera from a single image relative to a 3D point-set is an important task for many computer vision applications. Perspective-n-point solvers are routinely used for camera pose estimation, but are contingent on the provision of good quality 2D–3D correspondences. However, finding cross-modality correspondences between 2D image points and a 3D point-set is non-trivial, particularly when only geometric information is known. Existing approaches to the simultaneous pose and correspondence problem use local optimisation, and are therefore unlikely to find the optimal solution without a good pose initialisation, or introduce restrictive assumptions. Since a large proportion of outliers and many local optima are common for this problem, we instead propose a robust and globally-optimal inlier set maximisation approach that jointly estimates the optimal camera pose and correspondences. Our approach employs branch-and-bound to search the 6D space of camera poses, guaranteeing global optimality without requiring a pose prior. The geometry of SE(3) is used to find novel upper and lower bounds on the number of inliers and local optimisation is integrated to accelerate convergence. The algorithm outperforms existing approaches on challenging synthetic and real datasets, reliably finding the global optimum, with a GPU implementation greatly reducing runtime.

[1]  Éric Marchand,et al.  Pose Estimation for Augmented Reality: A Hands-On Survey , 2016, IEEE Transactions on Visualization and Computer Graphics.

[2]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[3]  Pascal Fua,et al.  Worldwide Pose Estimation Using 3D Point Clouds , 2012, ECCV.

[4]  Alexei A. Efros,et al.  Seeing 3D Chairs: Exemplar Part-Based 2D-3D Alignment Using a Large Dataset of CAD Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Pascal Vasseur,et al.  Robust and Optimal Sum-of-Squares-Based Point-to-Plane Registration of Image Sets and Structured Scenes , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[6]  Lars Petersson,et al.  Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Eric Brachmann,et al.  DSAC — Differentiable RANSAC for Camera Localization , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[9]  Carl Olsson,et al.  Optimal Estimation of Perspective Camera Pose , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[10]  Torsten Sattler,et al.  Fast image-based localization using direct 2D-to-3D matching , 2011, 2011 International Conference on Computer Vision.

[11]  A. Land,et al.  An Automatic Method for Solving Discrete Programming Problems , 1960, 50 Years of Integer Programming.

[12]  J. Kiefer,et al.  Sequential minimax search for a maximum , 1953 .

[13]  Jiri Matas,et al.  Optimal Randomized RANSAC , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Vincent Lepetit,et al.  Pose Priors for Simultaneously Solving Alignment and Correspondence , 2008, ECCV.

[15]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[16]  Lars Petersson,et al.  A Multi-modal Graphical Model for Scene Analysis , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.

[17]  Kostas Daniilidis,et al.  Correspondence-free Structure from Motion , 2007, International Journal of Computer Vision.

[18]  Fredrik Kahl,et al.  Optimal Geometric Fitting under the Truncated L2-Norm , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Robert M. Haralick,et al.  Review and analysis of solutions of the three point perspective pose estimation problem , 1994, International Journal of Computer Vision.

[20]  Fredrik Kahl,et al.  Tractable Algorithms for Robust Model Estimation , 2014, International Journal of Computer Vision.

[21]  Fredrik Kahl,et al.  Robust Optimal Pose Estimation , 2008, ECCV.

[22]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[23]  David Windridge,et al.  Globally Optimal 2D-3D Registration from Points or Lines without Correspondences , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[24]  Clark F. Olson,et al.  Efficient Pose Clustering Using a Randomized Algorithm , 1997, International Journal of Computer Vision.

[25]  Frédéric Jurie,et al.  Solution of the Simultaneous Pose and Correspondence Problem Using Gaussian Error Model , 1999, Comput. Vis. Image Underst..

[26]  Lars Petersson,et al.  GOGMA: Globally-Optimal Gaussian Mixture Alignment , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[28]  Richard I. Hartley,et al.  Global Optimization through Rotation Space Search , 2009, International Journal of Computer Vision.

[29]  Torsten Sattler,et al.  Camera Pose Voting for Large-Scale Image-Based Localization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Kristy Sim,et al.  Removing outliers using the L∞ Norm , 2006, CVPR 2006.

[31]  Carl Olsson,et al.  Branch-and-Bound Methods for Euclidean Registration Problems , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Roberto Cipolla,et al.  Geometric Loss Functions for Camera Pose Regression with Deep Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Hongdong Li,et al.  SDICP: Semi-Dense Tracking based on Iterative Closest Points , 2015, BMVC.

[34]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[35]  Steven Gold,et al.  A Graduated Assignment Algorithm for Graph Matching , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Clark F. Olson,et al.  A General Method for Geometric Feature Matching and Model Extraction , 2001, International Journal of Computer Vision.

[37]  Andrew W. Fitzgibbon,et al.  Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Loong Fah Cheong,et al.  Simultaneous Camera Pose and Correspondence Estimation with Motion Coherence , 2011, International Journal of Computer Vision.

[39]  Jiaolong Yang,et al.  Go-ICP: A Globally Optimal Solution to 3D ICP Point-Set Registration , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Torsten Sattler,et al.  Efficient & Effective Prioritized Matching for Large-Scale Image-Based Localization , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Todd A. Cass Polynomial-Time Geometric Matching for Object Recognition , 2004, International Journal of Computer Vision.

[42]  Fredrik Kahl,et al.  City-Scale Localization for Cameras with Known Vertical Direction , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Thomas M. Breuel,et al.  Implementation techniques for geometric branch-and-bound matching methods , 2003, Comput. Vis. Image Underst..

[44]  Anders P. Eriksson,et al.  Guaranteed Outlier Removal with Mixed Integer Linear Programs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Laurent Kneip,et al.  OpenGV: A unified and generalized approach to real-time calibrated geometric vision , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[46]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[47]  Philip David,et al.  SoftPOSIT: Simultaneous Pose and Correspondence Determination , 2002, International Journal of Computer Vision.

[48]  Viktor Larsson,et al.  Optimal Relative Pose with Unknown Correspondences , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Hongdong Li,et al.  The 3D-3D Registration Problem Revisited , 2007, 2007 IEEE 11th International Conference on Computer Vision.