A universal, closed-form approach for absolute pose problems

Abstract We propose a general approach for absolute pose problems including the well known perspective-n-point (PnP) problem, its generalized variant (GPnP) with and without scale, and the pose from 2D line correspondences (PnL). These have received a tremendous attention in the computer vision community during the last decades. However, it was only recently that efficient, globally optimal, closed-form solutions have been proposed, which can handle arbitrary numbers of correspondences including minimal configurations as well as over-constrained cases with linear complexity. We follow the general scheme by eliminating the linear parameters first, which results in a least squares error function that only depends on the non-linear rotation and a small symmetric coefficient matrix of fixed size. Then, in a second step the rotation is solved with algorithms which are derived using methods from algebraic geometry such as the Grobner basis method. We propose a unified formulation based on a representation with orthogonal complements which allows to combine different types of constraints elegantly in one single framework. We show that with our unified formulation existing polynomial solvers can be interchangeably applied to problem instances other than those they were originally proposed for. It becomes possible to compare them on various registrations problems with respect to accuracy, numerical stability, and computational speed. Our compression procedure not only preserves linear complexity, it is even faster than previous formulations. For the second step we also derive an own algebraic equation solver, which can additionally handle the registration from 3D point-to-point correspondences, where other rotation solvers fail. Finally, we also present a marker-based SLAM approach with automatic registration to a target coordinate system based on partial and distributed reference information. It represents an application example that goes beyond classical camera pose estimation from image measurements and also serves for evaluation on real data.

[1]  Peter F. Sturm,et al.  Pose estimation using both points and lines for geo-localization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[2]  Helder Araújo,et al.  Pose Estimation for General Cameras Using Lines , 2015, IEEE Transactions on Cybernetics.

[3]  Dieter Schmalstieg,et al.  A Minimal Solution to the Generalized Pose-and-Scale Problem , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Dieter Schmalstieg,et al.  Robust and unobtrusive marker tracking on mobile phones , 2008, 2008 7th IEEE/ACM International Symposium on Mixed and Augmented Reality.

[5]  Paul D. Fiore,et al.  Efficient Linear Solution of Exterior Orientation , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Hongdong Li,et al.  Efficient Computation of Relative Pose for Multi-camera Systems , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Jianliang Tang,et al.  Complete Solution Classification for the Perspective-Three-Point Problem , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Arjan Kuijper,et al.  Composing the feature map retrieval process for robust and ready-to-use monocular tracking , 2011, Comput. Graph..

[9]  Arjan Kuijper,et al.  Unifying Algebraic Solvers for Scaled Euclidean Registration from Point, Line and Plane Constraints , 2016, ACCV.

[10]  Olivier D. Faugeras,et al.  Monocular pose determination from lines: critical sets and maximum number of solutions , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Arjan Kuijper,et al.  Reconstruction and Accurate Alignment of Feature Maps for Augmented Reality , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[12]  Simone Gasparini,et al.  Camera Models and Fundamental Concepts Used in Geometric Computer Vision , 2011, Found. Trends Comput. Graph. Vis..

[13]  Ahmed M. Elgammal,et al.  Line-based relative pose estimation , 2011, CVPR 2011.

[14]  Holger Regenbrecht,et al.  Interactive multi-marker calibration for augmented reality applications , 2002, Proceedings. International Symposium on Mixed and Augmented Reality.

[15]  Homer H. Chen,et al.  Pose determination from line-to-plane correspondences: existence condition and closed-form solutions , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[16]  Young Sam Lee,et al.  Real-time single camera SLAM using fiducial markers , 2009, 2009 ICCAS-SICE.

[17]  Tobias Höllerer,et al.  Theia: A Fast and Scalable Structure-from-Motion Library , 2015, ACM Multimedia.

[18]  K. S. Arun,et al.  Least-Squares Fitting of Two 3-D Point Sets , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Yubin Kuang,et al.  Exploiting p-fold symmetries for faster polynomial equation solving , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[20]  Luc Van Gool,et al.  Generalised Linear Pose Estimation , 2007, BMVC.

[21]  Pavel Zemcík,et al.  Absolute pose estimation from line correspondences using direct linear transformation , 2016, Comput. Vis. Image Underst..

[22]  Hatem Hmam,et al.  Optimal non-iterative pose estimation via convex relaxation , 2010, Image Vis. Comput..

[23]  Jean Ponce What is a camera? , 2009, CVPR.

[24]  Robert Pless,et al.  Using many cameras as one , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[25]  Long Quan,et al.  Linear N-Point Camera Pose Determination , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Javier González,et al.  Convex Global 3D Registration with Lagrangian Duality , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Zuzana Kukelova,et al.  Automatic Generator of Minimal Problem Solvers , 2008, ECCV.

[28]  Robert M. Haralick,et al.  Analysis and solutions of the three point perspective pose estimation problem , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[29]  Hongdong Li,et al.  A linear approach to motion estimation using generalized camera models , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Tobias Höllerer,et al.  Large Scale SfM with the Distributed Camera Model , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[31]  Carl Olsson,et al.  Branch-and-Bound Methods for Euclidean Registration Problems , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Francisco José Madrid-Cuevas,et al.  Automatic generation and detection of highly reliable fiducial markers under occlusion , 2014, Pattern Recognit..

[33]  David Nistér,et al.  A Minimal Solution to the Generalised 3-Point Pose Problem , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[34]  Visesh Chari,et al.  A theory of multi-layer flat refractive geometry , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  Dieter Schmalstieg,et al.  Automatic Reconstruction of Wide-Area Fiducial Marker Models , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[36]  Ivan E. Sutherland,et al.  Three-dimensional data input by tablet , 1974 .

[37]  Axel Pinz,et al.  Globally Optimal O(n) Solution to the PnP Problem for General Camera Models , 2008, BMVC.

[38]  Rafael Muñoz-Salinas,et al.  Mapping and Localization from Planar Markers , 2016, Pattern Recognit..

[39]  Hiroyuki Yamamoto,et al.  A marker calibration method utilizing a priori knowledge on marker arrangement , 2004, Third IEEE and ACM International Symposium on Mixed and Augmented Reality.

[40]  Harald Wuest,et al.  A camera-based calibration for automotive augmented reality Head-Up-Displays , 2013, 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[41]  Gene H. Golub,et al.  Matrix computations , 1983 .

[42]  Stergios I. Roumeliotis,et al.  Globally optimal pose estimation from line correspondences , 2011, 2011 IEEE International Conference on Robotics and Automation.

[43]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[44]  T. Yamada,et al.  A study on SLAM for indoor blimp with visual markers , 2009, 2009 ICCAS-SICE.

[45]  Mark Fiala,et al.  Designing Highly Reliable Fiducial Markers , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Andrew W. Fitzgibbon,et al.  Efficient Intersection of Three Quadrics and Applications in Computer Vision , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Reinhard Koch,et al.  Pose Estimation from Line Correspondences: A Complete Analysis and a Series of Solutions , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using unit quaternions , 1987 .

[49]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[50]  Marc Pollefeys,et al.  Minimal Solutions for Pose Estimation of a Multi-Camera System , 2013, ISRR.

[51]  Axel Pinz,et al.  Robust Pose Estimation from a Planar Target , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Larry S. Davis,et al.  Pose Determination of a Three-Dimensional Object Using Triangle Pairs , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[53]  Torsten Sattler,et al.  Minimal Solvers for Generalized Pose and Scale Estimation from Two Rays and One Point , 2016, ECCV.

[54]  Edwin Olson,et al.  AprilTag: A robust and flexible visual fiducial system , 2011, 2011 IEEE International Conference on Robotics and Automation.

[55]  Kostas Daniilidis,et al.  Linear Pose Estimation from Points or Lines , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using orthonormal matrices , 1988 .

[57]  Hirokazu Kato,et al.  Marker tracking and HMD calibration for a video-based augmented reality conferencing system , 1999, Proceedings 2nd IEEE and ACM International Workshop on Augmented Reality (IWAR'99).

[58]  Victor Y. Pan,et al.  Numerical methods for roots of polynomials , 2007 .

[59]  Wen-Yan Chang,et al.  On pose recovery for generalized visual sensors , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Jonas Buchli,et al.  An open source, fiducial based, visual-inertial motion capture system , 2016, 2016 19th International Conference on Information Fusion (FUSION).

[61]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[62]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[63]  Lejing Wang,et al.  Floyd-Warshall all-pair shortest path for accurate multi-marker calibration , 2010, 2010 IEEE International Symposium on Mixed and Augmented Reality.

[64]  Manolis I. A. Lourakis An efficient solution to absolute orientation , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[65]  Andrea Torsello,et al.  An Accurate and Robust Artificial Marker Based on Cyclic Codes , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Yubin Kuang,et al.  Revisiting the PnP Problem: A Fast, General and Optimal Solution , 2013, 2013 IEEE International Conference on Computer Vision.

[67]  Michel Dhome,et al.  Determination of the Attitude of 3D Objects from a Single Perspective View , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[68]  Helder Araújo,et al.  Direct Solution to the Minimal Generalized Pose , 2015, IEEE Transactions on Cybernetics.

[69]  Laurent Kneip,et al.  OpenGV: A unified and generalized approach to real-time calibrated geometric vision , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[70]  Francesc Moreno-Noguer,et al.  Accurate and Linear Time Pose Estimation from Points and Lines , 2016, ECCV.

[71]  Yuichi Taguchi,et al.  Analytical Forward Projection for Axial Non-central Dioptric and Catadioptric Cameras , 2010, ECCV.

[72]  Hongdong Li,et al.  UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability , 2014, ECCV.

[73]  Gaku Nakano,et al.  Globally Optimal DLS Method for PnP Problem with Cayley parameterization , 2015, BMVC.

[74]  Shiqi Li,et al.  A Robust O(n) Solution to the Perspective-n-Point Problem , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[75]  Adrien Bartoli,et al.  Infinitesimal Plane-Based Pose Estimation , 2014, International Journal of Computer Vision.

[76]  Gregory D. Hager,et al.  Fast and Globally Convergent Pose Estimation from Video Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[77]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[78]  Didier Henrion,et al.  Globally Optimal Estimates for Geometric Reconstruction Problems , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[79]  Roland Siegwart,et al.  Using multi-camera systems in robotics: Efficient solutions to the NPnP problem , 2013, 2013 IEEE International Conference on Robotics and Automation.

[80]  Javier Civera,et al.  S-PTAM: Stereo Parallel Tracking and Mapping , 2017, Robotics Auton. Syst..

[81]  Hideo Saito,et al.  Random dot markers , 2011, 2011 IEEE Virtual Reality Conference.

[82]  Sundaram Ganapathy,et al.  Decomposition of transformation matrices for robot vision , 1984, Pattern Recognition Letters.

[83]  Tobias Höllerer,et al.  gDLS: A Scalable Solution to the Generalized Pose and Scale Problem , 2014, ECCV.

[84]  Larry S. Davis,et al.  Model-based object pose in 25 lines of code , 1992, International Journal of Computer Vision.

[85]  Richard I. Hartley,et al.  Lines and Points in Three Views and the Trifocal Tensor , 1997, International Journal of Computer Vision.

[86]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[87]  Carl Olsson,et al.  The Registration Problem Revisited: Optimal Solutions From Points, Lines and Planes , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[88]  S. Umeyama,et al.  Least-Squares Estimation of Transformation Parameters Between Two Point Patterns , 1991, IEEE Trans. Pattern Anal. Mach. Intell..