HARD-PnP: PnP Optimization Using a Hybrid Approximate Representation

This paper proposes a Hybrid Approximate Representation (HAR) based on unifying several efficient approximations of the generalized reprojection error (which is known as the <italic>gold standard</italic> for multiview geometry). The HAR is an over-parameterization scheme where the approximation is applied simultaneously in multiple parameter spaces. A joint minimization scheme “HAR-Descent” can then solve the PnP problem efficiently, while remaining robust to approximation errors and local minima. The technique is evaluated extensively, including numerous synthetic benchmark protocols and the real-world data evaluations used in previous works. The proposed technique was found to have runtime complexity comparable to the fastest <inline-formula><tex-math notation="LaTeX">$O(n)$</tex-math><alternatives><inline-graphic xlink:href="hadfield-ieq1-2806446.gif"/></alternatives></inline-formula> techniques, and up to 10 times faster than current state of the art minimization approaches. In addition, the accuracy exceeds that of all 9 previous techniques tested, providing definitive state of the art performance on the benchmarks, across all 90 of the experiments in the paper and supplementary material, which can be found on the Computer Society Digital Library at <uri>http://doi.ieeecomputersociety.org/10.1109/TPAMI.2018.2806446</uri>.

[1]  Carl Olsson,et al.  Branch-and-Bound Methods for Euclidean Registration Problems , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Francesc Moreno-Noguer,et al.  Very Fast Solution to the PnP Problem with Algebraic Outlier Rejection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Tomas Lozano-Perez,et al.  An automatic registration method for frameless stereotaxy, image guided surgery, and enhanced reality visualization , 1996 .

[4]  David A. Cox,et al.  Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra, 3/e (Undergraduate Texts in Mathematics) , 2007 .

[5]  Steven M. Seitz,et al.  Multicore bundle adjustment , 2011, CVPR 2011.

[6]  G. Roth,et al.  provided that the source of such material is fully acknowledged. Robust 2D Tracking for Real-Time Augmented Reality , 2002 .

[7]  Axel Pinz,et al.  Robust Pose Estimation from a Planar Target , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Hongdong Li,et al.  UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability , 2014, ECCV.

[9]  Masatoshi Okutomi,et al.  ASPnP: An Accurate and Scalable Solution to the Perspective-n-Point Problem , 2013, IEICE Trans. Inf. Syst..

[10]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[11]  Richard Bowden,et al.  Natural Action Recognition Using Invariant 3D Motion Encoding , 2014, ECCV.

[12]  Jiri Matas,et al.  Fixing the Locally Optimized RANSAC , 2012, BMVC.

[13]  Zuzana Kukelova,et al.  Automatic Generator of Minimal Problem Solvers , 2008, ECCV.

[14]  Andrea Fusiello,et al.  Solving the PnP Problem with Anisotropic Orthogonal Procrustes Analysis , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[15]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[16]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[17]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[18]  Yubin Kuang,et al.  Revisiting the PnP Problem: A Fast, General and Optimal Solution , 2013, 2013 IEEE International Conference on Computer Vision.

[19]  XuChi,et al.  A Robust O(n) Solution to the Perspective-n-Point Problem , 2012 .

[20]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.

[21]  Axel Pinz,et al.  Globally Optimal O(n) Solution to the PnP Problem for General Camera Models , 2008, BMVC.

[22]  Roland Siegwart,et al.  Using multi-camera systems in robotics: Efficient solutions to the NPnP problem , 2013, 2013 IEEE International Conference on Robotics and Automation.

[23]  Changchang Wu,et al.  P3.5P: Pose estimation with unknown focal length , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Shiqi Li,et al.  A Robust O(n) Solution to the Perspective-n-Point Problem , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  H. M. Karara,et al.  Direct Linear Transformation from Comparator Coordinates into Object Space Coordinates in Close-Range Photogrammetry , 2015 .

[26]  Martin Jaggi,et al.  Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization , 2013, ICML.

[27]  W. C. Mcginnis Ideals , 1925, Free Speech.

[28]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[29]  Richard Bowden,et al.  2D or Not 2D: Bridging the Gap Between Tracking and Structure from Motion , 2014, ACCV.

[30]  Gregory D. Hager,et al.  Fast and Globally Convergent Pose Estimation from Video Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Richard Szeliski,et al.  Bundle Adjustment in the Large , 2010, ECCV.

[32]  David Nister,et al.  Recent developments on direct relative orientation , 2006 .

[33]  Mark A. Livingston,et al.  Superior augmented reality registration by integrating landmark tracking and magnetic tracking , 1996, SIGGRAPH.

[34]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .