Fast and robust absolute camera pose estimation with known focal length

Some 3D computer vision techniques such as structure from motion (SFM) and augmented reality (AR) depend on a specific perspective-n-point (PnP) algorithm to estimate the absolute camera pose. However, existing PnP algorithms are difficult to achieve a good balance between accuracy and efficiency, and most of them do not make full use of the internal camera information such as focal length. In order to attack these drawbacks, we propose a fast and robust PnP (FRPnP) method to calculate the absolute camera pose for 3D compute vision. In the proposed FRPnP method, we firstly formulate the PnP problem as the optimization problem in the null space that can avoid the effects of the depth of each 3D point. Secondly, we can easily get the solution by the direct manner using singular value decomposition. Finally, the accurate information of camera pose can be obtained by optimization strategy. We explore four ways to evaluate the proposed FRPnP algorithm with synthetic dataset, real images, and apply it in the AR and SFM system. Experimental results show that the proposed FRPnP method can obtain the best balance between computational cost and precision, and clearly outperforms the state-of-the-art PnP methods.

[1]  Zhuwen Li,et al.  Diminished reality using appearance and 3D geometry of internet photo collections , 2013, 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[2]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[3]  Raymond H. Chan,et al.  A Fast Randomized Eigensolver with Structured LDL Factorization Update , 2014, SIAM J. Matrix Anal. Appl..

[4]  Chen Kong,et al.  Prior-Less Compressible Structure from Motion , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Hujun Bao,et al.  Keyframe-based real-time camera tracking , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6]  Jan-Michael Frahm,et al.  Correcting for Duplicate Scene Structure in Sparse 3D Reconstruction , 2014, ECCV.

[7]  H. Opower Multiple view geometry in computer vision , 2002 .

[8]  Zhanyi Hu,et al.  PnP Problem Revisited , 2005, Journal of Mathematical Imaging and Vision.

[9]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[10]  Chris Dede,et al.  Augmented Reality Teaching and Learning , 2014 .

[11]  Noah Snavely,et al.  Robust Global Translations with 1DSfM , 2014, ECCV.

[12]  Stefan Hinz,et al.  MLPnP - A Real-Time Maximum Likelihood Solution to the Perspective-n-Point Problem , 2016, ArXiv.

[13]  Huizhong Chen,et al.  The stanford mobile visual search data set , 2011, MMSys.

[14]  Dieter Schmalstieg,et al.  A Minimal Solution to the Generalized Pose-and-Scale Problem , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Jorge Gustavo Rocha,et al.  State of Art Survey On: Large Scale Image Location Recognition , 2016, ICCSA.

[16]  Kostas Daniilidis,et al.  Linear Pose Estimation from Points or Lines , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Gaku Nakano,et al.  A Versatile Approach for Solving PnP, PnPf, and PnPfr Problems , 2016, ECCV.

[18]  Francisco José Madrid-Cuevas,et al.  Automatic generation and detection of highly reliable fiducial markers under occlusion , 2014, Pattern Recognit..

[19]  Jan-Michael Frahm,et al.  Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Hendrik P. A. Lensch,et al.  Scale Robust Multi View Stereo , 2012, ECCV.

[21]  Kalle Åström,et al.  Absolute pose for cameras under flat refractive interfaces , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Francesc Moreno-Noguer,et al.  Exhaustive Linearization for Robust Camera Pose and Focal Length Estimation , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Gregory D. Hager,et al.  Fast and Globally Convergent Pose Estimation from Video Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Rin-ichiro Taniguchi,et al.  Augmented Blendshapes for Real-Time Simultaneous 3D Head Modeling and Facial Motion Capture , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Tobias Höllerer,et al.  Optimizing the Viewing Graph for Structure-from-Motion , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Luc Vincent,et al.  Building indoor multi-panorama experiences at scale , 2012, SIGGRAPH Talks.

[27]  Vincent Lepetit,et al.  Accurate Non-Iterative O(n) Solution to the PnP Problem , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[28]  Kostas Daniilidis,et al.  Hand-Eye Calibration Using Dual Quaternions , 1999, Int. J. Robotics Res..

[29]  Torsten Sattler,et al.  Camera Pose Voting for Large-Scale Image-Based Localization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Steven K. Feiner,et al.  Exploring MARS: developing indoor and outdoor user interfaces to a mobile augmented reality system , 1999, Comput. Graph..

[31]  Xin Yang,et al.  LDB: An ultra-fast feature for scalable Augmented Reality on mobile devices , 2012, 2012 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[32]  Francesc Moreno-Noguer,et al.  Leveraging Feature Uncertainty in the PnP Problem , 2014, BMVC.

[33]  Soh-Khim Ong,et al.  Virtual and Augmented Reality Applications in Manufacturing , 2004, MIM.

[34]  Andrew Owens,et al.  Discrete-continuous optimization for large-scale structure from motion , 2011, CVPR 2011.

[35]  Wenbin Li,et al.  Dense Nonrigid Ground Truth for Optical Flow in Real-World Scenes , 2016, ArXiv.

[36]  Hongdong Li,et al.  UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability , 2014, ECCV.

[37]  Radu Horaud,et al.  An analytic solution for the perspective 4-point problem , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[38]  Zhanyi Hu,et al.  A Note on the Number of Solutions of the Noncoplanar P4P Problem , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  Yubin Kuang,et al.  Revisiting the PnP Problem: A Fast, General and Optimal Solution , 2013, 2013 IEEE International Conference on Computer Vision.

[40]  Wolfgang Förstner,et al.  Minimal Representations for Uncertainty and Estimation in Projective Spaces , 2010, ACCV.

[41]  Klas Josephson,et al.  Pose estimation with radial distortion and unknown focal length , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[42]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[43]  Andrew Owens,et al.  SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels , 2013, 2013 IEEE International Conference on Computer Vision.

[44]  Hanqing Lu,et al.  Fast and Accurate Image Matching with Cascade Hashing for 3D Reconstruction , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Gilad Lerman,et al.  Hybrid Linear Modeling via Local Best-Fit Flats , 2010, International Journal of Computer Vision.

[46]  Tobias Höllerer,et al.  gDLS: A Scalable Solution to the Generalized Pose and Scale Problem , 2014, ECCV.

[47]  John J. Leonard,et al.  Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age , 2016, IEEE Transactions on Robotics.

[48]  Shiqi Li,et al.  A Robust O(n) Solution to the Perspective-n-Point Problem , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Reinhard Koch,et al.  Robust and Efficient Pose Estimation from Line Correspondences , 2012, ACCV.

[50]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[51]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.

[52]  Axel Pinz,et al.  Globally Optimal O(n) Solution to the PnP Problem for General Camera Models , 2008, BMVC.

[53]  Haibo Li,et al.  Head Orientation Modeling: Geometric Head Pose Estimation using Monocular Camera , 2013, ICIS 2013.

[54]  Xuejin Chen,et al.  Geometry-Aware Image Completion via Multiple Examples , 2014, PG.

[55]  Martin Byröd,et al.  Pose estimation with radial distortion and unknown focal length , 2009, CVPR.

[56]  Hujun Bao,et al.  Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion , 2015, IEEE Transactions on Image Processing.

[57]  Shiyu Song,et al.  Robust Scale Estimation in Real-Time Monocular SFM for Autonomous Driving , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[58]  Pascal Monasse,et al.  Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion , 2013, ICCV.

[59]  Andrea Fusiello,et al.  Solving the PnP Problem with Anisotropic Orthogonal Procrustes Analysis , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[60]  Bodo Rosenhahn,et al.  Multicamera Calibration from Visible and Mirrored Epipoles , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  Zicheng Liu,et al.  Model-based bundle adjustment with application to face modeling , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[62]  Zhihan Lv,et al.  Bigdata Oriented Multimedia Mobile Health Applications , 2016, Journal of Medical Systems.

[63]  Joel A. Hesch,et al.  A Direct Least-Squares (DLS) method for PnP , 2011, 2011 International Conference on Computer Vision.

[64]  P. J. Huber Robust Regression: Asymptotics, Conjectures and Monte Carlo , 1973 .

[65]  Niloy J. Mitra,et al.  Dynamic SfM: Detecting Scene Changes from Image Pairs , 2015, SGP '15.

[66]  Francesc Moreno-Noguer,et al.  Very Fast Solution to the PnP Problem with Algebraic Outlier Rejection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[67]  Robert M. Haralick,et al.  Review and analysis of solutions of the three point perspective pose estimation problem , 1994, International Journal of Computer Vision.

[68]  Didier Henrion,et al.  Globally Optimal Estimates for Geometric Reconstruction Problems , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[69]  Adrien Bartoli,et al.  KAZE Features , 2012, ECCV.

[70]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[71]  Changchang Wu,et al.  Structure from Motion Using Structure-Less Resection , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[72]  Changchang Wu,et al.  SiftGPU : A GPU Implementation of Scale Invariant Feature Transform (SIFT) , 2007 .

[73]  Wojciech Cellary,et al.  Evaluation of learners' attitude toward learning in ARIES augmented reality environments , 2013, Comput. Educ..

[74]  Silvio Savarese,et al.  Semantic structure from motion , 2011, CVPR 2011.

[75]  Takayuki Okatani,et al.  Detecting Changes in 3D Structure of a Scene from Multi-view Images Captured by a Vehicle-Mounted Camera , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[76]  Andrea Fusiello,et al.  Robust Global Motion Estimation with Matrix Completion , 2014, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[77]  Zhihan Lv,et al.  Touch-less interactive augmented reality game on vision-based wearable device , 2015, Personal and Ubiquitous Computing.

[78]  Josef Sivic,et al.  NetVLAD: CNN Architecture for Weakly Supervised Place Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[79]  Steven M. Seitz,et al.  Multicore bundle adjustment , 2011, CVPR 2011.

[80]  Carlos Delgado Kloos,et al.  Impact of an augmented reality system on students' motivation for a visual art course , 2013, Comput. Educ..

[81]  Luc Van Gool,et al.  Progressive Prioritized Multi-view Stereo , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[82]  Tobias Höllerer,et al.  Efficient Computation of Absolute Pose for Gravity-Aware Augmented Reality , 2015, 2015 IEEE International Symposium on Mixed and Augmented Reality.

[83]  Zhihan Lv,et al.  Multimodal Hand and Foot Gesture Interaction for Handheld Devices , 2014, TOMM.

[84]  Masatoshi Okutomi,et al.  ASPnP: An Accurate and Scalable Solution to the Perspective-n-Point Problem , 2013, IEICE Trans. Inf. Syst..

[85]  John J. Leonard,et al.  Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age , 2016, IEEE Transactions on Robotics.

[86]  Francisco José Madrid-Cuevas,et al.  Generation of fiducial marker dictionaries using Mixed Integer Linear Programming , 2016, Pattern Recognit..

[87]  Zhihan Lv,et al.  ARGIS-based outdoor underground pipeline information system , 2016, J. Vis. Commun. Image Represent..

[88]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.