Fast and Globally Convergent Pose Estimation from Video Images

Determining the rigid transformation relating 2D images to known 3D geometry is a classical problem in photogrammetry and computer vision. Heretofore, the best methods for solving the problem have relied on iterative optimization methods which cannot be proven to converge and/or which do not effectively account for the orthonormal structure of rotation matrices. We show that the pose estimation problem can be formulated as that of minimizing an error metric based on collinearity in object (as opposed to image) space. Using object space collinearity error, we derive an iterative algorithm which directly computes orthogonal rotation matrices and which is globally convergent. Experimentally, we show that the method is computationally efficient, that it is no less accurate than the best currently employed optimization methods, and that it outperforms all tested methods in robustness to outliers.

[1]  E. H. Thompson The projective theory of relative orientation , 1968 .

[2]  Yoram Yakimovsky,et al.  A system for extracting three-dimensional measurements from a stereo pair of TV cameras , 1976 .

[3]  Hans P. Moravec Obstacle avoidance and navigation in the real world by a seeing robot rover , 1980 .

[4]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[5]  Sundaram Ganapathy,et al.  Decomposition of transformation matrices for robot vision , 1984, Pattern Recognition Letters.

[6]  Sundaram Ganapathy,et al.  Decomposition of transformation matrices for robot vision , 1984, Pattern Recognit. Lett..

[7]  R. Y. Tsai,et al.  An Efficient and Accurate Camera Calibration Technique for 3D Machine Vision , 1986, CVPR 1986.

[8]  Roger Y. Tsai,et al.  A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses , 1987, IEEE J. Robotics Autom..

[9]  Yoshiaki Shirai,et al.  Three-Dimensional Computer Vision , 1987, Symbolic Computation.

[10]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using unit quaternions , 1987 .

[11]  J. K. Aggarwal,et al.  TRIANGULATION ERRORS IN STEREO ALGORITHMS. , 1987 .

[12]  K. S. Arun,et al.  Least-Squares Fitting of Two 3-D Point Sets , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[14]  Roger Y. Tsai,et al.  Techniques for calibration of the scale factor and image center for high accuracy 3D machine vision metrology , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[15]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using orthonormal matrices , 1988 .

[16]  Roger Y. Tsai,et al.  Techniques for Calibration of the Scale Factor and Image Center for High Accuracy 3-D Machine Vision Metrology , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Michel Dhome,et al.  Determination of the Attitude of 3D Objects from a Single Perspective View , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Radu Horaud,et al.  An analytic solution for the perspective 4-point problem , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Xinhua Zhuang,et al.  Pose estimation from corresponding point data , 1989, IEEE Trans. Syst. Man Cybern..

[20]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[21]  Richard A. Volz,et al.  Estimating 3-D location parameters using dual number quaternions , 1991, CVGIP Image Underst..

[22]  Robert M. Haralick,et al.  Analysis and solutions of the three point perspective pose estimation problem , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Linda G. Shapiro,et al.  Computer and Robot Vision , 1991 .

[24]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  T. D. Alter 3D Pose from Three Corresponding Points Under Weak-Perspective Projection , 1992 .

[26]  Ryutarou Ohbuchi,et al.  Merging virtual objects with the real world: seeing ultrasound imagery within the patient , 1992, SIGGRAPH.

[27]  David B. Kirk,et al.  Graphics Gems III , 1992 .

[28]  T. D. Alter D Pose from 3 Corresponding Points under Weak-Perspective Projection , 1992 .

[29]  Larry S. Davis,et al.  Model-Based Object Pose in 25 Lines of Code , 1992, ECCV.

[30]  FuchsHenry,et al.  Merging virtual objects with the real world , 1992 .

[31]  Larry S. Davis,et al.  Exact and Approximate Solutions of the Perspective-Three-Point Problem , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  W. Sardha Wijesoma,et al.  Eye-to-Hand Coordination for Vision-Guided Robot Control Applications , 1993, Int. J. Robotics Res..

[33]  Narendra Ahuja,et al.  Optimal Motion and Structure Estimation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[35]  William L. Wilson,et al.  VISUAL SERVO CONTROL OF ROBOTS USING KALMAN FILTER ESTIMATES OF ROBOT POSE RELATIVE TO WORK-PIECES , 1993 .

[36]  Ronald Azuma,et al.  Improving static and dynamic registration in an optical see-through HMD , 1994, SIGGRAPH.

[37]  Gregory D. Hager Real-time feature tracking and projective invariance as a basis for hand-eye coordination , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Thomas S. Huang,et al.  Motion and structure from feature correspondences: a review , 1994, Proc. IEEE.

[39]  W. Eric L. Grimson,et al.  An automatic registration method for frameless stereotaxy, image guided surgery, and enhanced reality visualization , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Emanuele Trucco,et al.  Computer and Robot Vision , 1995 .

[41]  Chien-Ping Lu Online pose estimation and model matching , 1996 .

[42]  Helder Araujo,et al.  A Fully Projective Formulation for Lowe's Tracking Algorithm , 1996 .

[43]  Mark A. Livingston,et al.  Superior augmented reality registration by integrating landmark tracking and magnetic tracking , 1996, SIGGRAPH.

[44]  Tomas Lozano-Perez,et al.  An automatic registration method for frameless stereotaxy, image guided surgery, and enhanced reality visualization , 1996 .

[45]  Gregory D. Hager,et al.  X Vision: A Portable Substrate for Real-Time Vision Applications , 1998, Comput. Vis. Image Underst..

[46]  Gregory D. Hager,et al.  Robust Vision for Vision-Based Control of Motion , 1999 .