Model-based object pose in 25 lines of code

In this paper, we describe a method for finding the pose of an object from a single image. We assume that we can detect and match in the image four or more noncoplanar feature points of the object, and that we know their relative geometry on the object. The method combines two algorithms; the first algorithm,POS (Pose from Orthography and Scaling) approximates the perspective projection with a scaled orthographic projection and finds the rotation matrix and the translation vector of the object by solving a linear system; the second algorithm,POSIT (POS with ITerations), uses in its iteration loop the approximate pose found by POS in order to compute better scaled orthographic projections of the feature points, then applies POS to these projections instead of the original image projections. POSIT converges to accurate pose measurements in a few iterations. POSIT can be used with many feature points at once for added insensitivity to measurement errors and image noise. Compared to classic approaches making use of Newton's method, POSIT does not require starting from an initial guess, and computes the pose using an order of magnitude fewer floating point operations; it may therefore be a useful alternative for real-time operation. When speed is not an issue, POSIT can be written in 25 lines or less in Mathematica; the code is provided in an Appendix.

[1]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[2]  Ivan E. Sutherland,et al.  Three-dimensional data input by tablet , 1974 .

[3]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[4]  Roger Y. Tsai,et al.  A versatile camera calibration technique for high-accuracy 3D machine vision metrology using off-the-shelf TV cameras and lenses , 1987, IEEE J. Robotics Autom..

[5]  Michel Dhome,et al.  Determination of the Attitude of 3D Objects from a Single Perspective View , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Radu Horaud,et al.  An analytic solution for the perspective 4-point problem , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Joseph S.-C. Yuan A general photogrammetric method for determining object position and orientation , 1989, IEEE Trans. Robotics Autom..

[8]  S. Maybank The projective geometry of ambiguous surfaces , 1990, Philosophical Transactions of the Royal Society of London. Series A: Physical and Engineering Sciences.

[9]  Larry S. Davis,et al.  New exact and approximate solutions of the three-point perspective problem , 1990, Proceedings., IEEE International Conference on Robotics and Automation.

[10]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Thomas M. Breuel,et al.  Fast recognition using adaptive subdivisions of transformation space , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Robert M. Haralick,et al.  Performance characterization in image analysis: thinning, a case in point , 1992, Pattern Recognit. Lett..

[14]  Larry S. Davis,et al.  Model-Based Object Pose in 25 Lines of Code , 1992, ECCV.

[15]  Frank Biocca,et al.  A Survey of Position Trackers , 1992, Presence: Teleoperators & Virtual Environments.

[16]  Larry S. Davis,et al.  Exact and Approximate Solutions of the Perspective-Three-Point Problem , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Daphna Weinshall,et al.  Distance metric between 3D models and 2D images for recognition and classification , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Larry S. Davis,et al.  Recognition and Tracking of 3D Objects by 1D Search , 1993 .

[19]  C Tomasi,et al.  Shape and motion from image streams: a factorization method. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[20]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[21]  Mongi A. Abidi,et al.  A New Efficient and Direct Solution for Pose Estimation Using Quadrangular Targets: Algorithm and Evaluation , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Proceedings of the IEEE , 2018, IEEE Journal of Emerging and Selected Topics in Power Electronics.

[23]  Larry S. Davis,et al.  Iterative Pose Estimation Using Coplanar Feature Points , 1996, Comput. Vis. Image Underst..

[24]  Daphna Weinshall,et al.  Distance Metric Between 3D Models and 2D Images for Recognition and Classification , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[25]  H. Bourlard,et al.  S E a R C H , 2002 .

[26]  William H. Press,et al.  Numerical recipes in C , 2002 .