Use Your Hand as a 3-D Mouse, or, Relative Orientation from Extended Sequences of Sparse Point and Line Correspondences Using the Affine Trifocal Tensor

This paper addresses the problem of computing three-dimensional structure and motion from an unknown rigid configuration of points and lines viewed under an affine projection model. An algebraic structure, analogous to the trilinear tensor for three perspective cameras, is defined for configurations of three centered affine cameras. This centered affine trifocal tensor contains 12 non-zero coefficients and involves linear relations between point correspondences and trilinear relations between line correspondences. It is shown how the affine trifocal tensor relates to the perspective trilinear tensor, and how three-dimensional motion can be computed from this tensor in a straightforward manner. A factorization approach is also developed to handle point features and line features simultaneously in image sequences. This theory is applied to a specific problem in human-computer interaction: capturing three-dimensional rotations from gestures of a human hand. Besides the obvious application, this test problem illustrates the usefulness of the affine trifocal tensor in a situation where there is not enough information to compute the perspective trilinear tensor, and where the geometry requires point correspondences as well as line correspondences over at least three views.
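To make the factorization idea concrete, the following is a minimal sketch of the classical rank-3 factorization for point features only, under centered affine cameras. It is not the combined point-and-line method developed in the paper; it only illustrates why centered affine point measurements over several views span a three-dimensional subspace. The function names `factorize_centered_affine` and `synthetic_measurements`, the random cameras, and the noise-free data are all assumptions made for illustration.

```python
import numpy as np


def factorize_centered_affine(W):
    """Rank-3 factorization of a centered measurement matrix.

    W has shape (2F, N): two rows of centered image coordinates per view,
    one column per tracked point. Returns stacked 2x3 camera matrices M
    (2F, 3), affine structure X (3, N), and all singular values of W.
    The result is defined only up to an invertible 3x3 affine ambiguity.
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    sqrt_s = np.sqrt(s[:3])
    M = U[:, :3] * sqrt_s            # scale the three leading left vectors
    X = sqrt_s[:, None] * Vt[:3]     # scale the three leading right vectors
    return M, X, s


def synthetic_measurements(num_views=3, num_points=8, seed=0):
    """Noise-free test data: random centered points seen by random
    centered affine cameras (hypothetical values, illustration only)."""
    rng = np.random.default_rng(seed)
    X_true = rng.standard_normal((3, num_points))
    X_true -= X_true.mean(axis=1, keepdims=True)   # center the point set
    cameras = [rng.standard_normal((2, 3)) for _ in range(num_views)]
    return np.vstack([P @ X_true for P in cameras])


if __name__ == "__main__":
    W = synthetic_measurements()
    M, X, s = factorize_centered_affine(W)
    print("reconstruction error:", np.abs(M @ X - W).max())
    print("singular values:", s)
```

With noise-free data the fourth and later singular values are numerically zero, which is the rank constraint that gives rise to linear relations between point correspondences in three centered affine views. Recovering metric rotations from `M` would require further constraints on the cameras, which this sketch does not attempt.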
