3-D Structure from Visual Motion: Modeling, Representation and Observability

Abstract The problem of ‘structure from motion’ concerns the reconstruction of the three-dimensional structure of a scene from its projection onto a moving two-dimensional surface. Such a problem is solved effectively by the human visual system, judging from the ease with which we perform delicate control tasks involving vision as a sensor such as reaching for objects in the environment or driving a car. In this paper we study ‘structure from motion’ from the point of view of dynamical systems: we first formalize the problem of 3-D structure and motion reconstruction as the estimation of the state of certain nonlinear dynamical models. Then we study the feasibility of ‘structure from motion’ by analyzing the observability of such models. The models that define the visual motion estimation problem for feature points in the Euclidean 3-D space are not locally observable; however, the non-observable manifold can be easily isolated by imposing metric constraints on the state space. One of the peculiarities of vision as a sensor is its richness , which can be a disadvantage when we are interested only in few of the unknown parameters. For instance, if we want to control the direction of heading of our car by measuring brightness values on our retina, we have to overcome the effects that the shape of the environment, its reflectance properties, illumination and other quantities have on our measurements. Invariance to undesired parameters can be achieved by appropriate modeling or by choice of representation of the parameter space. We propose and analyze models for 3-D structure that are independent of 3-D motion and vice versa. Estimating unknown parameters from such models amounts to the identification of nonlinear and implicit systems with parameters on differentiable manifolds, such as a sphere or the so-called essential manifold . © 1997 Elsevier Science Ltd.

[1]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  R. Bucy Nonlinear filtering theory , 1965 .

[3]  Pietro Perona,et al.  Motion Estimation on the Essential Manifold , 1994, ECCV.

[4]  Kenichi Kanatani,et al.  Informat ion Criter ion , 2005 .

[5]  Pietro Perona,et al.  Reducing "structure from motion" , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  P. Perona,et al.  Three dimensional transparent structure segmentation and multiple 3D motion estimation from monocular perspective image sequences , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[7]  Olivier D. Faugeras,et al.  Determining the fundamental matrix with planes: instability and new algorithms , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  P. Holmes,et al.  Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields , 1983, Applied Mathematical Sciences.

[9]  P. Schönemann On artificial intelligence , 1985, Behavioral and Brain Sciences.

[10]  D. Kendall,et al.  The Riemannian Structure of Euclidean Shape Spaces: A Novel Environment for Statistics , 1993 .

[11]  Narendra Ahuja,et al.  Optimal Motion and Structure Estimation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Stefan Treue,et al.  Human perception of structure from motion , 1991, Vision Research.

[13]  J. Marsden Lectures on Mechanics , 1992 .

[14]  S. Ullman The Interpretation of Visual Motion , 1979 .

[15]  M. Spivak A comprehensive introduction to differential geometry , 1979 .

[16]  Andrew Blake,et al.  Grasping visual symmetry , 1993, 1993 (4th) International Conference on Computer Vision.

[17]  K. Mardia,et al.  Shape distributions for landmark data , 1989, Advances in Applied Probability.

[18]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  P. Perona,et al.  Structure From Visual Motion as a Nonlinear Observation Problem , 1995 .

[20]  Narendra Ahuja,et al.  Motion and Structure From Two Perspective Views: Algorithms, Error Analysis, and Error Estimation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Raymond E. Suorsa,et al.  Vision-based obstacle detection for rotorcraft flight , 1992, J. Field Robotics.

[22]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[23]  Donald B. Gennery,et al.  Tracking Known Three-Dimensional Objects , 1982, AAAI.

[24]  W. Dayawansa,et al.  A necessary and sufficient condition for the perspective observability problem , 1995 .

[25]  Hermann von Helmholtz,et al.  Treatise on Physiological Optics , 1962 .

[26]  D. Kendall SHAPE MANIFOLDS, PROCRUSTEAN METRICS, AND COMPLEX PROJECTIVE SPACES , 1984 .

[27]  Thomas S. Huang,et al.  Theory of Reconstruction from Image Motion , 1992 .

[28]  R. E. Kalman,et al.  A New Approach to Linear Filtering and Prediction Problems , 2002 .

[29]  O. Faugeras,et al.  Motion from point matches: Multiplicity of solutions , 1989, [1989] Proceedings. Workshop on Visual Motion.

[30]  A. Isidori Nonlinear Control Systems , 1985 .

[31]  Joachim Heel Dynamic motion vision , 1990, Robotics Auton. Syst..

[32]  Alain Bensoussan,et al.  Non linear filtering theory , 1990 .

[33]  Thomas Kailath,et al.  Linear Systems , 1980 .

[34]  H. C. Longuet-Higgins,et al.  Mental Processes: Studies in Cognitive Science , 1987 .

[35]  John Oliensis,et al.  Recursive Multi-Frame Structure from Motion Incorporating Motion Error , 1992 .

[36]  N. H. McClamroch,et al.  Autonomous spacecraft docking using a computer vision system , 1992, [1992] Proceedings of the 31st IEEE Conference on Decision and Control.

[37]  Olivier D. Faugeras,et al.  On the geometry and algebra of the point and line correspondences between N images , 1995, Proceedings of IEEE International Conference on Computer Vision.

[38]  J. Gibson,et al.  Motion parallax as a determinant of perceived depth. , 1959, Journal of experimental psychology.

[39]  Tsutomu Kimoto,et al.  Manipulator control with image-based visual servo , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[40]  P. Perona,et al.  Motion estimation via dynamic vision , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[41]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[42]  Richard M. Murray,et al.  A Mathematical Introduction to Robotic Manipulation , 1994 .

[43]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[44]  Rama Chellappa,et al.  Estimation of Object Motion Parameters from Noisy Images , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  A. Jazwinski Stochastic Processes and Filtering Theory , 1970 .

[46]  Ernst D. Dickmanns Historical development of use of dynamical models for the representation of knowledge about real world processes in machine vision , 1994, Signal Process..