The Sensitivity of Motion and Structure Computations

We address the problem of interpreting image velocity fields generated by a moving monocular observer viewing a stationary environment under perspective projection to obtain 3-D information about the relative motion of the observer (egomotion) and the relative depth of environmental surface points (environmental layout). The algorithm presented in this paper involves computing motion and structure from a spatio-temporal distribution of image velocities that are hypothesized to belong to the same 3-D planar surface. However, the main result of this paper is not just another motion and structure algorithm that exhibits some novel features but rather an extensive error analysis of the algorithm's preformance for various types of noise in the image velocities. Waxman and Ullman [83] have devised an algorithm for computing motion and structure using image velocity and its 1st and 2nd order spatial derivatives at one image point. We generalize this result to include derivative information in time as well. Further, we show the approximate equivalence of reconstruction algorithms that use orily image velocities and those that use one image velocity and its 1st and/or 2nd spatio-temporal derivatives at one image point. The main question addressed in this paper is: "How accurate do the input image velocities have to be?" or equivalently, "How accurate does the input image velocity and its 1st and 2nd order derivatives have to be?". The answer to this question involves worst case error analysis. We end the paper by drawing some conclusions about the feasibility of motion and structure calculations in general.

[1]  Allen M. Waxman,et al.  Contour Evolution, Neighborhood Deformation, and Global Image Flow: Planar Surfaces in Motion , 1985 .

[2]  Daryl T. Lawton,et al.  Processing translational motion sequences , 1983, Comput. Vis. Graph. Image Process..

[3]  J. Hay,et al.  Optical motions and space perception: an extension of Gibson's analysis. , 1966, Psychological review.

[4]  K. Prazdny,et al.  Motion and Structure from Optical Flow , 1979, IJCAI.

[5]  Hans-Hellmut Nagel,et al.  Volumetric model and 3D trajectory of a moving car derived from monocular TV frame sequences of a street scene , 1981, Comput. Graph. Image Process..

[6]  A. Waxman,et al.  On the Uniqueness of Image Flow Solutions for Planar Surfaces in Motion , 1985 .

[7]  R. Woodham,et al.  Determining the movement of objects from a sequence of images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  William B. Thompson,et al.  Disparity Analysis of Images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  John K. Tsotsos,et al.  Determinations of egomotion and environmental layout from noisy time varying image velocity information in monocular image sequences , 1988 .

[10]  John Aloimonos,et al.  Perception of Rigid Motion from Spatio-Temporal Derivatives of Optical Flow , 1985 .

[11]  Jake K. Aggarwal,et al.  Visually Interpreting the Motion of Objects in Space , 1981, Computer.

[12]  Thomas S. Huang,et al.  Solving three-dimensional small-rotation motion equations: Uniqueness, algorithms, and numerical results , 1983, Computer Vision Graphics and Image Processing.

[13]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[14]  Thomas S. Huang,et al.  Some Experiments on Estimating the 3-D Motion Parameters of a Rigid Body from Two Consecutive Image Frames , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  John K. Tsotsos,et al.  Determination of Egomotion and Environmental Layout from Noisy Time-Varying Image Velocity in Binocular Image Sequences , 1987, IJCAI.

[16]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  T. D. Williams,et al.  Depth from camera motion in a real world scene , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  David W. Murray,et al.  3D Solutions to the Aperture Problem , 1984, ECAI.

[19]  S. Ullman,et al.  The interpretation of visual motion , 1977 .

[20]  Yiannis Aloimonos,et al.  Determining the 3-D motion of a rigid surface patch without correspondence, under perspective projection: I. planar surfaces. II. curved surfaces , 1986, AAAI 1986.

[21]  Kenichi Kanatani Structure from Motion Without Correspondence: General Principle , 1985, IJCAI.

[22]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.