The Feasibility Of Motion And Structure Computations

We address the problem of interpreting image velocity fields generated by a moving monocular observer viewing a stationary environment under perspective projection to obtain 3-D information about the relative motion of the observer (egomotion) and the relative depth of environmental surface points (environmental layout). The algorithm presented in this paper involves computing motion and structure from a spatio-temporal distribution of image velocities that are hypothesized to belong to the same 3-D planar surface. However, the main result of this paper is not just another motion and structure algorithm that exhibits some novel features but rather an extensive error analysis of the algorithm’s preformance for various types of noise in the image velocities. Waxman and Ullman [83] have devised an algorithm for computing motion and structure using image velocity and its 1st and 2d order spatial derivatives at one image point. We generalize this result to include derivative information in time as well. Further, we show the approximate equivalence of reconstruction algorithms that use only image velocities and those that use one image velocity and its 1st and/or 2”d spatio-temporal derivatives at one image point. The main question addressed in this paper is: “How accurate do the input image velocities have to be?’ or equivalently, “How accurate does the input image velocity and its I~ and 2& order derivatives have to be?“. The answer to this question involves worst case error analysis. We end the paper by drawing some conclusions about the feasibility of motion and structure calculations in general. I.1 Introduction In this paper, we present a algorithm for computing the motion and strncture parameters that describe egomotion and environmental layout from image velocity fields generated by a moving monocular observer viewing a stationary environment. Egomotion is defined as the motion of the observer relative to his environment and can be described by 6 parameters; 3 dvth-scaled translational parameters, Z and 3 rotation parameters, o. Environmental layout refers to the 3-D shape and location of objects in the environment. For monocular image sequences, en$ronmental layout is described by the normalized surface gradient, a, at each image point. To determine these motion and structure parameters we derive nonlinear equations relating image velocity at some image int ?(?*,t ‘) to the underlying motion and structure parameters at (P,c). The computaP tion of egomotion and environmental layout from image velocity is sometimes called the reconstruction problem; we reconstruct the observer’s motion, and the layout of his environment, from (timevarying) image velocity. A lot of research has been devoted to

[1]  John K. Tsotsos,et al.  The Sensitivity of Motion and Structure Computations , 1987, AAAI.

[2]  David J. Heeger,et al.  Optical flow from spatialtemporal filters , 1987 .

[3]  Daryl T. Lawton,et al.  Processing translational motion sequences , 1983, Comput. Vis. Graph. Image Process..

[4]  J. Gibson Optical motions and transformations as stimuli for visual perception. , 1957, Psychological review.

[5]  D Marr,et al.  Directional selectivity and its use in early visual processing , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[6]  R. Woodham,et al.  Determining the movement of objects from a sequence of images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[8]  Jake K. Aggarwal,et al.  Visually Interpreting the Motion of Objects in Space , 1981, Computer.

[9]  A. Bandopadhay A Computational Study of Rigid Motion Perception. , 1986 .

[10]  Allen M. Waxman,et al.  Contour Evolution, Neighborhood Deformation, and Global Image Flow: Planar Surfaces in Motion , 1985 .

[11]  Alessandro Verri,et al.  Against Quantitative Optical Flow , 1987 .

[12]  William B. Thompson,et al.  Disparity Analysis of Images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Yiannis Aloimonos,et al.  Determining the 3-D motion of a rigid surface patch without correspondence, under perspective projection: I. planar surfaces. II. curved surfaces , 1986, AAAI 1986.

[14]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch, II: Singular value decomposition , 1982 .

[15]  S. Ullman,et al.  The interpretation of visual motion , 1977 .

[16]  Thomas S. Huang,et al.  Solving three-dimensional small-rotation motion equations: Uniqueness, algorithms, and numerical results , 1983, Computer Vision Graphics and Image Processing.

[17]  Hans-Hellmut Nagel,et al.  Volumetric model and 3D trajectory of a moving car derived from monocular TV frame sequences of a street scene , 1981, Comput. Graph. Image Process..

[18]  A. Waxman,et al.  On the Uniqueness of Image Flow Solutions for Planar Surfaces in Motion , 1985 .

[19]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[20]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  K. Prazdny,et al.  Motion and Structure from Optical Flow , 1979, IJCAI.

[22]  Thomas S. Huang,et al.  Some Experiments on Estimating the 3-D Motion Parameters of a Rigid Body from Two Consecutive Image Frames , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  John K. Tsotsos,et al.  Determination of Egomotion and Environmental Layout from Noisy Time-Varying Image Velocity in Binocular Image Sequences , 1987, IJCAI.

[24]  J. Hay,et al.  Optical motions and space perception: an extension of Gibson's analysis. , 1966, Psychological review.

[25]  H. C. Longuet-Higgins,et al.  The interpretation of a moving retinal image , 1980, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[26]  T. D. Williams,et al.  Depth from camera motion in a real world scene , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Allen M. Waxman,et al.  Surface Structure and Three-Dimensional Motion from Image Flow Kinematics , 1985 .

[28]  Kenichi Kanatani Structure from Motion Without Correspondence: General Principle , 1985, IJCAI.