Experiments on estimating egomotion and structure parameters using long monocular image sequences

This paper presents a simple but robust model based approach to estimating the kinematics of a moving camera and the structure of the objects in a stationary environment using long, noisy, monocular image sequences. Both batch and recursive algorithms are presented and the problem due to occlusion is addressed. The approach is based on representing the constant translational velocity and constant angular velocity of the camera motion using nine rectilinear motion parameters, which are 3-D vectors of the position of the rotation center, linear and angular velocities. The structure parameters are 3-D coordinates of the salient feature points in the inertial coordinate system. Due to redundancies in parameterization, the total number of independent parameters to be estimated is 3M+7, whereM is the number of feature points. The image plane coordinates of these feature points in each frame are first detected and matched over the frames. These noisy image coordinates serve as the input to our algorithms. Due to the nonlinear nature of perspective projection, a nonlinear least squares method is formulated for the batch algorithm, and a conjugate gradient method is then applied to find the solution. A recursive method using an Iterated Extended Kalman Filter (IEKF) for incremental estimation of motion and structure is also presented. Since the plant model is simple in our formulation, closed form solutions for the state and covariance transition equations are easily derived. Experimental results for simulated imagery as well as several real image sequences are included.

[1]  Rama Chellappa,et al.  3-D Motion Estimation Using a Sequence of Noisy Stereo Images: Models, Estimation, and Uniqueness Results , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  H. Ando Dynamic reconstruction of 3D structure and 3D motion , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[3]  Volker Graefe,et al.  Applications of dynamic monocular machine vision , 1988, Machine Vision and Applications.

[4]  Rama Chellappa,et al.  Automatic feature point extraction and tracking in image sequences for arbitrary camera motion , 1995, International Journal of Computer Vision.

[5]  Steven D. Blostein,et al.  MOTION DETECTION AND ESTIMATION FROM STEREO IMAGE SEQUENCES: SOME PRELIMINARY EXPERIMENTAL RESULTS. , 1986 .

[6]  Qi Tian,et al.  Algorithms for subpixel registration , 1986 .

[7]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch, II: Singular value decomposition , 1982 .

[8]  Ramesh C. Jain,et al.  Structure from motion-a critical analysis of methods , 1991, IEEE Trans. Syst. Man Cybern..

[9]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Donald B. Gennery,et al.  Visual tracking of known three-dimensional objects , 1992, International Journal of Computer Vision.

[11]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Patrick Bouthemy,et al.  Estimation of time-to-collision maps from first order motion models and normal flows , 1992, [1992] Proceedings. 11th IAPR International Conference on Pattern Recognition.

[13]  Larry H. Matthies,et al.  Error modeling in stereo navigation , 1986, IEEE J. Robotics Autom..

[14]  Rama Chellappa,et al.  Performance bounds for estimating three-dimensional motion parameters from a sequence of noisy images , 1989 .

[15]  Juyang Weng,et al.  Motion and structure from long stereo image sequences , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[16]  Volker Graefe,et al.  Dynamic monocular machine vision , 1988, Machine Vision and Applications.

[17]  Olivier D. Faugeras,et al.  Three-dimensional motion computation and object segmentation in a long sequence of stereo frames , 1992, International Journal of Computer Vision.

[18]  Ishwar K. Sethi,et al.  Finding Trajectories of Feature Points in a Monocular Image Sequence , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Hormoz Shariat,et al.  Motion Estimation with More than Two Frames , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Rama Chellappa,et al.  Estimation of Object Motion Parameters from Noisy Images , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Narendra Ahuja,et al.  3-D Motion Estimation, Understanding, and Prediction from Noisy Image Sequences , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Shahriar Negahdaripour,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence , 2004 .

[23]  Donald B. Gennery,et al.  Tracking Known Three-Dimensional Objects , 1982, AAAI.

[24]  Berthold K. P. Horn,et al.  Direct methods for recovering motion , 1988, International Journal of Computer Vision.

[25]  A. Jazwinski Stochastic Processes and Filtering Theory , 1970 .

[26]  Allen R. Hanson,et al.  Identification and 3D description of 'shallow' environmental structure in a sequence of images , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[27]  R. Woodham,et al.  Determining the movement of objects from a sequence of images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  C. J. Taylor,et al.  Structure and motion in two dimensions from multiple images: a least squares approach , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[29]  Yehoshua Y. Zeevi,et al.  The Generalized Gabor Scheme of Image Representation in Biological and Machine Vision , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  J. Oliensis,et al.  Incorporating motion error in multi-frame structure from motion , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[31]  Rama Chellappa,et al.  Estimation of Illuminant Direction, Albedo, and Shape from Shading , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Narendra Ahuja,et al.  Motion and Structure from Line Correspondences; Closed-Form Solution, Uniqueness, and Optimization , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  John Oliensis,et al.  Recursive Multi-Frame Structure from Motion Incorporating Motion Error , 1992 .

[34]  Edward M. Riseman,et al.  A data set for quantitative motion analysis , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[35]  Rama Chellappa,et al.  A computational vision approach to image registration , 1993, IEEE Trans. Image Process..

[36]  Rama Chellappa,et al.  A feature based approach to face recognition , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[37]  J. G. Daugman Relaxation neural network for nonorthogonal image transforms , 1988, IEEE 1988 International Conference on Neural Networks.

[38]  Rama Chellappa,et al.  Automatic feature point extraction and tracking in image sequences for unknown camera motion , 1993, 1993 (4th) International Conference on Computer Vision.

[39]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch , 1981 .

[40]  R. Chellappa,et al.  Recursive 3-D motion estimation from a monocular image sequence , 1990 .

[41]  J. Heel Direct Estimation of Structure and Motion from Multiple Frames , 1990 .

[42]  Thomas S. Huang,et al.  Some Experiments on Estimating the 3-D Motion Parameters of a Rigid Body from Two Consecutive Image Frames , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Azriel Rosenfeld,et al.  Point pattern matching by relaxation , 1980, Pattern Recognit..

[44]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch, III: Finite point correspondences and the three-view problem , 1984 .

[45]  Rama Chellappa,et al.  Passive ranging using a moving camera , 1992, J. Field Robotics.

[46]  C. Tomasi,et al.  Factoring image sequences into shape and motion , 1991, Proceedings of the IEEE Workshop on Visual Motion.