Three-dimensional modeling from two-dimensional video

This paper presents the surface-based factorization method to recover three-dimensional (3-D) structure, i.e., the 3-D shape and 3-D motion, of a rigid object from a two-dimensional (2-D) video sequence. The main ingredients of our approach are as follows: 1) we describe the unknown shape of the 3-D rigid object by polynomial patches; 2) projections of these patches in the image plane move according to parametric 2-D motion models; 3) we recover the parameters describing the 3-D shape and 3-D motion from the 2-D motion parameters by factorizing a matrix that is rank 1 in a noiseless situation. Our method is simultaneously an extension and a simplification of the original factorization method of Tomasi and Kanade (1992). We track regions where the 2-D motion in the image plane is described by a single set of parameters, avoiding the need to track a large number of pointwise features, in general, a difficult task. Then our method estimates the parameters describing the 3-D structure by factoring a rank 1 matrix, not rank 3 as in Tomasi and Kanade. This allows the use of fast iterative algorithms to compute the 3-D structure that best fits the data. Experimental results with real-life video sequences illustrate the good performance of our approach.

[1]  J. Heel Direct Estimation of Structure and Motion from Multiple Frames , 1990 .

[2]  R. Chellappa,et al.  Recursive 3-D motion estimation from a monocular image sequence , 1990 .

[3]  Rama Chellappa,et al.  Structure from motion: sparse versus dense correspondence methods , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[4]  Yiannis Aloimonos,et al.  Shape from video , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[5]  Pietro Perona,et al.  Recursive motion and structure estimation with complete error characterization , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[6]  A. Murat Tekalp,et al.  Simultaneous motion estimation and segmentation , 1997, IEEE Trans. Image Process..

[7]  Serge Ayer,et al.  Sequential and competitive methods for estimation of multiple motions , 1995 .

[8]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Rama Chellappa,et al.  Estimating the Kinematics and Structure of a Rigid Object from a Sequence of Monocular Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  S. B. Kang,et al.  Recovering 3 D Shape and Motion from Image Streams using Non-Linear Least Squares , 1993 .

[12]  José M. F. Moura,et al.  Video representation via 3D shaped mosaics , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[13]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[14]  David J. Michael Exploiting continuity-in-time in motion vision , 1992 .

[15]  Richard Szeliski,et al.  Recovering 3D Shape and Motion from Image Streams Using Nonlinear Least Squares , 1994, J. Vis. Commun. Image Represent..

[16]  Steven D. Blostein,et al.  Motion-based object segmentation and estimation using the MDL principle , 1995, IEEE Trans. Image Process..

[17]  Allen R. Hanson,et al.  Understanding noise: The critical role of motion error in scene reconstruction , 1993, 1993 (4th) International Conference on Computer Vision.

[18]  José M. F. Moura,et al.  A fast algorithm for rigid structure from image sequences , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[19]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Narendra Ahuja,et al.  Optimal Motion and Structure Estimation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  José M. F. Moura,et al.  Maximum Likelihood Inference of 3D Structure from Image Sequences , 1999, EMMCVPR.

[22]  Amnon Shashua,et al.  Model-based brightness constraints: on direct estimation of structure and motion , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Steve Mann,et al.  Video orbits of the projective group a simple approach to featureless estimation of parameters , 1997, IEEE Trans. Image Process..