Incremental model-based estimation using geometric constraints

We present a model-based framework for incremental, adaptive object shape estimation and tracking in monocular image sequences. Parametric structure and motion estimation methods usually assume a fixed class of shape representation (splines, deformable superquadrics, etc.) that is initialized prior to tracking. Since the model shape coverage is fixed a priori, the incremental recovery of structure is decoupled from tracking, thereby limiting both processes in their scope and robustness. In this work, we describe a model-based framework that supports the automatic detection and integration of low-level geometric primitives (lines) incrementally. Such primitives are not explicitly captured in the initial model, but are moving consistently with its image motion. The consistency tests used to identify new structure are based on trinocular constraints between geometric primitives. The method allows not only an increase in the model scope, but also improves tracking accuracy by including the newly recovered features in its state estimation. The formulation is a step toward automatic model building, since it allows both weaker assumptions on the availability of a prior shape representation and on the number of features that would otherwise be necessary for entirely bottom-up reconstruction. We demonstrate the proposed approach on two separate image-based tracking domains, each involving complex 3D object structure and motion.

[1]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[2]  Steven M. Seitz Implicit Scene Reconstruction from Probability Density Functions , 1998 .

[3]  Richard Szeliski,et al.  Recovering 3D Shape and Motion from Image Streams Using Nonlinear Least Squares , 1994, J. Vis. Commun. Image Represent..

[4]  Dimitris N. Metaxas,et al.  Blended deformable models , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Minas E. Spetsakis,et al.  A multi-frame approach to visual motion perception , 1991, International Journal of Computer Vision.

[6]  David J. Kriegman,et al.  Structure and Motion from Line Segments in Multiple Images , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Sven J. Dickinson,et al.  Integrating qualitative and quantitative shape recovery , 1994, International Journal of Computer Vision.

[8]  Alex Pentland,et al.  Closed-form solutions for physically-based shape modeling and recognition , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Sven J. Dickinson,et al.  Using Aspect Graphs to Control the Recovery and Tracking of Deformable Models , 1997 .

[10]  Dimitris N. Metaxas,et al.  Dynamic 3D models with local and global deformations: deformable superquadrics , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[11]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[12]  Dimitris N. Metaxas,et al.  Incorporating illumination constraints in deformable models , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[13]  Pascal Fua,et al.  Taking Advantage of Image-Based and Geometry-Based Constraints to Recover 3-D Surfaces , 1996, Comput. Vis. Image Underst..

[14]  Demetri Terzopoulos,et al.  Constraints on Deformable Models: Recovering 3D Shape and Nonrigid Motion , 1988, Artif. Intell..

[15]  Sven J. Dickinson,et al.  Generic model abstraction from examples , 2000, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  R. Fletcher Practical Methods of Optimization , 1988 .

[17]  Sven J. Dickinson,et al.  Physics-based tracking of 3D objects in 2D image sequences , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[18]  S Ullman,et al.  Maximizing Rigidity: The Incremental Recovery of 3-D Structure from Rigid and Nonrigid Motion , 1984, Perception.

[19]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[20]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[22]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[23]  Amar Mitiche,et al.  The Incremental Rigidity Scheme for Structure from Motion: The Line-Based Formulation , 1990, ECCV.

[24]  Richard I. Hartley,et al.  Lines and Points in Three Views and the Trifocal Tensor , 1997, International Journal of Computer Vision.

[25]  Andrew Zisserman,et al.  Detecting and Tracking Linear Features Efficiently , 1996, BMVC.

[26]  S. B. Kang,et al.  Recovering 3 D Shape and Motion from Image Streams using Non-Linear Least Squares , 1993 .

[27]  Dimitris N. Metaxas,et al.  Physics-based object pose and shape estimation from multiple views , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[28]  Cristian Sminchisescu,et al.  Estimating Articulated Human Motion with Covariance Scaled Sampling , 2003, Int. J. Robotics Res..

[29]  Azriel Rosenfeld,et al.  3-D Shape Recovery Using Distributed Aspect Matching , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Berthold K. P. Horn Relative orientation , 1987, International Journal of Computer Vision.

[31]  David G. Lowe,et al.  Rigidity Checking of 3D Point Correspondences Under Perspective Projection , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Roberto Cipolla,et al.  Application of Lie Algebras to Visual Servoing , 2000, International Journal of Computer Vision.

[33]  David C. Hogg,et al.  Wormholes in shape space: tracking through discontinuous changes in shape , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[34]  P. Anandan,et al.  Implicit representation and scene reconstruction from probability density functions , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[35]  J. Aggarwal,et al.  LINE-BASED COMPUTATION OF STRUCTURE AND MOTION USING ANGULAR INVARIANCE. , 1986 .

[36]  Olivier D. Faugeras,et al.  Feed-forward recovery of motion and structure from a sequence of 2D-lines matches , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[37]  James L. Crowley,et al.  Measurement and integration of 3-D structures by tracking edge lines , 1990, International Journal of Computer Vision.

[38]  Andrew Zisserman,et al.  Robust Object Tracking , 2001 .

[39]  金谷 健一 Statistical optimization for geometric computation : theory and practice , 2005 .

[40]  Cristian Sminchisescu,et al.  Building Roadmaps of Minima and Transitions in Visual Models , 2004, International Journal of Computer Vision.

[41]  Andrew Zisserman,et al.  Multiple view geometry in computer visiond , 2001 .

[42]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Cristian Sminchisescu,et al.  Estimation algorithms for ambiguous visual models : Three Dimensional Human Modeling and Motion Reconstruction in Monocular Video Sequences. (Algorithmes d'estimation pour des modèles visuels ambigus : Modélisation Humaine Tridimensionnelle et Reconstruction du Mouvement dans des Séquences Vidéo Mon , 2002 .

[45]  Cristian Sminchisescu,et al.  Building Roadmaps of Local Minima of Visual Models , 2002, ECCV.

[46]  Alex Pentland,et al.  Recovery of non-rigid motion and structure , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[47]  Kenichi Kanatani,et al.  Statistical optimization for geometric computation - theory and practice , 1996, Machine intelligence and pattern recognition.

[48]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[49]  Amnon Shashua,et al.  Algebraic Functions For Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Roberto Cipolla,et al.  Real-Time Tracking of Multiple Articulated Structures in Multiple Views , 2000, ECCV.

[51]  Thomas S. Huang,et al.  Motion and structure from feature correspondences: a review , 1994, Proc. IEEE.

[52]  James M. Rehg,et al.  A multiple hypothesis approach to figure tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[53]  Dimitris N. Metaxas,et al.  Combining information using hard constraints , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[54]  Alex Pentland,et al.  Recovery of Nonrigid Motion and Structure , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Thomas S. Huang,et al.  Determining 3-D motion and structure of a rigid body using straight line correspondences , 1983, ICASSP.