Monocular and Stereo Methods for AAM Learning from Video

The active appearance model (AAM) is a powerful method for modeling deformable visual objects. One of the major drawbacks of the AAM is that it requires a training set of pseudo-dense correspondences over the whole database. In this work, we investigate the utility of stereo constraints for automatic model building from video. First, we propose a new method for automatic correspondence finding in monocular images which is based on an adaptive template tracking paradigm. We then extend this method to take the scene geometry into account, proposing three approaches, each accounting for the availability of the fundamental matrix and calibration parameters or the lack thereof. The performance of the monocular method was first evaluated on a pre-annotated database of a talking face. We then compared the monocular method against its three stereo extensions using a stereo database.

[1]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[2]  Michael J. Black,et al.  The robust estimation of multiple motions: Affine and piecewise smooth flow fields , 1993 .

[3]  Michael Isard,et al.  Learning to track curves in motion , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[4]  Christopher J. Taylor,et al.  A Method of Non-Rigid Correspondence for AutomaticLandmark Identification , 1996, BMVC.

[5]  Harpreet S. Sawhney,et al.  Compact Representations of Videos Through Dominant and Multiple Motion Estimation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Anil K. Jain,et al.  Object Tracking Using Deformable Templates , 1998, ICCV.

[7]  Timothy F. Cootes,et al.  Interpreting face images using active appearance models , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[8]  P. E. Frandsen,et al.  Unconstrained optimization , 1999 .

[9]  S. Rougeaux,et al.  Stereo 3d Lip Tracking , 2000 .

[10]  Alexander Zelinsky,et al.  Stereo 3-D Lip Tracking , 2000 .

[11]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[12]  Timothy F. Cootes,et al.  Automatically building appearance models from image sequences using salient features , 2002, Image Vis. Comput..

[13]  Robert T. Schultz,et al.  A unified non-rigid feature registration method for brain mapping , 2003, Medical Image Anal..

[14]  Mikkel B. Stegmann,et al.  Fast Registration of Cardiac Perfusion MRI , 2003 .

[15]  Tony Jebara,et al.  Images as bags of pixels , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[16]  Timothy F. Cootes,et al.  Groupwise Diffeomorphic Non-rigid Registration for Automatic Model Building , 2004, ECCV.

[17]  Simon Baker,et al.  Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[18]  Lars Kai Hansen,et al.  Mapping from Speech to Images Using Continuous State Space Models , 2004, MLMI.

[19]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[20]  Yongmin Li,et al.  On incremental and robust subspace learning , 2004, Pattern Recognit..

[21]  Roland Göcke,et al.  The audio-video australian English speech data corpus AVOZES , 2012, INTERSPEECH.

[22]  Jing Xiao,et al.  Fitting a Single Active Appearance Model Simultaneously to Multiple Images , 2004, BMVC.

[23]  Takahiro Ishikawa,et al.  The template update problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Jeff G. Schneider,et al.  Automatic construction of active appearance models as an image coding problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Avinash C. Kak,et al.  Accurate 3D Tracking of Rigid Objects with Occlusion Using Active Appearance Models , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[26]  David G. Wilson,et al.  2. Constrained Optimization , 2005 .

[27]  Timothy F. Cootes,et al.  Groupwise Construction of Appearance Models using Piece-wise Affine Deformations , 2005, BMVC.