Robust parameterized component analysis: theory and applications to 2D facial appearance models

Principal component analysis (PCA) has been successfully applied to construct linear models of shape, graylevel, and motion in images. In particular, PCA has been widely used to model the variation in the appearance of people's faces. We extend previous work on facial modeling for tracking faces in video sequences as they undergo significant changes due to facial expressions. Here we consider person-specific facial appearance models (PSFAM), which use modular PCA to model complex intra-person appearance changes. Such models require aligned visual training data; in previous work, this has involved a time consuming and error-prone hand alignment and cropping process. Instead, the main contribution of this paper is to introduce parameterized component analysis to learn a subspace that is invariant to affine (or higher order) geometric transformations. The automatic learning of a PSFAM given a training image sequence is posed as a continuous optimization problem and is solved with a mixture of stochastic and deterministic techniques achieving sub-pixel accuracy. We illustrate the use of the 2D PSFAM model with preliminary experiments relevant to applications including video-conferencing and avatar animation.

[1]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[2]  Timothy F. Cootes,et al.  Statistical models of appearance for computer vision , 1999 .

[3]  Michael J. Black,et al.  Eigentracking: Robust matching and tracking of objects using view - based representation , 1998 .

[4]  Timothy F. Cootes,et al.  Locating facial features using genetic algorithms. , 1995 .

[5]  Dorothea Heiss-Czedik,et al.  An Introduction to Genetic Algorithms. , 1997, Artificial Life.

[6]  Nikolaus F. Troje,et al.  Separation of texture and shape in images of faces for image coding and synthesis , 1997 .

[7]  Michael J. Black,et al.  A Framework for Robust Subspace Learning , 2003, International Journal of Computer Vision.

[8]  Peter Eisert,et al.  Model-based estimation of facial expression parameters from image sequences , 1997, Proceedings of International Conference on Image Processing.

[9]  B. Frey,et al.  Transformation-Invariant Clustering Using the EM Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Stuart Geman,et al.  Statistical methods for tomographic image reconstruction , 1987 .

[11]  Michael Isard,et al.  Active Contours , 2000, Springer London.

[12]  Shai Avidan,et al.  Support Vector Tracking , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[14]  Juha Karhunen,et al.  Principal component neural networks — Theory and applications , 1998, Pattern Analysis and Applications.

[15]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  F. de la Torre Automatic learning of appearance face models , 2001, Proceedings IEEE ICCV Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems.

[17]  S. Nayar,et al.  Early Visual Learning , 1996 .

[18]  Michael J. Black,et al.  Robust Principal Component Analysis for Computer Vision , 2001, ICCV.

[19]  Alex Pentland,et al.  Mixtures of eigenfeatures for real-time structure from texture , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[20]  R. Cipolle,et al.  Computer Vision and Human-Computer Interaction , 1998 .

[21]  Norbert Krüger,et al.  Face Recognition by Elastic Bunch Graph Matching , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Baback Moghaddam,et al.  Principal manifolds and Bayesian subspaces for visual recognition , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[23]  Michael J. Black,et al.  Dynamic coupled component analysis , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[24]  Michael J. Black,et al.  Recognizing Facial Expressions in Image Sequences Using Local Parameterized Models of Image Motion , 1997, International Journal of Computer Vision.

[25]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[26]  Frederick Mosteller,et al.  Exploring Data Tables, Trends and Shapes. , 1986 .

[27]  Alex Pentland,et al.  Probabilistic Visual Learning for Object Representation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Michael J. Black,et al.  EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation , 1996, International Journal of Computer Vision.

[29]  L Sirovich,et al.  Low-dimensional procedure for the characterization of human faces. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[30]  Michael J. Black,et al.  Robust Parameterized Component Analysis , 2002, ECCV.

[31]  H. Schewitzer Optimal eigenfeature selection by optimal image registration , 1999, CVPR 1999.

[32]  Anand Rangarajan,et al.  A new algorithm for non-rigid point matching , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[33]  Christopher J. Taylor,et al.  A Framework for Automatic Landmark Identification Using a New Method of Nonrigid Correspondence , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Jordi Vitrià,et al.  Eigenfiltering for flexible eigentracking (EFE) , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[35]  Shaogang Gong,et al.  View alignment with dynamically updated affine tracking , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[36]  Timothy F. Cootes,et al.  Determining Correspondences for Statistical Models of Appearance , 2000, ECCV.

[37]  Mitchell Feigenbaum,et al.  Local feature analysis: a statistical theory for information representation and transmission , 1998 .

[38]  Shaogang Gong,et al.  A Multi-View Nonlinear Active Shape Model Using Kernel PCA , 1999, BMVC.

[39]  Penio S. Penev,et al.  Local feature analysis: A general statistical theory for object representation , 1996 .

[40]  Timothy F. Cootes,et al.  Automatic Interpretation and Coding of Face Images Using Flexible Models , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Penio S. Penev,et al.  Facial feature tracking and pose estimation in video sequences by factorial coding of the low-dimensional entropy manifolds due to the partial symmetries of faces , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[42]  Rajesh P. N. Rao,et al.  Development of localized oriented receptive fields by learning a translation-invariant code for natural images. , 1998, Network.

[43]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[44]  Andrew Blake,et al.  Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[45]  Gregory D. Hager,et al.  Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  Shaogang Gong,et al.  Dynamic Vision - From Images to Face Recognition , 2000 .

[47]  Larry S. Davis,et al.  A probabilistic framework for rigid and non-rigid appearance based tracking and recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[48]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Margrit Betke,et al.  Fast object recognition in noisy images using simulated annealing , 1995, Proceedings of IEEE International Conference on Computer Vision.

[51]  Timothy F. Cootes,et al.  Improving identification performance by integrating evidence from sequences , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[52]  P. Holland,et al.  Robust regression using iteratively reweighted least-squares , 1977 .

[53]  Aleix M. Martínez,et al.  Recognizing Imprecisely Localized, Partially Occluded, and Expression Variant Faces from a Single Sample per Class , 2002, IEEE Trans. Pattern Anal. Mach. Intell..