Robust facial feature tracking using selected multi-resolution linear predictors

This paper proposes a learnt data-driven approach for accurate, real-time tracking of facial features using only intensity information. Constraints such as a-priori shape models or temporal models for dynamics are not required or used. Tracking facial features simply becomes the independent tracking of a set of points on the face. This allows us to cope with facial configurations not present in the training data. Tracking is achieved via linear predictors which provide a fast and effective method for mapping pixel-level information to tracked feature position displacements. To improve on this, a novel and robust biased linear predictor is proposed in this paper. Multiple linear predictors are grouped into a rigid flock to increase robustness. To further improve tracking accuracy, a novel probabilistic selection method is used to identify relevant visual areas for tracking a feature point. These selected flocks are then combined into a hierarchical multi-resolution LP model. Experimental results also show that this method performs more robustly and accurately than AAMs, without any a priori shape information and with minimal training examples.

[1]  Richard Bowden,et al.  Robust Lip-Tracking using Rigid Flocks of Selected Linear Predictors , 2008 .

[2]  Mathias Kölsch,et al.  Fast 2D Hand Tracking with Flocks of Features and Multi-Cue Integration , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[3]  Andrew Blake,et al.  Sparse Bayesian learning for efficient visual tracking , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Stephen M. Omohundro,et al.  Nonlinear manifold learning for visual speech recognition , 1995, Proceedings of IEEE International Conference on Computer Vision.

[5]  Aggelos K. Katsaggelos,et al.  Lip tracking for MPEG-4 facial animation , 2002, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces.

[6]  Paul Deléglise,et al.  Statistical Lip-Appearance Models Trained Automatically Using Audio Information , 2002, EURASIP J. Adv. Signal Process..

[7]  Jesse Hoey Tracking using Flocks of Features, with Application to Assisted Handwashing , 2006, BMVC.

[8]  Alan L. Yuille,et al.  Feature extraction from faces using deformable templates , 2004, International Journal of Computer Vision.

[9]  Patrice Delmas,et al.  Automatic lip tracking: Bayesian segmentation and active contours in a cooperative scheme , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[10]  Timothy F. Cootes,et al.  Extraction of Visual Features for Lipreading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  DaubiasPhilippe,et al.  Statistical lip-appearance models trained automatically using audio information , 2002 .

[12]  Jiri Matas,et al.  Tracking by an Optimal Sequence of Linear Predictors , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Eun-Jung Holden,et al.  Lip Tracking using Pattern Matching Snakes , 2002 .

[14]  Edwin R. Hancock,et al.  Regression tracking with data relevance determination , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..