Regression-based Active Appearance Model initialization for facial feature tracking with missing frames

The Active Appearance Model (AAM) is receiving considerable attention in the field of facial analysis as a powerful method for modeling and segmenting deformable visual objects. Several extensions and improvements have been proposed on the original AAM, but AAMs maintain their dependence on the good initialization of model parameters to achieve accurate fitting results. AAMs are usually used directly in video tracking by searching on each subsequent frame that employs the fitting result of the previous frame for initialization. However, this model sometimes fails when large movements exist between two frames. This mechanism occurs when frames are dropped from the video due to the use of a lossy multimedia network. A regression-based approach for automatic AAM initialization is presented in this paper. After undergoing a scattered feature correspondence based on a dual-threshold matching strategy, the AAM shape points are initialized by the spatial map between local-landmark (L2L) correspondences. The map is learned based on Kernel Ridge Regression (KRR). The proposed method can successfully track the frames that are not identified with the general AAM trackers by establishing spatial relationship between local and landmark points. The initialization is robust to disturbances, which enables it to outperform key-feature-tracking or detection-based methods. We demonstrate the efficacy of the approach on two challenging facial videos with different training data and report a detailed quantitative evaluation of its performance.

[1]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[2]  Tetsunori Kobayashi,et al.  An ASM fitting method based on machine learning that provides a robust parameter initialization for AAM fitting , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[3]  Daijin Kim,et al.  Adaptive active appearance model with incremental learning , 2009, Pattern Recognit. Lett..

[4]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[5]  Timothy F. Cootes,et al.  Real time feature point tracking with automatic model selection , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[6]  David Cristinacce,et al.  Automatic feature localisation with constrained local models , 2008, Pattern Recognit..

[7]  Aly A. Farag,et al.  Face recognition at-a-distance based on sparse-stereo reconstruction , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[8]  Hui Zhang,et al.  Robust facial expression tracking based on composite constraints AAM , 2011, 2011 18th IEEE International Conference on Image Processing.

[9]  Simon Lucey,et al.  Deformable Model Fitting by Regularized Landmark Mean-Shift , 2010, International Journal of Computer Vision.

[10]  Marc Hanheide,et al.  Automatic Initialization for Facial Analysis in Interactive Robotics , 2008, ICVS.

[11]  Ying Cui,et al.  Facial Feature Points Tracking Based on AAM with Optical Flow Constrained Initialization , 2012 .

[12]  Pau-Choo Chung,et al.  Using salient region constraints on Active Appearance Model initialization for facial feature extraction , 2010, 2010 International Computer Symposium (ICS2010).

[13]  Xiaoming Liu,et al.  Video-based face model fitting using Adaptive Active Appearance Model , 2010, Image Vis. Comput..

[14]  Lale Akarun,et al.  A Database of Non-Manual Signs in Turkish Sign Language , 2007, 2007 IEEE 15th Signal Processing and Communications Applications.

[15]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..