Robust facial landmark tracking via cascade regression

Recently, tremendous improvements have been achieved for facial landmark localization on static images. However, detecting and tracking facial shapes in sequential images is still challenging due to the large appearance variations in unconstrained videos. To address this issue, we present a robust facial landmark tracking system via cascade regression, which is able to deal well with some challenges emerging in the sequential images. Specially, our system employs a pose-based cascade shape regression model to predict the facial landmark locations. Pose-based cascade shape regression model decreases the shape variances in the model learning stage, making the learned regression model more robust to the large pose variances. In addition, we explore a pose tracking model to enhance the temporal consecutiveness between the adjacent frames, and leverage the Kalman filter to make the predicted shape more smooth and stable. Finally, we incorporate a re-initialization mechanism with the facial landmarks as the position priors into the system, which is able to effectively and accurately locate the face when it is misaligned or lost. Experiments on the LFPW, Helen, 300W and 300VW datasets illustrate the superiority of proposed system over the state-of-the-art approaches, and it is worthy emphasizing that our method has won the 300VW competition in the category one. HighlightsA robust facial landmark tracking system via cascade regression has been proposed.Our method achieves competing results on the LFPW, Helen, 300-W and 300-VW datasets.Our method won the 300-VW competition in category one.

[1]  Stefanos Zafeiriou,et al.  300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[2]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[3]  Vladimir Pavlovic,et al.  Face tracking and recognition with visual constraints in real-world videos , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Xin Jin,et al.  Face alignment by robust discriminative Hough voting , 2016, Pattern Recognit..

[5]  Qingshan Liu,et al.  Improving the Spatial Resolution of Landsat TM/ETM+ Through Fusion With SPOT5 Images via Learning-Based Super-Resolution , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Yuhui Zheng,et al.  Adaptively determining regularisation parameters in non-local total variation regularisation for image denoising , 2015 .

[7]  Xiaogang Wang,et al.  Deep Learning Identity-Preserving Face Space , 2013, 2013 IEEE International Conference on Computer Vision.

[8]  Cheng Li,et al.  Face alignment by coarse-to-fine shape searching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Takeo Kanade,et al.  Multi-PIE , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[10]  Jian Sun,et al.  Face Alignment by Explicit Shape Regression , 2012, International Journal of Computer Vision.

[11]  Zexuan Ji,et al.  Cost-sensitive dictionary learning for face recognition , 2016, Pattern Recognit..

[12]  David Zhang,et al.  A Level Set Approach to Image Segmentation With Intensity Inhomogeneity , 2016, IEEE Transactions on Cybernetics.

[13]  Lei Zhang,et al.  Real-Time Object Tracking Via Online Discriminative Feature Selection , 2013, IEEE Transactions on Image Processing.

[14]  Huihui Song Active contours driven by regularised gradient flux flows for image segmentation , 2014 .

[15]  Jian Sun,et al.  Joint Cascade Face Detection and Alignment , 2014, ECCV.

[16]  Fernando De la Torre,et al.  Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Narendra Ahuja,et al.  Robust Visual Tracking Via Consistent Low-Rank Sparse Learning , 2014, International Journal of Computer Vision.

[18]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[19]  Josephine Sullivan,et al.  One millisecond face alignment with an ensemble of regression trees , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Thomas S. Huang,et al.  Interactive Facial Feature Localization , 2012, ECCV.

[21]  Timothy F. Cootes,et al.  Interpreting face images using active appearance models , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[22]  Pietro Perona,et al.  Robust Face Landmark Estimation under Occlusion , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Tieniu Tan,et al.  Explore semantic pixel sets based local patterns with information entropy for face recognition , 2014, EURASIP J. Image Video Process..

[24]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Qiang Ji,et al.  Shape Augmented Regression Method for Face Alignment , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[26]  Qingshan Liu,et al.  Facial Shape Tracking via Spatio-Temporal Cascade Shape Regression , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[27]  Wei Chen,et al.  Robust visual tracking via patch based kernel correlation filters with adaptive multiple feature ensemble , 2016, Neurocomputing.

[28]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[29]  Huihui Song,et al.  Hyperspectral image denoising via low-rank matrix recovery , 2014 .

[30]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[31]  Stefanos Zafeiriou,et al.  Offline Deformable Face Tracking in Arbitrary Videos , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[32]  Stefanos Zafeiriou,et al.  The First Facial Landmark Tracking in-the-Wild Challenge: Benchmark and Results , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[33]  Tieniu Tan,et al.  Joint space learning for video-based face recognition , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).

[34]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[35]  Yuhui Zheng,et al.  Robust visual tracking via self-similarity learning , 2017 .

[36]  Bülent Sankur,et al.  A comparative study of face landmarking techniques , 2013, EURASIP J. Image Video Process..

[37]  Jian Sun,et al.  Face Alignment at 3000 FPS via Regressing Local Binary Features , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[38]  Simon Baker,et al.  Active Appearance Models Revisited , 2004, International Journal of Computer Vision.

[39]  Timothy F. Cootes,et al.  Multi-view Constrained Local Models for Large Head Angle Facial Tracking , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[40]  Huihui Song Robust visual tracking via online informative feature selection , 2014 .

[41]  Ashraf A. Kassim,et al.  Facial Landmark Detection via Progressive Initialization , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[42]  Maja Pantic,et al.  Gauss-Newton Deformable Part Models for Face Alignment In-the-Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Yuhui Zheng,et al.  Image segmentation by generalized hierarchical fuzzy C-means algorithm , 2015, J. Intell. Fuzzy Syst..

[44]  Qingshan Liu,et al.  Robust object tracking by online Fisher discrimination boosting feature selection , 2016, Comput. Vis. Image Underst..

[45]  Georgios Tzimiropoulos,et al.  Project-Out Cascaded Regression with an application to face alignment , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[47]  Xuelong Li,et al.  A Variational Approach to Simultaneous Image Segmentation and Bias Correction , 2015, IEEE Transactions on Cybernetics.

[48]  Václav Hlavác,et al.  Facial Landmark Tracking by Tree-Based Deformable Part Model Based Detector , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[49]  Zhe L. Lin,et al.  Nonparametric Context Modeling of Local Appearance for Pose- and Expression-Robust Facial Landmark Localization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Fernando De la Torre,et al.  Global supervised descent method , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  David J. Kriegman,et al.  Localizing parts of faces using a consensus of exemplars , 2011, CVPR.

[52]  Huihui Song,et al.  Multiple change detection for multispectral remote sensing images via joint sparse representation , 2014 .

[53]  Pietro Perona,et al.  Cascaded pose regression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[54]  Wenhan Luo,et al.  Unified Face Analysis by Iterative Multi-output Random Forests , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[55]  Qingshan Liu,et al.  Robust Visual Tracking via Convolutional Networks Without Training , 2015, IEEE Transactions on Image Processing.

[56]  Qingshan Liu,et al.  Adaptive Compressive Tracking via Online Vector Boosting Feature Selection , 2015, IEEE Transactions on Cybernetics.

[57]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  Yongkang Wong,et al.  Combined Learning of Salient Local Descriptors and Distance Metrics for Image Set Face Verification , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[59]  Ching Y. Suen,et al.  Robust face recognition based on dynamic rank representation , 2016, Pattern Recognit..

[60]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[61]  Larry S. Davis,et al.  Learning predictable binary codes for face indexing , 2015, Pattern Recognit..

[62]  Stefanos Zafeiriou,et al.  Robust Discriminative Response Map Fitting with Constrained Local Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[63]  Shiguang Shan,et al.  Coarse-to-Fine Auto-Encoder Networks (CFAN) for Real-Time Face Alignment , 2014, ECCV.

[64]  Stefanos Zafeiriou,et al.  A Comprehensive Performance Evaluation of Deformable Face Tracking “In-the-Wild” , 2016, International Journal of Computer Vision.

[65]  Yu Zhou,et al.  Orthogonal curved-line Gabor filter for fast fingerprint enhancement , 2014 .