论文信息 - Project-Out Cascaded Regression with an application to face alignment

Project-Out Cascaded Regression with an application to face alignment

Cascaded regression approaches have been recently shown to achieve state-of-the-art performance for many computer vision tasks. Beyond its connection to boosting, cascaded regression has been interpreted as a learning-based approach to iterative optimization methods like the Newton's method. However, in prior work, the connection to optimization theory is limited only in learning a mapping from image features to problem parameters. In this paper, we consider the problem of facial deformable model fitting using cascaded regression and make the following contributions: (a) We propose regression to learn a sequence of averaged Jacobian and Hessian matrices from data, and from them descent directions in a fashion inspired by Gauss-Newton optimization. (b) We show that the optimization problem in hand has structure and devise a learning strategy for a cascaded regression approach that takes the problem structure into account. By doing so, the proposed method learns and employs a sequence of averaged Jacobians and descent directions in a subspace orthogonal to the facial appearance variation; hence, we call it Project-Out Cascaded Regression (PO-CR). (c) Based on the principles of PO-CR, we built a face alignment system that produces remarkably accurate results on the challenging iBUG data set outperforming previously proposed systems by a large margin. Code for our system is available from http://www.cs.nott.ac.uk/~yzt/.

Georgios Tzimiropoulos | Georgios Tzimiropoulos

[1] Pietro Perona,et al. Cascaded pose regression , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[3] Maja Pantic,et al. Optimization Problems for Fast AAM Fitting in-the-Wild , 2013, 2013 IEEE International Conference on Computer Vision.

[4] Nassir Navab,et al. Deformable Template Tracking in 1ms , 2014, BMVC.

[5] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.

[6] Stefanos Zafeiriou,et al. A Semi-automatic Methodology for Facial Landmark Annotation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[7] David Cristinacce,et al. Automatic feature localisation with constrained local models , 2008, Pattern Recognit..

[8] David J. Kriegman,et al. Localizing Parts of Faces Using a Consensus of Exemplars , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Fernando De la Torre,et al. Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Josephine Sullivan,et al. One millisecond face alignment with an ensemble of regression trees , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Petros Maragos,et al. Adaptive and constrained algorithms for inverse compositional Active Appearance Model fitting , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Thomas S. Huang,et al. Interactive Facial Feature Localization , 2012, ECCV.

[13] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[14] B. Heisele. Face Detection , 2001 .

[15] Sridha Sridharan,et al. Efficient constrained local model fitting for non-rigid face alignment , 2009, Image Vis. Comput..

[16] Manolis I. A. Lourakis,et al. SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[17] Ioannis Patras,et al. Sieving Regression Forest Votes for Facial Feature Detection in the Wild , 2013, 2013 IEEE International Conference on Computer Vision.

[18] J. Friedman. Greedy function approximation: A gradient boosting machine. , 2001 .

[19] Maja Pantic,et al. Facial point detection using boosted regression and graph models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20] Simon Lucey,et al. Multi-channel Correlation Filters , 2013, 2013 IEEE International Conference on Computer Vision.

[21] Rui Caseiro,et al. Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[22] Bernhard Schölkopf,et al. A tutorial on support vector regression , 2004, Stat. Comput..

[23] Andrew W. Fitzgibbon,et al. Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[24] Xiaogang Wang,et al. Deep Convolutional Network Cascade for Facial Point Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Luc Van Gool,et al. Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[26] Timothy F. Cootes,et al. Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[27] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[28] Gregory D. Hager,et al. Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[29] Ioannis Patras,et al. Face Parts Localization Using Structured-Output Regression Forests , 2012, ACCV.

[30] Simon Baker,et al. Active Appearance Models Revisited , 2004, International Journal of Computer Vision.

[31] Maja Pantic,et al. Gauss-Newton Deformable Part Models for Face Alignment In-the-Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[33] Stefanos Zafeiriou,et al. Incremental Face Alignment in the Wild , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[34] Hatice Gunes,et al. Output-associative RVM regression for dimensional and continuous emotion prediction , 2011, Face and Gesture 2011.

[35] Jian Sun,et al. Face Alignment by Explicit Shape Regression , 2012, International Journal of Computer Vision.

[36] Simon Baker,et al. Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[37] Simon Lucey,et al. Deformable Model Fitting by Regularized Landmark Mean-Shift , 2010, International Journal of Computer Vision.

[38] Jian Sun,et al. Face Alignment at 3000 FPS via Regressing Local Binary Features , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[39] Timothy F. Cootes,et al. Feature Detection and Tracking with Constrained Local Models , 2006, BMVC.

[40] Timothy F. Cootes,et al. Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[41] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.