Simultaneous multi-descent regression and feature learning for facial landmarking in depth images

Face alignment (or facial landmarking) is an important task in many face-related applications, ranging from registration, tracking, and animation to higher-level classification problems such as face, expression, or attribute recognition. While several solutions have been presented in the literature for this task so far, reliably locating salient facial features across a wide range of posses still remains challenging. To address this issue, we propose in this paper a novel method for automatic facial landmark localization in 3D face data designed specifically to address appearance variability caused by significant pose variations. Our method builds on recent cascaded regression-based methods to facial landmarking and uses a gating mechanism to incorporate multiple linear cascaded regression models each trained for a limited range of poses into a single powerful landmarking model capable of processing arbitrary-posed input data. We develop two distinct approaches around the proposed gating mechanism: (1) the first uses a gated multiple ridge descent mechanism in conjunction with established (hand-crafted) histogram of gradients features for face alignment and achieves state-of-the-art landmarking performance across a wide range of facial poses and (2) the second simultaneously learns multiple-descent directions as well as binary features that are optimal for the alignment tasks and in addition to competitive landmarking results also ensures extremely rapid processing. We evaluate both approaches in rigorous experiments on several popular datasets of 3D face images, i.e., the FRGCv2 and Bosphorus 3D face datasets and image collections F and G from the University of Notre Dame. The results of our evaluation show that both approaches compare favorably to the state-of-the-art, while exhibiting considerable robustness to pose variations.

[1]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[2]  Xi Zhao,et al.  A Coarse-to-Fine Approach for 3D Facial Landmarking by Using Deep Feature Fusion , 2018, Symmetry.

[3]  Feng Liu,et al.  Joint Face Alignment and 3D Face Reconstruction with Application to Face Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Fernando De la Torre,et al.  Supervised Descent Method for Solving Nonlinear Least Squares Problems in Computer Vision , 2014, ArXiv.

[5]  Berk Gökberk,et al.  Facial Landmark Localization in Depth Images Using Supervised Ridge Descent , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[6]  Jiwen Lu,et al.  Simultaneous Local Binary Feature Learning and Encoding for Face Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[7]  Libo Cao,et al.  Head Pose Estimation in the Wild Assisted by Facial Landmarks Based on Convolutional Neural Networks , 2019, IEEE Access.

[8]  Jie Wang,et al.  Joint head pose and facial landmark regression from depth images , 2017, Computational Visual Media.

[9]  Ioannis A. Kakadiaris,et al.  3D Facial Landmark Detection under Large Yaw and Expression Variations , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Paul F. Whelan,et al.  3-D Facial Landmark Localization With Asymmetry Patterns and Shape Regression from Incomplete Local Features , 2015, IEEE Transactions on Cybernetics.

[11]  Fernando De la Torre,et al.  Global supervised descent method , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Alan C. Bovik,et al.  Anthropometric 3D Face Recognition , 2010, International Journal of Computer Vision.

[13]  Maurício Pamplona Segundo,et al.  Automatic 3D facial segmentation and landmark detection , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[14]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[15]  Jim Austin,et al.  A Machine-Learning Approach to Keypoint Detection and Landmarking on 3D Meshes , 2012, International Journal of Computer Vision.

[16]  Ioannis A. Kakadiaris,et al.  Using Facial Symmetry to Handle Pose Variations in Real-World 3D Face Recognition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Naoufel Werghi,et al.  Facial landmarks detection using 3D constrained local model on mesh manifold , 2016, 2016 IEEE 59th International Midwest Symposium on Circuits and Systems (MWSCAS).

[18]  Junzhou Huang,et al.  Pose-Free Facial Landmark Fitting via Optimized Part Mixtures and Cascaded Deformable Shape Model , 2013, 2013 IEEE International Conference on Computer Vision.

[19]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[20]  Deva Ramanan,et al.  Face detection, pose estimation, and landmark localization in the wild , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  K. Walker,et al.  View-based active appearance models , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[22]  Patrick J. Flynn,et al.  Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[23]  Berk Gökberk,et al.  Regional Registration for Expression Resistant 3-D Face Recognition , 2010, IEEE Transactions on Information Forensics and Security.

[24]  Ping Yan,et al.  Empirical Evaluation of Advanced Ear Biometrics , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[25]  Huimin Yu,et al.  3D face registration by depth-based template matching and active appearance model , 2014, 2014 Sixth International Conference on Wireless Communications and Signal Processing (WCSP).

[26]  Luc Van Gool,et al.  Real time 3D face alignment with Random Forests-based Active Appearance Models , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[27]  William J. Christmas,et al.  Cascaded Collaborative Regression for Robust Facial Landmark Detection Trained Using a Mixture of Synthetic and Real Images With Dynamic Weighting , 2015, IEEE Transactions on Image Processing.

[28]  Fernando De la Torre,et al.  Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Benjamin Johnston,et al.  A review of image-based automatic facial landmark identification techniques , 2018, EURASIP Journal on Image and Video Processing.

[30]  Ioannis A. Kakadiaris,et al.  Accurate Landmarking of Three-Dimensional Facial Data in the Presence of Facial Expressions and Occlusions Using a Three-Dimensional Statistical Facial Feature Model , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[31]  Jungsik Park,et al.  A Framework for Virtual 3D Manipulation of Face in Video , 2018, 2018 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[32]  Patrick J. Flynn,et al.  Rotated Profile Signatures for robust 3D feature detection , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[33]  Maurício Pamplona Segundo,et al.  Automatic Face Segmentation and Facial Landmark Detection in Range Images , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[34]  Fernando De la Torre,et al.  A Functional Regression Approach to Facial Landmark Tracking , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Mohammed Bennamoun,et al.  An Efficient Multimodal 2D-3D Hybrid Approach to Automatic Face Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Nick Pears,et al.  Landmark Localisation in 3D Face Data , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[37]  Ioannis A. Kakadiaris,et al.  Automatic 2.5-D Facial Landmarking and Emotion Annotation for Social Interaction Assistance , 2016, IEEE Transactions on Cybernetics.

[38]  Shuicheng Yan,et al.  Multi-view face alignment using direct appearance models , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[39]  Le Zhang,et al.  Facial landmark automatic identification from three dimensional (3D) data by using Hidden Markov Model (HMM) , 2017 .

[40]  Chun Chen,et al.  Robust 3D Face Landmark Localization Based on Local Coordinate Coding , 2014, IEEE Transactions on Image Processing.

[41]  Peter Peer,et al.  Localization of Facial Landmarks in Depth Images Using Gated Multiple Ridge Descent , 2018, 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI).

[42]  Jiwen Lu,et al.  Simultaneous Local Binary Feature Learning and Encoding for Homogeneous and Heterogeneous Face Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  Arman Savran,et al.  Bosphorus Database for 3D Face Analysis , 2008, BIOID.

[44]  Harry Shum,et al.  A Bayesian mixture model for multi-view face alignment , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[45]  Kevin Walker,et al.  Towards Real-Time Facial Landmark Detection in Depth Data Using Auxiliary Information , 2018, Symmetry.

[46]  Qiang Ji,et al.  Facial Landmark Detection: A Literature Survey , 2018, International Journal of Computer Vision.

[47]  Matti Pietikäinen,et al.  Computer Vision Using Local Binary Patterns , 2011, Computational Imaging and Vision.

[48]  Christian Huitema,et al.  Real-time 3D face tracking based on active appearance model constrained by depth data , 2014, Image Vis. Comput..

[49]  Tal Hassner,et al.  Deep Face Recognition: A Survey , 2018, 2018 31st SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI).