A Dictionary Learning-Based 3D Morphable Shape Model

Face analysis from 2D images and videos is a central task in many multimedia applications. Methods developed to this end perform either face recognition or facial expression recognition, and in both cases results are negatively influenced by variations in pose, illumination, and resolution of the face. Such variations have a lower impact on 3D face data, which has given rise to the idea of using a 3D morphable model as an intermediate tool to enhance face analysis on 2D data. In this paper, we propose a new approach for constructing a 3D morphable shape model (called DL-3DMM) and show that our solution can reach the accuracy of deformation required in applications where fine details of the face are concerned. To construct the model, we start from a set of 3D face scans with large variability in terms of ethnicity and expressions. Across these training scans, we compute a point-to-point dense alignment, which remains accurate even in the presence of topological variations of the face. The DL-3DMM is constructed by learning a dictionary of basis components on the aligned scans. The model is then fitted to 2D target faces using an efficient regularized ridge regression guided by 2D/3D facial landmark correspondences in order to generate pose-normalized face images. Comparison between the DL-3DMM and the standard PCA-based 3DMM demonstrates that, in general, a lower reconstruction error can be obtained with our solution. Application to action unit detection and emotion recognition from 2D images and videos shows results competitive with state-of-the-art methods on two benchmark datasets.
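The landmark-guided fitting step mentioned in the abstract can be illustrated with a minimal sketch of plain ridge regression. This is not the paper's implementation: it assumes the pose has already been estimated, so that the mean-shape landmarks and the dictionary atoms restricted to the landmarks are available in 2D, and it uses a single regularization weight `lam`. The names `ridge_fit`, `D`, `mean_lm`, and `target_lm` are hypothetical, and the data are toy values. Plain Python is used so the sketch has no dependencies; in practice a linear algebra library would solve the same system.

```python
def ridge_fit(D, residual, lam):
    """Return a minimizing ||residual - D a||^2 + lam * ||a||^2.

    D is a list of rows (n_obs x k), residual a list of n_obs floats,
    and lam > 0 the regularization weight (all hypothetical inputs).
    """
    n, k = len(D), len(D[0])
    # Normal equations: (D^T D + lam I) a = D^T residual
    A = [[sum(D[r][i] * D[r][j] for r in range(n)) + (lam if i == j else 0.0)
          for j in range(k)] for i in range(k)]
    b = [sum(D[r][i] * residual[r] for r in range(n)) for i in range(k)]
    # Gaussian elimination with partial pivoting
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, k):
            f = A[r][col] / A[col][col]
            for c in range(col, k):
                A[r][c] -= f * A[col][c]
            b[r] -= f * b[col]
    # Back substitution
    a = [0.0] * k
    for i in reversed(range(k)):
        s = sum(A[i][j] * a[j] for j in range(i + 1, k))
        a[i] = (b[i] - s) / A[i][i]
    return a


# Toy setup: 3 landmarks (x, y stacked -> 6 values) and 2 dictionary atoms.
D = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0],
     [2.0, 0.0], [0.0, 2.0], [1.0, -1.0]]
mean_lm = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]   # mean-shape landmarks
true_alpha = [0.5, -1.0]                          # coefficients to recover
target_lm = [m + sum(d * a for d, a in zip(row, true_alpha))
             for m, row in zip(mean_lm, D)]

# The regression is run on the landmark displacement from the mean shape.
residual = [t - m for t, m in zip(target_lm, mean_lm)]

alpha_exact = ridge_fit(D, residual, lam=1e-9)   # almost no regularization
alpha_shrunk = ridge_fit(D, residual, lam=7.0)   # stronger regularization
print(alpha_exact, alpha_shrunk)
```

With a negligible `lam` the coefficients match the ones used to generate the toy target, while a larger `lam` shrinks them toward zero, which keeps the deformed model closer to the mean shape when the landmarks are noisy.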
