论文信息 - Action Unit Detection by Learning the Deformation Coefficients of a 3D Morphable Model

Action Unit Detection by Learning the Deformation Coefficients of a 3D Morphable Model

Facial Action Units (AUs) correspond to the deformation/contraction of individual facial muscles or their combinations. As such, each AU affects just a small portion of the face, with deformations that are asymmetric in many cases. Generating and analyzing AUs in 3D is particularly relevant for the potential applications it can enable. In this paper, we propose a solution for 3D AU detection and synthesis by developing on a newly defined 3D Morphable Model (3DMM) of the face. Differently from most of the 3DMMs existing in the literature, which mainly model global variations of the face and show limitations in adapting to local and asymmetric deformations, the proposed solution is specifically devised to cope with such difficult morphings. During a training phase, the deformation coefficients are learned that enable the 3DMM to deform to 3D target scans showing neutral and facial expression of the same individual, thus decoupling expression from identity deformations. Then, such deformation coefficients are used, on the one hand, to train an AU classifier, on the other, they can be applied to a 3D neutral scan to generate AU deformations in a subject-independent manner. The proposed approach for AU detection is validated on the Bosphorus dataset, reporting competitive results with respect to the state-of-the-art, even in a challenging cross-dataset setting. We further show the learned coefficients are general enough to synthesize realistic 3D face instances with AUs activation.

[1] Alberto Del Bimbo,et al. Dictionary Learning Based 3D Morphable Model Construction for Face Recognition with Varying Expression and Pose , 2015, 2015 International Conference on 3D Vision.

[2] Michael J. Black,et al. Learning a model of facial shape and expression from 4D scans , 2017, ACM Trans. Graph..

[3] Yaser Sheikh,et al. Modeling Facial Geometry Using Compositional VAEs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[4] Michael G. Strintzis,et al. Bilinear Models for 3-D Face and Facial Expression Recognition , 2008, IEEE Transactions on Information Forensics and Security.

[5] Matti Pietikäinen,et al. CS-3DLBP and geometry based person independent 3D facial action unit detection , 2013, 2013 International Conference on Biometrics (ICB).

[6] Chen Chen,et al. Dense Semantic and Topological Correspondence of 3D Faces without Landmarks , 2018, ECCV.

[7] Ling Guan,et al. A Deformable 3-D Facial Expression Model for Dynamic Human Emotional State Recognition , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[8] Liming Chen,et al. Muscular Movement Model-Based Automatic 3D/4D Facial Expression Recognition , 2015, IEEE Transactions on Multimedia.

[9] Matthew Turk,et al. A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[10] Lijun Yin,et al. Static and dynamic 3D facial expression recognition: A comprehensive survey , 2012, Image Vis. Comput..

[11] Feng Liu,et al. 3D Face Modeling From Diverse Raw Scan Data , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[12] Dimitrios Hatzinakos,et al. Emotion Recognition from 2D Facial Expressions , 2019, 2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE).

[13] Michael J. Black,et al. Generating 3D faces using Convolutional Mesh Autoencoders , 2018, ECCV.

[14] Alberto Del Bimbo,et al. A Dictionary Learning-Based 3D Morphable Shape Model , 2017, IEEE Transactions on Multimedia.

[15] C. Qi. Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .

[16] Juyong Zhang,et al. Disentangled Representation Learning for 3D Face Shape , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17] H. Zou,et al. Regularization and variable selection via the elastic net , 2005 .

[18] Tieniu Tan,et al. Combining Statistics of Geometrical and Correlative Features for 3D Face Recognition , 2006, BMVC.

[19] Alberto Del Bimbo,et al. A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Carlos D. Castillo,et al. SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild' , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21] Sami Romdhani,et al. A 3D Face Model for Pose and Illumination Invariant Face Recognition , 2009, 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance.

[22] Jian Sun,et al. Multimodal 2D+3D Facial Expression Recognition With Deep Fusion Convolutional Neural Network , 2017, IEEE Transactions on Multimedia.

[23] Syed Zulqarnain Gilani,et al. Dense 3D Face Correspondence , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Patrick J. Flynn,et al. Overview of the face recognition grand challenge , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[25] Marcus A. Magnor,et al. Sparse localized deformation components , 2013, ACM Trans. Graph..

[26] Dimitrios Hatzinakos,et al. Learned 3D Shape Representations Using Fused Geometrically Augmented Images: Application to Facial Expression and Action Unit Detection , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[27] BerrettiStefano,et al. Representation, Analysis, and Recognition of 3D Humans , 2018 .

[28] P. Ekman,et al. Facial action coding system: a technique for the measurement of facial movement , 1978 .

[29] Alan Brunton,et al. Multilinear Wavelets: A Statistical Shape Space for Human Faces , 2014, ECCV.

[30] Stefanos Zafeiriou,et al. Binary Pattern Analysis for 3D Facial Action Unit Detection , 2012, BMVC.

[31] Guillermo Sapiro,et al. Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[32] Thomas Gerig,et al. Gaussian Process Morphable Models , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Xiaoou Tang,et al. Automatic facial expression recognition on a single 3D face by exploring shape deformation , 2009, ACM Multimedia.

[35] Alberto Del Bimbo,et al. Rendering Realistic Subject-Dependent Expression Images by Learning 3DMM Deformation Coefficients , 2018, ECCV Workshops.

[36] Shaun J. Canavan,et al. Facial Action Unit Detection using 3D Facial Landmarks , 2020, ArXiv.

[37] Stefanos Zafeiriou,et al. Local normal binary patterns for 3D facial action unit detection , 2012, 2012 19th IEEE International Conference on Image Processing.

[38] Jun Wang,et al. A 3D facial expression database for facial behavior research , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[39] Catalin-Daniel Caleanu. Face expression recognition: A brief overview of the last decade , 2013, 2013 IEEE 8th International Symposium on Applied Computational Intelligence and Informatics (SACI).

[40] Yiying Tong,et al. FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.