Comparative Evaluation of 3 D versus 2 D Modality for Automatic Detection of Facial Action Units

Automatic detection of facial expressions attracts great attention due to its potential applications in human computer interaction as well as in human facial behavior research. Most of the research has so far been performed in 2D. However, as the limitations of 2D data are understood, expression analysis research is being pursued in 3D face modality. 3D can capture true facial surface data and is less disturbed by illumination and head pose. At this junction we have conducted a comparative evaluation of 3D and 2D face modalities. We have investigated extensively 25 Action Units (AU) defined in the Facial Action Coding System. For fairness we map facial surface geometry into 2D and apply totally data-driven techniques in order to avoid biases due to design. We have demonstrated that overall 3D data performs better, especially for lower face AUs and that there is room for improvement by fusion of 2D and 3D modalities. Our study involves the determination of the best feature set from 2D and 3D modalities and of the most effective classifier, both from among several alternatives. Our detailed analysis puts into evidence the merits and some shortcomings of 3D modality over 2D in classifying facial expressions from single images.

[1]  Bruno Lévy,et al.  Least squares conformal maps for automatic texture atlas generation , 2002, ACM Trans. Graph..

[2]  Maja Pantic,et al.  Automatic Analysis of Facial Expressions: The State of the Art , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Hasan Demirel,et al.  Facial Expression Recognition Using 3D Facial Feature Distances , 2007, ICIAR.

[4]  Gwen Littlewort,et al.  Toward Practical Smile Detection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  K. Craig,et al.  Genuine, suppressed and faked facial expressions of pain in children , 2006, Pain.

[6]  Simon Baron-Cohen,et al.  Reading the mind in the face: A cross-cultural and developmental study , 1996 .

[7]  M. Sayette,et al.  Facial reactions to smoking cues relate to ambivalence about smoking. , 2008, Psychology of addictive behaviors : journal of the Society of Psychologists in Addictive Behaviors.

[8]  Maja Pantic,et al.  Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Gerhard Rigoll,et al.  A novel sensor system for 3D face scanning based on infrared coded light , 2008, Electronic Imaging.

[10]  Takeo Kanade,et al.  Recognizing Action Units for Facial Expression Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Dimiter M. Dimitrov,et al.  Linking Maternal Perceptions to Behavior: Nurturing Attitudes and Facial Expressions of Affect , 2005 .

[12]  Lijun Yin,et al.  Recognizing partial facial action units based on 3D dynamic range data for facial expression recognition , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[13]  Takeo Kanade,et al.  Comprehensive database for facial expression analysis , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[14]  Amanda C.C. Williams,et al.  Facial expression of pain: An evolutionary account , 2002, Behavioral and Brain Sciences.

[15]  Bruno Lévy,et al.  Mesh parameterization: theory and practice , 2007, SIGGRAPH Courses.

[16]  P. Ekman,et al.  What the face reveals : basic and applied studies of spontaneous expression using the facial action coding system (FACS) , 2005 .

[17]  P. Ekman,et al.  Facial action coding system: a technique for the measurement of facial movement , 1978 .

[18]  Gwen Littlewort,et al.  Automatic Recognition of Facial Actions in Spontaneous Expressions , 2006, J. Multim..

[19]  Song Zhang,et al.  High-resolution, real-time 3D imaging with fringe analysis , 2010, Journal of Real-Time Image Processing.

[20]  Alexandru Telea,et al.  An Image Inpainting Technique Based on the Fast Marching Method , 2004, J. Graphics, GPU, & Game Tools.

[21]  Marian Stewart Bartlett,et al.  Classifying Facial Actions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Beat Fasel,et al.  Automati Fa ial Expression Analysis: A Survey , 1999 .

[23]  Arman Savran,et al.  Bosphorus Database for 3D Face Analysis , 2008, BIOID.

[24]  Michael G. Strintzis,et al.  3D facial expression recognition using swarm intelligence , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[25]  Qiang Ji,et al.  Facial Action Unit Recognition by Exploiting Their Dynamic and Semantic Relationships , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[27]  Arman Savran,et al.  Facial action unit detection: 3D versus 2D modality , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[28]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  M. Knapp,et al.  Nonverbal communication in human interaction , 1972 .

[30]  Sen Wang,et al.  High resolution tracking of non-rigid 3D motion of densely sampled data using harmonic maps , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[31]  Eva Krumhuber,et al.  Are you joking? The moderating role of smiles in the perception of verbal statements , 2009 .

[32]  Jun Wang,et al.  3D Facial Expression Recognition Based on Primitive Surface Feature Distribution , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[33]  Sotiris Malassiotis,et al.  Real-time 2D+3D facial action and expression recognition , 2010, Pattern Recognit..

[34]  P. Ekman,et al.  The asymmetry of facial actions is inconsistent with models of hemispheric specialization. , 1985, Psychophysiology.

[35]  Ioannis Pitas,et al.  Application of non-negative and local non negative matrix factorization to facial expression recognition , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[36]  Qingshan Liu,et al.  Boosting Coded Dynamic Features for Facial Action Units and Facial Expression Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Changbo Hu,et al.  AAM derived face representations for robust facial action recognition , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[38]  Michael G. Strintzis,et al.  Bilinear elastically deformable models with application to 3D face and facial expression recognition , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[39]  P. Ekman,et al.  Is the startle reaction an emotion? , 1985, Journal of personality and social psychology.

[40]  Sen Wang,et al.  Conformal Geometry and Its Applications on 3D Shape Matching, Recognition, and Stitching , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Stan Z. Li,et al.  Learning spatially localized, parts-based representation , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[42]  J. Cohn,et al.  A Psychometric Evaluation of the Facial Action Coding System for Assessing Spontaneous Expression , 2001 .

[43]  P. Ekman,et al.  Appearing truthful generalizes across different deception situations. , 2004, Journal of personality and social psychology.

[44]  Thomas S. Huang,et al.  3D facial expression recognition based on automatically selected features , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[45]  P. Ekman,et al.  Constants across cultures in the face and emotion. , 1971, Journal of personality and social psychology.

[46]  J. Cohn,et al.  Impact of depression on response to comedy: a dynamic facial coding analysis. , 2007, Journal of abnormal psychology.

[47]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..