Estimation of facial action intensities on 2D and 3D data

The paradigm of Facial Action Coding System (FACS) offers a comprehensive solution for facial expression measurements. FACS defines atomic expression components called Action Units (AUs) and describes their strength on a five-point scale. Despite considerable progress in AU detection, the AU intensity estimation has not been much investigated. We propose SVM-based regression on AU feature space, and investigate person-independent estimation of 25 AUs that appear singly or in various combinations. Our method is novel in that we use regression for estimating intensities and comparatively evaluate the performances of 2D and 3D modalities. The proposed technique shows improvements over the state-of-the-art person-independent estimation, and that especially the 3D modality offers significant advantages for intensity coding. We have also found that fusion of 2D and 3D can boost the estimation performance, especially when modalities compensate for each other's shortcomings.

[1]  Gwen Littlewort,et al.  Automatic Recognition of Facial Actions in Spontaneous Expressions , 2006, J. Multim..

[2]  Song Zhang,et al.  High-resolution, real-time 3D imaging with fringe analysis , 2010, Journal of Real-Time Image Processing.

[3]  Gerhard Rigoll,et al.  A novel sensor system for 3D face scanning based on infrared coded light , 2008, Electronic Imaging.

[4]  Daniel S. Messinger,et al.  A framework for automated measurement of the intensity of non-posed Facial Action Units , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[5]  Qingshan Liu,et al.  RankBoost with l1 regularization for facial expression recognition and intensity estimation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[6]  Emmanuel Dellandréa,et al.  AU recognition on 3D faces based on an extended statistical facial feature model , 2010, 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[7]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[8]  Arman Savran,et al.  Facial action unit detection: 3D versus 2D modality , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[9]  Mann Oo. Hay Emotion recognition in human-computer interaction , 2012 .

[10]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[11]  Nikolaos D. Doulamis An adaptable emotionally rich pervasive computing system , 2006, 2006 14th European Signal Processing Conference.

[12]  Beat Fasel,et al.  Automati Fa ial Expression Analysis: A Survey , 1999 .