Modelling human perception of static facial expressions

A recent internet based survey of over 35,000 samples has shown that when different human observers are asked to assign labels to static human facial expressions, different individuals categorize differently the same image. This fact results in a lack of an unique ground-truth, an assumption held by the large majority of existing models for classification. This is especially true for highly ambiguous expressions, especially in the lack of a dynamic context. In this paper we propose to address this shortcoming by the use of discrete choice models (DCM) to describe the choice a human observer is faced to when assigning labels to static facial expressions. Different models of increasing complexity are specified to capture the causal effect between features of an image and its associated expression, using several combinations of different measurements. The sets of measurements we used are largely inspired by FACS but also introduce some new ideas, specific to a static framework. These models are calibrated using maximum likelihood techniques and they are compared with each other using a likelihood ratio test, in order to test for significance in the improvement resulting from adding supplemental features. Through a cross-validation procedure we assess the validity of our approach against overfitting and we provide a comparison with an alternative model based on Neural Networks for benchmark purposes.

[1]  Simon Baker,et al.  Active Appearance Models Revisited , 2004, International Journal of Computer Vision.

[2]  Qiang Ji,et al.  Active and dynamic information fusion for facial expression understanding from image sequences , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Maja Pantic,et al.  Fully Automatic Facial Action Unit Detection and Temporal Analysis , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[4]  Gwen Littlewort,et al.  Recognizing facial expression: machine learning and application to spontaneous behavior , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Takeo Kanade,et al.  Recognizing Action Units for Facial Expression Analysis , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Maja Pantic,et al.  An Expert System for Recognition of Facial Actions and their Intensity , 2000, AAAI/IAAI.

[7]  Moshe Ben-Akiva,et al.  Discrete Choice Analysis: Theory and Application to Travel Demand , 1985 .

[8]  Christian Wallraven,et al.  Learning from humans: Computational modeling of face recognition , 2004, Network.

[9]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Yongzhao Zhan,et al.  Facial expression features extraction based on Gabor wavelet transformation , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[11]  Jeffrey F. Cohn,et al.  Foundations of human computing: facial expression and emotion , 2006, ICMI '06.

[12]  Donato Malerba,et al.  Comparing Dissimilarity Measures For Probabilistic Symbolic Objects , 2002 .

[13]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[14]  Maja Pantic,et al.  Combined Support Vector Machines and Hidden Markov Models for Modeling Facial Action Temporal Dynamics , 2007, ICCV-HCI.

[15]  Solomon Kullback,et al.  Information Theory and Statistics , 1960 .

[16]  Michel Bierlaire,et al.  BIOGEME: a free package for the estimation of discrete choice models , 2003 .

[17]  Enrico Grosso,et al.  Generalization in Holistic versus Analytic Processing of Faces , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[18]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Franck Davoine,et al.  Appearance factorization based facial expression recognition and synthesis , 2004, ICPR 2004.

[20]  Alex Pentland,et al.  Coding, Analysis, Interpretation, and Recognition of Facial Expressions , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Takeo Kanade,et al.  Comprehensive database for facial expression analysis , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[22]  Takeo Kanade,et al.  Recognizing lower face action units for facial expression analysis , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[23]  Maja Pantic,et al.  Web-based database for facial expression analysis , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[24]  K. Mardia,et al.  Statistical Shape Analysis , 1998 .

[25]  M. Bartlett,et al.  Machine Analysis of Facial Expressions , 2007 .

[26]  S. Carey,et al.  Why faces are and are not special: an effect of expertise. , 1986 .

[27]  Anil K. Jain,et al.  Handbook of Face Recognition, 2nd Edition , 2011 .

[28]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[29]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[30]  Rogério Schmidt Feris,et al.  Manifold Based Analysis of Facial Expression , 2004, CVPR Workshops.

[31]  Jean-Philippe Thiran,et al.  Discrete Choice Models for Static Facial Expression Recognition , 2006, ACIVS.

[32]  A. Schwaninger Perception and representation of faces , 2003 .

[33]  Maja Pantic,et al.  Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[34]  D. McFadden Econometric Models of Probabilistic Choice , 1981 .

[35]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[36]  Peter Meer,et al.  Edge Detection with Embedded Confidence , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Wagner A. Kamakura,et al.  Book Review: Structural Analysis of Discrete Data with Econometric Applications , 1982 .

[38]  M. Turk,et al.  Probabilistic expression analysis on manifolds , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[39]  Nicu Sebe,et al.  Facial expression recognition from video sequences: temporal and static modeling , 2003, Comput. Vis. Image Underst..

[40]  Zhengyou Zhang,et al.  Comparison between geometry-based and Gabor-wavelets-based facial expression recognition using multi-layer perceptron , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[41]  Takeo Kanade,et al.  Evaluation of Gabor-wavelet-based facial action unit recognition in image sequences of increasing complexity , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[42]  Takeo Kanade,et al.  A statistical approach to 3d object detection applied to faces and cars , 2000 .

[43]  P. Ekman An argument for basic emotions , 1992 .

[44]  C. Manski The structure of random utility models , 1977 .

[45]  Takeo Kanade,et al.  Facial Expression Analysis , 2011, AMFG.

[46]  J. N. Bassili Facial motion in the perception of faces and of emotional expression. , 1978, Journal of experimental psychology. Human perception and performance.

[47]  Michel Bierlaire,et al.  A theoretical analysis of the cross-nested logit model , 2006, Ann. Oper. Res..

[48]  S. Ullman,et al.  Generalization to Novel Images in Upright and Inverted Faces , 1993, Perception.

[49]  Jean-Philippe Thiran,et al.  Facial Expressions Evaluation Survey , 2007 .

[50]  Maja Pantic,et al.  Non-rigid registration using free-form deformations for recognition of facial actions and their temporal dynamics , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.

[51]  Michael J. Lyons,et al.  Coding facial expressions with Gabor wavelets , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[52]  Heinrich H. Bülthoff,et al.  Proceedings of the First IEEE International Workshop on Biologically Motivated Computer Vision , 2000 .

[53]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[54]  Iven Van Mechelen,et al.  Probabilistic feature analysis of facial perception of emotions , 2005 .

[55]  Beat Fasel,et al.  Automatic facial expression analysis: a survey , 2003, Pattern Recognit..

[56]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[57]  Pat Langley,et al.  An Analysis of Bayesian Classifiers , 1992, AAAI.

[58]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[59]  Roberto Cabeza,et al.  Features are Also Important: Contributions of Featural and Configural Processing to Face Recognition , 2000, Psychological science.

[60]  S. Carey Becoming a face expert. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[61]  Gwen Littlewort,et al.  Fully Automatic Facial Action Recognition in Spontaneous Behavior , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[62]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[63]  Adrian Schwaninger,et al.  Role of Featural and Configural Information in Familiar and Unfamiliar Face Recognition , 2002, Biologically Motivated Computer Vision.

[64]  M. Farah,et al.  What is "special" about face perception? , 1998, Psychological review.

[65]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..