Detecting Decision Ambiguity from Facial Images

In situations where potentially costly decisions are being made, people's faces tend to reflect their level of certainty about the appropriateness of the chosen decision; this fact is well documented in the psychological literature. In this paper, we propose a method that uses facial images to automatically detect a subject's state of decision ambiguity. To train and test the method, we collected a large-scale dataset from "Who Wants to Be a Millionaire?", a popular TV game show. The videos provide examples of various mental states of contestants, including uncertainty, doubt, and hesitation, and are annotated automatically from on-screen graphics. We formulate decision-ambiguity detection as binary classification: video clips in which a contestant asks for help (audience, friend, 50:50) are treated as positive samples, and clips in which the contestant answers directly as negative ones. We propose a baseline method combining a deep convolutional neural network with an SVM. The method achieves an error rate of 24%, while human volunteers on the same dataset have an error rate of 45%, close to chance.
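The two-stage pipeline described above (deep CNN features followed by an SVM classifier) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the real system would extract a per-clip descriptor from a pretrained face CNN, whereas here the features are simulated with random vectors whose class means differ, so only the classifier stage is shown.

```python
import numpy as np
from sklearn.svm import LinearSVC
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Simulated stand-in for CNN features: in the paper's setting, each clip
# would yield a fixed-length descriptor from a pretrained face network.
rng = np.random.default_rng(0)
n_clips, feat_dim = 400, 128

# Positive clips: contestant asked for help; negative: answered directly.
X_pos = rng.normal(loc=0.5, scale=1.0, size=(n_clips // 2, feat_dim))
X_neg = rng.normal(loc=-0.5, scale=1.0, size=(n_clips // 2, feat_dim))
X = np.vstack([X_pos, X_neg])
y = np.array([1] * (n_clips // 2) + [0] * (n_clips // 2))

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Linear SVM over the (simulated) deep features.
clf = LinearSVC(C=1.0)
clf.fit(X_tr, y_tr)
error_rate = 1.0 - accuracy_score(y_te, clf.predict(X_te))
print(f"held-out error rate: {error_rate:.2f}")
```

Because the simulated class means are well separated in 128 dimensions, the toy error rate is near zero; on the real, far harder video data the paper reports 24%.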
