Hierarchical Group-level Emotion Recognition in the Wild

We propose a method for group-level emotion recognition in the wild. The main novelty of our method lies with the recognition of group-level emotions using a hierarchical classification approach. We consider that using the facial expressions of people will only be effective in differentiating images labeled as "Positive" because those labeled as "Neutral" or "Negative" are likely to include similar facial expressions (i.e., less discriminative). Therefore, we first perform binary classification based on facial expression recognition to distinguish "Positive" labels that include discriminative facial expressions (e.g., smile) from the others. We evaluate outcomes that are not classified as "Positive" at the first classification by exploiting scene features that describe what type of events (e.g., demonstration or funeral) are taking place in the image. Classification using scene features will not only be effective in differentiating "Negative" and "Neutral" labels but also in recognizing "Positive" labels, where facial expression features show less discriminative characteristics. The other novelty of the proposed method is to the exploitation of visual attention. Using visual attention allows us to estimate which faces are the main subjects in the target image, thereby suppressing the influences of faces in the background that contribute less to group-level emotion. We demonstrate the effectiveness of our proposed method through experiments using a public dataset.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Stephan K. Chalup,et al.  Group emotion recognition in the wild by combining deep neural networks for facial expression classification and scene-context analysis , 2017, ICMI.

[3]  Bo Sun,et al.  A new deep-learning framework for group emotion recognition , 2017, ICMI.

[4]  Tamás D. Gedeon,et al.  Collecting Large, Richly Annotated Facial-Expression Databases from Movies , 2012, IEEE MultiMedia.

[5]  Mohan S. Kankanhalli,et al.  Emotional Attention: A Study of Image Sentiment and Visual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6]  B. Balaji,et al.  Multi-level feature fusion for group-level emotion recognition , 2017, ICMI.

[7]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[8]  Kai Wang,et al.  Group emotion recognition with individual facial emotion CNNs and global image based CNNs , 2017, ICMI.

[9]  Ioannis Patras,et al.  Comparison of Fine-Tuning and Extension Strategies for Deep Convolutional Neural Networks , 2017, MMM.

[10]  Nhan Le Thanh,et al.  Detecting depression using multimodal approach of emotion recognition , 2012, 2012 IEEE International Conference on Complex Systems (ICCS).

[11]  Javier Hernandez,et al.  Mood meter: counting smiles in the wild , 2012, UbiComp.

[12]  Li-Wei Kang,et al.  Seeing through the expression: Bridging the gap between expression and emotion recognition , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[13]  Qi Zhao,et al.  Saliency in Crowd , 2014, ECCV.

[14]  M. Bradley,et al.  Affective reactions to acoustic stimuli. , 2000, Psychophysiology.

[15]  Bhanu Pratap Singh Rawat,et al.  Group Affect Prediction Using Emotion Heatmaps and Scene Information , 2017, ArXiv.

[16]  Matti Pietikäinen,et al.  Multimodal Framework for Analyzing the Affect of a Group of People , 2018, IEEE Transactions on Multimedia.

[17]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[18]  Angelo Cangelosi,et al.  Emotion recognition in the wild using deep neural networks and Bayesian classifiers , 2017, ICMI.

[19]  Junping Du,et al.  Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Pablo Laguna,et al.  Human emotion recognition using heart rate variability analysis with spectral bands based on respiration , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21]  Asaf Varol,et al.  A research on estimation of emotion using EEG signals and brain computer interfaces , 2017, 2017 International Conference on Computer Science and Engineering (UBMK).

[22]  Tamás D. Gedeon,et al.  Static facial expression analysis in tough conditions: Data, evaluation protocol and benchmark , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[23]  Arman Savran,et al.  Automatic Detection of Emotion Valence on Faces Using Consumer Depth Cameras , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[24]  Xin Guo,et al.  Group-level emotion recognition using deep models on image scene, faces, and skeletons , 2017, ICMI.

[25]  Shan Li,et al.  Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[26]  Cheonshu Park,et al.  An Emotion Expression System for the Emotional Robot , 2007, 2007 IEEE International Symposium on Consumer Electronics.

[27]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[28]  Deborah Sturm,et al.  eMot-iCan: Design of an assessment game for emotion recognition in players with Autism , 2016, 2016 IEEE International Conference on Serious Games and Applications for Health (SeGAH).

[29]  Nicu Sebe,et al.  The more the merrier: Analysing the affect of a group of people in images , 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).