Fusion of Smile, Valence and NGram Features for Automatic Affect Detection

This paper addresses the fusion of smile, as a visual feature, with text obtained from speech transcription. Although the influence of smile on semantic data has been considered before, multiple fusion strategies have not been systematically investigated. The problem is inherently multi-modal, which adds to its difficulty. The goal of this article is to investigate how such fusion can increase the interactivity of a dialogue system by improving the automatic detection rate of the sentiments expressed by a human user. Our approach makes two original contributions. First, we use segmented detection for text data, rather than predicting a single label for an entire document (video). Second, we study the importance of several features in the multi-modal fusion process. Our approach uses basic features, such as NGrams, smile presence, and valence, to find the best fusion strategy. Moreover, we test a two-level classification approach using an SVM.
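The feature-level fusion described above can be illustrated with a minimal sketch, assuming scikit-learn and purely illustrative data: NGram counts from transcribed segments are concatenated with per-segment smile-presence and valence values, and the fused vectors are fed to a linear SVM. The segment texts, feature values, and the choice of `LinearSVC` are assumptions for illustration, not the paper's exact pipeline.

```python
# Hedged sketch of early (feature-level) fusion of NGram, smile-presence,
# and valence features, followed by an SVM. All data are illustrative.
import numpy as np
from scipy.sparse import hstack, csr_matrix
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

# Toy transcribed segments with per-segment smile presence (0/1) and valence.
segments = ["i really enjoyed that", "this is terrible",
            "what a great day", "i feel awful"]
smile   = np.array([[1.0], [0.0], [1.0], [0.0]])    # visual: smile detected?
valence = np.array([[0.8], [-0.7], [0.9], [-0.8]])  # lexical valence score
labels  = [1, 0, 1, 0]                              # 1 = positive affect

# NGram features (unigrams + bigrams) from the transcription.
vec = CountVectorizer(ngram_range=(1, 2))
ngrams = vec.fit_transform(segments)

# Feature-level fusion: concatenate all modalities per segment.
fused = hstack([ngrams, csr_matrix(smile), csr_matrix(valence)])

# Linear SVM on the fused representation.
clf = LinearSVC()
clf.fit(fused, labels)
print(clf.predict(fused).tolist())
```

In a two-level setup, the per-modality classifier outputs at the first level could themselves serve as inputs to a second SVM, rather than concatenating raw features as done here.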
