An emotion recognition mechanism based on the combination of mutual information and semantic clues

Recently, text-based emotion recognition has become more and more highly regarded. After exploring the related research, we have found that: (1) In the semantics-based solution, the newly emerging compound words cannot be handled. (2) In the statistics-based solution, the emotional values are computed only via the linear text clues; it lacks syntactic and semantic structural information. Therefore, in our effort to overcome the two concerns above, in the system presented here we construct an extensible lexicon and use semantic clues to analyze the sentences. In this work, we collected the sentences posted to the Plurk website as our corpus. The emoticons are classified into four types based on Thayer’s 2-D Model which is composed of valence (positive/negative emotions) and arousal (the strength of emotions). The system will pre-process the sentence to eliminate the useless information, and then transform it to an emotion lexicon. Additionally, this research analyzes three kinds of semantic clues: negation, transition, and coordinating conjunctions. The final emotion is decided by SVM and the merging algorithm proposed in this work. Moreover, the recognition rate is promoted from 40.3 to 68.15% after the emotion lexicon is improved, and the semantic structural information is applied. The experimental results show that the work is promising.

[1]  Yuji Matsumoto,et al.  Chinese Word Segmentation by Classification of Characters , 2005, Int. J. Comput. Linguistics Chin. Lang. Process..

[2]  Yuji Matsumoto,et al.  Combination of Machine Learning Methods for Optimum Chinese Word Segmentation , 2005, IJCNLP.

[3]  Yuji Matsumoto,et al.  Chunking with Support Vector Machines , 2001, NAACL.

[4]  Keh-Jiann Chen,et al.  Automatic Semantic Role Assignment for a Tree Structure , 2004, SIGHAN@ACL.

[5]  Yuji Matsumoto,et al.  Use of Support Vector Learning for Chunk Identification , 2000, CoNLL/LLL.

[6]  Qun Liu,et al.  HHMM-based Chinese Lexical Analyzer ICTCLAS , 2003, SIGHAN.

[7]  R. Thayer The biopsychology of mood and arousal , 1989 .

[8]  Hsin-Hsi Chen,et al.  Writer Meets Reader: Emotion Analysis of Social Media from Both the Writer's and Reader's Perspectives , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[9]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[10]  Loïc Kessous,et al.  Multimodal emotion recognition from expressive faces, body gestures and speech , 2007, AIAI.

[11]  Keh-Jiann Chen,et al.  Chinese Treebanks and Grammar Extraction , 2004, IJCNLP.

[12]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[13]  Nianwen Xue,et al.  Chinese Word Segmentation as Character Tagging , 2003, ROCLING/IJCLCLP.

[14]  Hsin-Hsi Chen,et al.  Building Emotion Lexicon from Weblog Corpora , 2007, ACL.

[15]  L. C. De Silva,et al.  Bimodal emotion recognition , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[16]  Ishwar K. Sethi,et al.  Investigation of combining SVM and decision tree for emotion classification , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[17]  I. Song,et al.  Working Set Selection Using Second Order Information for Training Svm, " Complexity-reduced Scheme for Feature Extraction with Linear Discriminant Analysis , 2022 .

[18]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[19]  Hugo Liu,et al.  A Corpus-based Approach to Finding Happiness , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.