Emotion‐aware polarity lexicons for Twitter sentiment analysis

Theoretical frameworks in psychology map the relationships between emotions and sentiments. In this paper we study the role of such mapping for computational emotion detection from text (e.g. social media) with a aim to understand the usefulness of an emotion-rich corpus of documents (e.g. tweets) to learn polarity lexicons for sentiment analysis. We propose two different methods that leverage a corpus of emotion-labelled tweets to learn word-polarity lexicons. The proposed methods model the emotion corpus using a generative unigram mixture model (UMM), combined with the emotion-sentiment mapping proposed in Psychology for automated generation of word-polarity lexicons that capture emotion-rich vocabulary. We comparatively evaluate the quality of the proposed mixture model in learning emotion-aware sentiment lexicons with those generated using supervised latent dirichlet allocation (sLDA) and word-document frequency (WDF) statistics. Sentiment analysis experiments on benchmark Twitter data sets confirm the quality of our proposed lexicons. Further a comparative analysis with sLDA, WDF based emotion-aware lexicons and standard sentiment lexicons that are agnostic to emotion knowledge suggest that the proposed lexicons lead to a significantly better performance in both sentiment classification and sentiment intensity prediction tasks.

[1]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[2]  R. Plutchik A GENERAL PSYCHOEVOLUTIONARY THEORY OF EMOTION , 1980 .

[3]  Erkki Sutinen,et al.  Are They Different? Affect, Feeling, Emotion, Sentiment, and Opinion Detection in Text , 2014, IEEE Transactions on Affective Computing.

[4]  Uzay Kaymak,et al.  Exploiting Emoticons in Polarity Classification of Text , 2015, J. Web Eng..

[5]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[6]  K. P. Chow,et al.  A Topic Model for Building Fine-grained Domain-specific Emotion Lexicon , 2014, ACL.

[7]  Vidyasagar Potdar,et al.  Emotion detection state of the art , 2012, CUBE.

[8]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[9]  Stewart Massie,et al.  Generating a Word-Emotion Lexicon from #Emotional Tweets , 2014, *SEMEVAL.

[10]  Erik Cambria,et al.  SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment Analysis , 2014, AAAI.

[11]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[12]  Daling Wang,et al.  A word-emoticon mutual reinforcement ranking model for building sentiment lexicon from massive collection of microblogs , 2014, World Wide Web.

[13]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[14]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[15]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[16]  P. Ekman An argument for basic emotions , 1992 .

[17]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[18]  Zengfu Wang,et al.  An Emotion Space Model for Recognition of Emotions in Spoken Chinese , 2005, ACII.

[19]  Erik Cambria,et al.  EmoSenticSpace: A novel framework for affective common-sense reasoning , 2014, Knowl. Based Syst..

[20]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[21]  Vidyasagar Potdar,et al.  Computational approaches for emotion detection in text , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[22]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[23]  Mingliang Chen,et al.  Building emotional dictionary for sentiment analysis of online news , 2014, World Wide Web.

[24]  Diana Inkpen,et al.  Hierarchical Approach to Emotion Recognition and Classification in Texts , 2010, Canadian Conference on AI.

[25]  Wei Gao,et al.  Build Emotion Lexicon from Microblogs by Combining Effects of Seed Words and Emoticons in a Heterogeneous Graph , 2015, HT.

[26]  Saif Mohammad,et al.  #Emotional Tweets , 2012, *SEMEVAL.

[27]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[28]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[29]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[30]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[31]  Yiqun Liu,et al.  Microblog Sentiment Analysis with Emoticon Space Model , 2014, Journal of Computer Science and Technology.

[32]  Ellen Riloff,et al.  Bootstrapped Learning of Emotion Hashtags #hashtags4you , 2013, WASSA@NAACL-HLT.

[33]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[34]  Stewart Massie,et al.  Lexicon Generation for Emotion Detection from Text , 2017, IEEE Intelligent Systems.