Generating a Word-Emotion Lexicon from #Emotional Tweets

Research in emotion analysis of text suggest that emotion lexicon based features are superior to corpus based n-gram features. However the static nature of the general purpose emotion lexicons make them less suited to social media analysis, where the need to adopt to changes in vocabulary usage and context is crucial. In this paper we propose a set of methods to extract a word-emotion lexicon automatically from an emotion labelled corpus of tweets. Our results confirm that the features derived from these lexicons outperform the standard Bag-of-words features when applied to an emotion classification task. Furthermore, a comparative analysis with both manually crafted lexicons and a state-of-the-art lexicon generated using Point-Wise Mutual Information, show that the lexicons generated from the proposed methods lead to significantly better classification performance.

[1]  Anthony C. Boucouvalas,et al.  Representing Emotional Momentum within Expressive Internet Communication , 2006, EuroIMSA.

[2]  Saif Mohammad,et al.  Portable Features for Classifying Emotional Text , 2012, NAACL.

[3]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[4]  Hsin-Hsi Chen,et al.  Emotion Classification Using Web Blog Corpora , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[5]  Sutanu Chakraborti,et al.  Finding Relevant Tweets , 2012, WAIM.

[6]  Cecilia Ovesdotter Alm,et al.  Emotions from Text: Machine Learning for Text-based Emotion Prediction , 2005, HLT.

[7]  P. Shaver,et al.  Emotion knowledge: further exploration of a prototype approach. , 1987, Journal of personality and social psychology.

[8]  William M. Pottenger,et al.  Classification of Emotions in Internet Chat: An Application of Machine Learning Using Speech Phonemes , 2003 .

[9]  Hugo Liu,et al.  A Corpus-based Approach to Finding Happiness , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[10]  Stan Szpakowicz,et al.  Using Roget’s Thesaurus for Fine-grained Emotion Recognition , 2008, IJCNLP.

[11]  Virginia Francisco,et al.  Automated Mark Up of Affective Information in English Texts , 2006, TSD.

[12]  Mark Steyvers,et al.  Identifying Emotions, Intentions, and Attitudes in Text Using a Game with a Purpose , 2010, HLT-NAACL 2010.

[13]  Helmut Prendinger,et al.  A Linguistic Interpretation of the OCC Emotion Model for Affect Sensing from Text , 2009, Affective Information Processing.

[14]  Sanda M. Harabagiu,et al.  EmpaTweet: Annotating and Detecting Emotions on Twitter , 2012, LREC.

[15]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[16]  Tapas Ray The ‘story’ of digital excess in revolutions of the Arab Spring , 2011 .

[17]  Mitsuru Ishizuka,et al.  Emotion Estimation and Reasoning Based on Affective Textual Interaction , 2005, ACII.

[18]  Saif Mohammad,et al.  #Emotional Tweets , 2012, *SEMEVAL.

[19]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[20]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[21]  Karthik Visweswariah,et al.  Unsupervised Solution Post Identification from Discussion Forums , 2014, ACL.

[22]  Julia Skinner,et al.  Social Media and Revolution: The Arab Spring and the Occupy Movement as Seen through Three Information Studies Paradigms , 2011 .

[23]  Ellen Riloff,et al.  Bootstrapped Learning of Emotion Hashtags #hashtags4you , 2013, WASSA@NAACL-HLT.

[24]  P. Ekman An argument for basic emotions , 1992 .

[25]  R. Plutchik A GENERAL PSYCHOEVOLUTIONARY THEORY OF EMOTION , 1980 .

[26]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[27]  Karthik Visweswariah,et al.  Two-part segmentation of text documents , 2012, CIKM '12.

[28]  Mitsuru Ishizuka,et al.  Recognition of Affect, Judgment, and Appreciation in Text , 2010, COLING.

[29]  Deepak Khemani,et al.  Interpretable and reconfigurable clustering of document datasets by deriving word-based rules , 2011, Knowledge and Information Systems.

[30]  C. W. Hughes Emotion: Theory, Research and Experience , 1982 .

[31]  W. Bruce Croft,et al.  Statistical language modeling for information retrieval , 2006, Annu. Rev. Inf. Sci. Technol..

[32]  Mingliang Chen,et al.  BuildingWord-Emotion Mapping Dictionary for Online News , 2012, SDAD@ECML/PKDD.

[33]  Saif Mohammad,et al.  Tracking Sentiment in Mail: How Genders Differ on Emotional Axes , 2011, WASSA@ACL.