Unsupervised graph-based pattern extraction for multilingual emotion classification

The connected society we live in today has allowed online users to willingly share opinions on an unprecedented scale. Motivated by the advent of mass opinion sharing, it is then crucial to devise algorithms that efficiently identify the emotions expressed within the opinionated content. Traditional opinion-based classifiers require extracting high-dimensional feature representations, which become computationally expensive to process and can misrepresent or deteriorate the accuracy of a classifier. In this paper, we propose an unsupervised graph-based approach for extracting Twitter-specific emotion-bearing patterns to be used as features. By utilizing a more representative list of patterns, as features, we improved the precision and recall of a given emotion classification task. Due to its novel bootstrapping process, the full system is also adaptable to different domains and languages. The experimented results demonstrate that the extracted patterns are effective in identifying emotions for English, Spanish, and French Twitter streams. We also provide detailed experiments and offer an extended version of our algorithm to support the classification of Indonesian microblog posts. Overall, our empirical experimented results demonstrate that the proposed approach bears desirable characteristics such as accuracy, generality, adaptability, minimal supervision, and coverage.

[1]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[2]  Turchi Marco,et al.  Comparative Experiments for Multilingual Sentiment Analysis using Machine Translation , 2012 .

[3]  Rada Mihalcea,et al.  Porting Multilingual Subjectivity Resources across Languages , 2013, IEEE Transactions on Affective Computing.

[4]  Yiqun Liu,et al.  Emotion Tokens: Bridging the Gap among Multilingual Twitter Sentiment Analysis , 2011, AIRS.

[5]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[6]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[7]  Agus Zainal Arifin,et al.  Emotion Detection of Tweets in Indonesian Language using Non-Negative Matrix Factorization , 2014 .

[8]  Ellen Riloff,et al.  Learning Extraction Patterns for Subjective Expressions , 2003, EMNLP.

[9]  P. Waila,et al.  Sentiment analysis of Movie reviews and Blog posts , 2013, 2013 3rd IEEE International Advance Computing Conference (IACC).

[10]  Changqin Quan,et al.  Customer emotion detection by emotion expression analysis on adverbs , 2015, Inf. Technol. Manag..

[11]  Alan F. Smeaton,et al.  Classifying sentiment in microblogs: is brevity an advantage? , 2010, CIKM.

[12]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[13]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[14]  Yi-Shin Chen,et al.  EmoViz: Mining the world's interest through emotion analysis , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[15]  M. Pell,et al.  Preferential decoding of emotion from human non-linguistic vocalizations versus speech prosody , 2015, Biological Psychology.

[16]  Mirna Adriani,et al.  Automatically Building a Corpus for Sentiment Analysis on Indonesian Tweets , 2014, PACLIC.

[17]  Cindy K. Chung,et al.  The development and psychometric properties of LIWC2007 , 2007 .

[18]  Luis F. Luna-Reyes,et al.  Information systems to support sustainable consumption and sustainable supply , 2015, Inf. Technol. Manag..

[19]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[20]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[21]  Berkant Barla Cambazoglu,et al.  A large-scale sentiment analysis for Yahoo! answers , 2012, WSDM '12.

[22]  Hang Lei,et al.  An Empirical Study on Sentiment Classification of Chinese Review using Word Embedding , 2015, PACLIC.

[23]  Takashi Inui,et al.  Latent Variable Models for Semantic Orientations of Phrases , 2006, EACL.

[24]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[25]  S. Albayrak,et al.  Language-Independent Twitter Sentiment Analysis , 2012 .

[26]  Ellen Riloff,et al.  Bootstrapped Learning of Emotion Hashtags #hashtags4you , 2013, WASSA@NAACL-HLT.

[27]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[28]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[29]  Arifin,et al.  CLASSIFICATION OF EMOTIONS IN INDONESIAN TEXTSUSING K-NN METHOD , 2012 .

[30]  Claire Cardie,et al.  Multi-Level Structured Models for Document-Level Sentiment Classification , 2010, EMNLP.

[31]  Mykola Pechenizkiy,et al.  Pattern-Based Emotion Classification on Social Media , 2015, Advances in Social Media Analysis.

[32]  Rada Mihalcea,et al.  Multilingual Subjectivity: Are More Languages Better? , 2010, COLING.

[33]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[34]  David Yarowsky,et al.  Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams , 2013, ACL.

[35]  Yuji Matsumoto,et al.  Emotion Classification Using Massive Examples Extracted from the Web , 2008, COLING.

[36]  Maulahikmah Galinium,et al.  Automatic mood classification of Indonesian tweets using linguistic approach , 2013, 2013 International Conference on Information Technology and Electrical Engineering (ICITEE).

[37]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[38]  Hua Xu,et al.  Hierarchical emotion classification and emotion component analysis on chinese micro-blog posts , 2015, Expert Syst. Appl..

[39]  Xiaoming Chen,et al.  A New Method for Sentiment Classification in Text Retrieval , 2005, IJCNLP.

[40]  Christopher Joseph Pal,et al.  Cross Lingual Adaptation: An Experiment on Sentiment Classifications , 2010, ACL.

[41]  Tobias Günther,et al.  Sentiment Analysis of Microblogs , 2013 .

[42]  Yi-Shin Chen,et al.  Analyzing event opinion transition through summarized emotion visualization , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[43]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[44]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[45]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[46]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[47]  Alexandra Balahur,et al.  Comparative Experiments for Multilingual Sentiment Analysis Using Machine Translation , 2012, SDAD@ECML/PKDD.

[48]  Hua Xu,et al.  Exploiting effective features for chinese sentiment classification , 2011, Expert Syst. Appl..

[49]  Yuji Matsumoto,et al.  Collecting Evaluative Expressions for Opinion Extraction , 2004, IJCNLP.