Métodos para Análise de Sentimentos no Twitter

Sentiment analysis has being used in several applications including the analysis of the repercussion of events in online social networks (OSNs), as well as to summarize public perception about products and brands on discussions on those systems. There are multiple methods to measure sentiments, varying from lexical-based approaches to machine learning methods. Despite the wide use and popularity of some those methods, it is unclear which method is better for identifying the polarity (i.e. positive or negative) of a message, as the current literature does not provide a comparison among existing methods. This comparison is crucial to allow us to understand the potential limitations, advantages, and disadvantages of popular methods in the context of OSNs messages. This work aims at filling this gap by presenting a comparison between 8 popular sentiment analysis methods. Our analysis compares these methods in terms of coverage and in terms of correct sentiment identification. We also develop a new method that combines existing approaches in order to provide the best coverage results with competitive accuracy. Finally, we present iFeel, a Web service which provides an open API for accessing and comparing results across different sentiment methods for a given text.

[1]  Jonathon Read,et al.  Using Emoticons to Reduce Dependency in Machine Learning Techniques for Sentiment Classification , 2005, ACL.

[2]  Meeyoung Cha,et al.  Emoticon Style: Interpreting Differences in Emoticons Across Cultures , 2013, ICWSM.

[3]  M. Bradley,et al.  Affective Normsfor English Words (ANEW): Stimuli, instruction manual and affective ratings (Tech Report C-1) , 1999 .

[4]  Fabrício Benevenuto,et al.  O Que Tweets Contendo Emoticons Podem Revelar Sobre Sentimentos Coletivos , 2013 .

[5]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[6]  Virgílio A. F. Almeida,et al.  Detecting Spammers on Twitter , 2010 .

[7]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[8]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[9]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[10]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[11]  Erik Cambria,et al.  Towards Crowd Validation of the UK National Health Service , 2010 .

[12]  Sune Lehmann,et al.  Tweetin' in the Rain: Exploring Societal-Scale Effects of Weather on Mood , 2012, ICWSM.

[13]  D. Watson,et al.  Development and validation of brief measures of positive and negative affect: the PANAS scales. , 1988, Journal of personality and social psychology.

[14]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[15]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[16]  Swapna Somasundaran,et al.  Discourse Level Opinion Interpretation , 2008, COLING.

[17]  Erik Cambria,et al.  SenticNet: A Publicly Available Semantic Resource for Opinion Mining , 2010, AAAI Fall Symposium: Commonsense Knowledge.