and other research outputs Evaluation datasets for Twitter sentiment analysis : a survey and a new dataset , the STS-Gold Conference Item How to

Sentiment analysis over Twitter offers organisations and individuals a fast and effective way to monitor the publics’ feelings towards them and their competitors. To assess the performance of sentiment analysis methods over Twitter a small set of evaluation datasets have been released in the last few years. In this paper we present an overview of eight publicly available and manually annotated evaluation datasets for Twitter sentiment analysis. Based on this review, we show that a common limitation of most of these datasets, when assessing sentiment analysis at target (entity) level, is the lack of distinctive sentiment annotations among the tweets and the entities contained in them. For example, the tweet “I love iPhone, but I hate iPad” can be annotated with a mixed sentiment label, but the entity iPhone within this tweet should be annotated with a positive sentiment label. Aiming to overcome this limitation, and to complement current evaluation datasets, we present STS-Gold, a new evaluation dataset where tweets and targets (entities) are annotated individually and therefore may present different sentiment labels. This paper also provides a comparative study of the various datasets along several dimensions including: total number of tweets, vocabulary size and sparsity. We also investigate the pair-wise correlation among these dimensions as well as their correlations to the sentiment classification performance on different datasets.

[1]  Jeremy Ellman,et al.  TJP: Using Twitter to Analyze the Polarity of Contexts , 2013, *SEMEVAL.

[2]  Guillermo Sapiro,et al.  If you are happy and you know it... tweet , 2012, CIKM '12.

[3]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[4]  Wei Hu,et al.  Mutually Enhancing Community Detection and Sentiment Analysis on Twitter Networks , 2013 .

[5]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[6]  Luis Alfonso Ureña López,et al.  SINAI: Machine Learning and Emotion of the Crowd for Sentiment Analysis in Microblogs , 2013, *SEMEVAL.

[7]  Susumu Horiguchi,et al.  Learning to classify short and sparse text & web with hidden topics from large-scale data collections , 2008, WWW.

[8]  Harith Alani,et al.  Semantic smoothing for Twitter sentiment analysis , 2011 .

[9]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[10]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[11]  Robert Remus,et al.  ASVUniOfLeipzig: Sentiment Analysis in Twitter using Data-driven Machine Learning Techniques , 2013, *SEMEVAL.

[12]  Marcelo Mendoza,et al.  Combining strengths, emotions and polarities for boosting Twitter sentiment analysis , 2013, WISDOM '13.

[13]  Jason Baldridge,et al.  Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[14]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[15]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[16]  Vasudeva Varma,et al.  Mining Sentiments from Tweets , 2012, WASSA@ACL.

[17]  Harith Alani,et al.  Alleviating Data Sparsity for Twitter Sentiment Analysis , 2012, #MSM.

[18]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[19]  Mohamed S. Kamel,et al.  Automatic Extraction of Domain-Specific Stopwords from Labeled Documents , 2008, ECIR.

[20]  David A. Shamma,et al.  Tweet the debates: understanding community annotation of uncollected sources , 2009, WSM@MM.

[21]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[22]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[23]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[24]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.