Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold

Sentiment analysis over Twitter offers organisations and individuals a fast and effective way to monitor the publics' feelings towards them and their competitors. To assess the performance of sentiment analysis methods over Twitter a small set of evaluation datasets have been released in the last few years. In this paper we present an overview of eight publicly available and manually annotated evaluation datasets for Twitter sentiment analysis. Based on this review, we show that a common limitation of most of these datasets, when assessing sentiment analysis at target (entity) level, is the lack of distinctive sentiment annotations among the tweets and the entities contained in them. For example, the tweet "I love iPhone, but I hate iPad" can be annotated with a mixed sentiment label, but the entity iPhone within this tweet should be annotated with a positive sentiment label. Aiming to overcome this limitation, and to complement current evaluation datasets, we present STS-Gold, a new evaluation dataset where tweets and targets (entities) are annotated individually and therefore may present different sentiment labels. This paper also provides a comparative study of the various datasets along several dimensions including: total number of tweets, vocabulary size and sparsity. We also investigate the pair-wise correlation among these dimensions as well as their correlations to the sentiment classification performance on different datasets.

[1]  Susumu Horiguchi,et al.  Learning to classify short and sparse text & web with hidden topics from large-scale data collections , 2008, WWW.

[2]  ThelwallMike,et al.  Sentiment strength detection in short informal text , 2010 .

[3]  Harith Alani,et al.  Semantic smoothing for Twitter sentiment analysis , 2011 .

[4]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[5]  A. Montejo-R,et al.  SINAI: Machine Learning and Emotion of the Crowd for Sentiment Analysis in Microblogs , 2013 .

[6]  Jeremy Ellman,et al.  TJP: Using Twitter to Analyze the Polarity of Contexts , 2013, *SEMEVAL.

[7]  Guillermo Sapiro,et al.  If you are happy and you know it... tweet , 2012, CIKM '12.

[8]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.

[9]  Robert Remus,et al.  ASVUniOfLeipzig: Sentiment Analysis in Twitter using Data-driven Machine Learning Techniques , 2013, *SEMEVAL.

[10]  Jason Baldridge,et al.  Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[11]  Mohamed S. Kamel,et al.  Automatic Extraction of Domain-Specific Stopwords from Labeled Documents , 2008, ECIR.

[12]  K. Thompson,et al.  If You're Happy and You Know It , 2012 .

[13]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[14]  David A. Shamma,et al.  Characterizing debate performance via aggregated twitter sentiment , 2010, CHI.

[15]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[16]  Wei Hu,et al.  Mutually Enhancing Community Detection and Sentiment Analysis on Twitter Networks , 2013 .

[17]  David A. Shamma,et al.  Tweet the debates: understanding community annotation of uncollected sources , 2009, WSM@MM.

[18]  Brendan T. O'Connor,et al.  Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments , 2010, ACL.

[19]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[20]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[21]  Marcelo Mendoza,et al.  Combining strengths, emotions and polarities for boosting Twitter sentiment analysis , 2013, WISDOM '13.

[22]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[23]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[24]  Harith Alani,et al.  Alleviating Data Sparsity for Twitter Sentiment Analysis , 2012, #MSM.

[25]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[26]  Vasudeva Varma,et al.  Mining Sentiments from Tweets , 2012, WASSA@ACL.