Monitoring the Twitter sentiment during the Bulgarian elections

We present a generic approach to real-time monitoring of the Twitter sentiment and show its application to the Bulgarian parliamentary elections in May 2013. Our approach is based on building high quality sentiment classification models from manually annotated tweets. In particular, we have developed a user-friendly annotation platform, a feature selection procedure based on maximizing prediction accuracy, and a binary SVM classifier extended with a neutral zone. We have also considerably improved the language detection in tweets. The evaluation results show that before and after the Bulgarian elections, negative sentiment about political parties prevailed. Both, the volume and the difference between the negative and positive tweets for individual parties closely match the election results. The later result is somehow surprising, but consistent with the prevailing negative sentiment during the elections.

[1]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[2]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[3]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[4]  ˇ IvanaLu Efficient Discrimination Between Closely Related Languages , 2012 .

[5]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[6]  Jörg Tiedemann,et al.  Efficient Discrimination Between Closely Related Languages , 2012, COLING.

[7]  Guido Caldarelli,et al.  S 1 Appendix , 2016 .

[8]  Nada Lavrac,et al.  Predictive Sentiment Analysis of Tweets: A Stock Market Application , 2013, CHI-KDD.

[9]  Panagiotis Takis Metaxas,et al.  How (Not) to Predict Elections , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[10]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[11]  Johan Bos,et al.  Predicting the 2011 Dutch Senate Election Results with Twitter , 2012 .

[12]  Panagiotis Takis Metaxas,et al.  Limits of Electoral Predictions Using Twitter , 2011, ICWSM.

[13]  Guido Caldarelli,et al.  Emotional Dynamics in the Age of Misinformation , 2015, PloS one.

[14]  Ee-Peng Lim,et al.  Tweets and Votes: A Study of the 2011 Singapore General Election , 2012, 2012 45th Hawaii International Conference on System Sciences.

[15]  Daniel Gayo-Avello,et al.  Don't turn social media into another 'Literary Digest' poll , 2011, Commun. ACM.

[16]  Andreas Jungherr,et al.  Tweets and Votes, a Special Relationship , 2013 .

[17]  S. Battiston,et al.  Sentiment leaning of influential communities in social networks , 2015 .

[18]  Andreas Jungherr,et al.  Tweets and votes, a special relationship: the 2009 federal election in germany , 2013, PLEAD '13.

[19]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[20]  Guido Caldarelli,et al.  Twitter-Based Analysis of the Dynamics of Collective Attention to Political Parties , 2015, PloS one.

[21]  Lada A. Adamic,et al.  The Party Is Over Here: Structure and Content in the 2010 Election , 2011, ICWSM.

[22]  Isabell M. Welpe,et al.  Where There is a Sea There are Pirates: Response to Jungherr, Jürgens, and Schoen , 2012 .

[23]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[24]  Ronen Feldman,et al.  Book Reviews: The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data by Ronen Feldman and James Sanger , 2008, CL.

[25]  Andrew McCallum,et al.  A comparison of event models for naive bayes text classification , 1998, AAAI 1998.

[26]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[27]  Ron Zacharski,et al.  Language Recognition for Mono-and Multi-lingual Documents , 1999 .

[28]  J. Bollen,et al.  More Tweets, More Votes: Social Media as a Quantitative Indicator of Political Behavior , 2013, PloS one.

[29]  Gabriella Pasi,et al.  Human-Computer Interaction and Knowledge Discovery in Complex, Unstructured, Big Data , 2013, Lecture Notes in Computer Science.

[30]  Clive Souter,et al.  Natural Language Identification using Corpus-Based Models , 2017 .

[31]  Kazem Jahanbakhsh,et al.  The Predictive Power of Social Media: On the Predictability of U.S. Presidential Elections using Twitter , 2014, ArXiv.

[32]  JungherrAndreas,et al.  Why the Pirate Party Won the German Election of 2009 or The Trouble With Predictions , 2012 .

[33]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[34]  Daniel Gayo-Avello,et al.  "I Wanted to Predict Elections with Twitter and all I got was this Lousy Paper" - A Balanced Survey on Election Prediction using Twitter Data , 2012, ArXiv.

[35]  Eni Mustafaraj,et al.  Can Collective Sentiment Expressed on Twitter Predict Political Elections? , 2011, AAAI.

[36]  Wiley Interscience Journal of the American Society for Information Science and Technology , 2013 .

[37]  A. Smeaton,et al.  On Using Twitter to Monitor Political Sentiment and Predict Election Results , 2011 .

[38]  Mike Thelwall,et al.  Sentiment in Twitter events , 2011, J. Assoc. Inf. Sci. Technol..

[39]  Laura Schweitzer,et al.  Advances In Kernel Methods Support Vector Learning , 2016 .

[40]  Guido Caldarelli,et al.  A Multi-Level Geographical Study of Italian Political Elections from Twitter Data , 2014, PloS one.

[41]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[42]  Jörg Tiedemann,et al.  News from OPUS — A collection of multilingual parallel corpora with tools and interfaces , 2009 .

[43]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[44]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[45]  Nada Lavrac,et al.  Stream-based active learning for sentiment analysis in the financial domain , 2014, Inf. Sci..

[46]  Peter A. Flach,et al.  Machine Learning , 2012 .

[47]  A. J. Morales,et al.  Characterizing and modeling an electoral campaign in the context of Twitter: 2011 Spanish Presidential Election as a case study , 2012, Chaos.