A model for the Twitter sentiment curve

Twitter is among the most used online platforms for the political communications, due to the concision of its messages (which is particularly suitable for political slogans) and the quick diffusion of messages. Especially when the argument stimulate the emotionality of users, the content on Twitter is shared with extreme speed and thus studying the tweet sentiment if of utmost importance to predict the evolution of the discussions and the register of the relative narratives. In this article, we present a model able to reproduce the dynamics of the sentiments of tweets related to specific topics and periods and to provide a prediction of the sentiment of the future posts based on the observed past. The model is a recent variant of the Pólya urn, introduced and studied in Aletti and Crimaldi (2019, 2020), which is characterized by a “local” reinforcement, i.e. a reinforcement mechanism mainly based on the most recent observations, and by a random persistent fluctuation of the predictive mean. In particular, this latter feature is capable of capturing the trend fluctuations in the sentiment curve. While the proposed model is extremely general and may be also employed in other contexts, it has been tested on several Twitter data sets and demonstrated greater performances compared to the standard Pólya urn model. Moreover, the different performances on different data sets highlight different emotional sensitivities respect to a public event.

[1]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[2]  Roberto Di Pietro,et al.  The Paradigm-Shift of Social Spambots: Evidence, Theories, and Tools for the Arms Race , 2017, WWW.

[3]  Markus Kuba,et al.  On Generalized Pólya Urn Models , 2013, Journal of Applied Probability.

[4]  Weitong Chen,et al.  A survey of sentiment analysis in social media , 2018, Knowledge and Information Systems.

[5]  Jingbo Zhu,et al.  Aspect-Based Opinion Polling from Customer Reviews , 2011, IEEE Transactions on Affective Computing.

[6]  William F. Rosenberger,et al.  Central limit Theorem for an Adaptive Randomly Reinforced Urn Model , 2015, 1502.06130.

[7]  G. Pólya,et al.  Über die Statistik verketteter Vorgänge , 1923 .

[8]  Giacomo Aletti,et al.  Interacting reinforced stochastic processes: Statistical inference based on the weighted empirical means , 2018, 1811.10255.

[9]  Guido Caldarelli,et al.  Extracting significant signal of news consumption from social networks: the case of Twitter in Italian political elections , 2019, Palgrave Communications.

[10]  W. Rosenberger,et al.  Nonparametric covariate-adjusted response-adaptive design based on a functional urn model , 2016, Annals of Statistics.

[11]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[12]  Hosam M. Mahmoud,et al.  Polya Urn Models , 2008 .

[13]  Ravi Kumar,et al.  Dynamics of conversations , 2010, KDD.

[14]  Xueming Qian,et al.  Rating Prediction Based on Social Sentiment From Textual Reviews , 2016, IEEE Transactions on Multimedia.

[15]  Roberto Di Pietro,et al.  Fame for sale: Efficient detection of fake Twitter followers , 2015, Decis. Support Syst..

[16]  Guido Caldarelli,et al.  Weighted networks as randomly reinforced urn processes. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Luca Pratelli,et al.  Asymptotics for randomly reinforced urns with random barriers , 2015, Journal of Applied Probability.

[18]  Li Bing,et al.  Public Sentiment Analysis in Twitter Data for Prediction of a Company's Stock Price Movements , 2014, 2014 IEEE 11th International Conference on e-Business Engineering.

[19]  Filippo Menczer,et al.  The spread of fake news by social bots , 2017, ArXiv.

[20]  Xiangji Huang,et al.  Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain , 2012, IEEE Transactions on Knowledge and Data Engineering.

[21]  Yang Li,et al.  Interpreting the Public Sentiment Variations on Twitter , 2014, IEEE Transactions on Knowledge and Data Engineering.

[22]  Filippo Menczer,et al.  The rise of social bots , 2014, Commun. ACM.

[23]  Mohammad Atique,et al.  Sentiment Analysis for Social Media: A Survey , 2015, 2015 2nd International Conference on Information Science and Security (ICISS).

[24]  Shuai Gao,et al.  Modeling and Predicting Retweeting Dynamics on Microblogging Platforms , 2015, WSDM.

[25]  Emily B. Fox,et al.  A Bayesian Approach for Predicting the Popularity of Tweets , 2013, ArXiv.

[26]  I. Crimaldi,et al.  The Rescaled Polya Urn: local reinforcement and chi-squared goodness of fit test. , 2019 .

[27]  Giacomo Aletti,et al.  Dynamics of an adaptive randomly reinforced urn , 2015, Bernoulli.

[28]  Yiran Chen,et al.  Quantitative Study of Individual Emotional States in Social Networks , 2012, IEEE Transactions on Affective Computing.

[29]  S. Wood Generalized Additive Models: An Introduction with R, Second Edition , 2017 .

[30]  Caitlin Evans Wagner,et al.  The hybrid media system: Politics and power , 2014, New Media Soc..

[31]  Renaud Lambiotte,et al.  TiDeH: Time-Dependent Hawkes Process for Predicting Retweet Dynamics , 2016, ICWSM.

[32]  Guido Caldarelli,et al.  Emotional Dynamics in the Age of Misinformation , 2015, PloS one.

[33]  I. Crimaldi,et al.  Generalized Rescaled P\'olya urn and its statistical applications. , 2020 .

[34]  Justin M. Rao,et al.  Filter Bubbles, Echo Chambers, and Online News Consumption , 2016 .

[35]  Lada A. Adamic,et al.  The Anatomy of Large Facebook Cascades , 2013, ICWSM.

[36]  Cécile Mailler,et al.  Multiple drawing multi-colour urns by stochastic approximation , 2018, J. Appl. Probab..

[37]  Fuji Ren,et al.  Predicting User-Topic Opinions in Twitter with Social and Topical Context , 2013, IEEE Transactions on Affective Computing.

[38]  Steven Skiena,et al.  Building Sentiment Lexicons for All Major Languages , 2014, ACL.

[39]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[40]  Codina Cotar,et al.  On a Preferential Attachment and Generalized Pólya's Urn Model , 2012, 1203.5758.

[41]  G. Caldarelli,et al.  Analysis of online misinformation during the peak of the COVID-19 pandemics in Italy , 2020, ArXiv.

[42]  Thomas Schillemans THE HYBRID MEDIA SYSTEM: POLITICS AND POWER , 2014 .

[43]  ZhuJingbo,et al.  Aspect-Based Opinion Polling from Customer Reviews , 2011 .

[44]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[45]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009, J. Assoc. Inf. Sci. Technol..

[46]  Arvid Kappas,et al.  Collective Emotions Online and Their Influence on Community Life , 2011, PloS one.

[47]  Giacomo Aletti,et al.  Synchronization of reinforced stochastic processes with a network-based interaction , 2016, 1607.08514.

[48]  Sophie Laruelle,et al.  Randomized urn models revisited using Stochastic Approximation , 2011, 1101.2786.

[49]  Siddhartha Bhattacharyya,et al.  A Survey of Sentiment Analysis from Social Media Data , 2020, IEEE Transactions on Computational Social Systems.

[50]  Fabiana Zollo,et al.  Toward a Better Understanding of Emotional Dynamics on Facebook , 2017, COMPLEX NETWORKS.

[51]  Filippo Menczer,et al.  Arming the public with artificial intelligence to counter social bots , 2019, Human Behavior and Emerging Technologies.

[52]  Guido Caldarelli,et al.  The role of bot squads in the political propaganda on Twitter , 2019, Communications Physics.

[53]  A. Flammini,et al.  Uncovering Coordinated Networks on Social Media , 2020, ICWSM.

[54]  Amit P. Sheth,et al.  Predictive Analysis on Twitter: Techniques and Applications , 2018 .

[55]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[56]  Alireza Sahami Shirazi,et al.  Limited individual attention and online virality of low-quality information , 2017, Nature Human Behaviour.

[57]  A survey of random processes with reinforcement , 2007, math/0610076.

[58]  Guido Caldarelli,et al.  Echo Chambers: Emotional Contagion and Group Polarization on Facebook , 2016, Scientific Reports.

[59]  Fehmi Cirak,et al.  ZEITSCHRIFT FÜR ANGEWANDTE MATHEMATIK UND MECHANIK , 1952, Über Stammfaktoren bei ternären quadratischen Formen.

[60]  Scott A. Golder,et al.  Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures , 2011 .

[61]  Irene Crimaldi,et al.  Central limit theorems for a hypergeometric randomly reinforced urn , 2015, J. Appl. Probab..