Text mining and sentiment analysis of COVID-19 tweets

The human severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes COVID-19, has continued to spread all over the world. It threatens not only public health and the global economy but also mental health and mood. While the impact of the COVID-19 pandemic has been widely studied, relatively few studies have examined the sentiment of the population. In this article, we scrape COVID-19-related tweets from the microblogging platform Twitter and examine those posted from February 24, 2020 to October 14, 2020 in four Canadian cities (Toronto, Montreal, Vancouver, and Calgary) and four U.S. cities (New York, Los Angeles, Chicago, and Seattle). Applying the VADER and NRC approaches, we evaluate sentiment intensity scores and visualize the information over different periods of the pandemic. We also compute sentiment scores for tweets concerning three anti-epidemic measures (masks, vaccines, and lockdowns) for comparison. The results for the four Canadian cities are compared with those for the four U.S. cities. We study the causal relationships among the number of infected cases, tweet activity, and the sentiment scores of COVID-19-related tweets by integrating the echo state network method with convergent cross mapping. Our analysis shows that public sentiment regarding COVID-19 varies across time periods and locations. In general, sentiment is positive toward COVID-19 and masks but negative toward vaccines and lockdowns. The causal inference shows that sentiment influences people's activity on Twitter, which is also correlated with the daily number of infections.

Department of Statistical and Actuarial Sciences, University of Western Ontario, London, Ontario, Canada
Department of Computer Science, University of Western Ontario, London, Ontario, Canada
Corresponding author. Email: gyi5@uwo.ca
arXiv:2106.15354v1 [cs.SI] 26 Jun 2021
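
As a rough illustration of the lexicon-based sentiment intensity scoring named in the abstract, the sketch below sums word valences and squashes the total into (-1, 1) with the normalization x / sqrt(x^2 + 15) that VADER uses for its compound score. It is not the authors' pipeline: the lexicon `TOY_LEXICON`, its valence values, the negation rule, and the function names are all hypothetical simplifications. The label cut-offs of +/-0.05 follow the commonly used VADER convention.

```python
import math

# Hypothetical toy lexicon (word -> valence); the real VADER lexicon is far
# larger and human-rated, and these values are made up for illustration.
TOY_LEXICON = {
    "good": 1.9, "great": 3.1, "safe": 1.9,
    "bad": -2.5, "afraid": -2.2, "sick": -2.3,
}
NEGATIONS = {"not", "no", "never"}


def normalize(raw, alpha=15.0):
    """Map an unbounded valence sum into the open interval (-1, 1)."""
    return raw / math.sqrt(raw * raw + alpha)


def compound_score(text):
    """Sum the valences of known words and normalize the total."""
    tokens = [t.strip(".,!?;:\"'()").lower() for t in text.split()]
    total = 0.0
    for i, tok in enumerate(tokens):
        valence = TOY_LEXICON.get(tok)
        if valence is None:
            continue  # words outside the lexicon contribute nothing
        if i > 0 and tokens[i - 1] in NEGATIONS:
            # Assumption: flip the sign after a negation word. This is a crude
            # stand-in for VADER's more careful negation handling.
            valence = -valence
        total += valence
    return normalize(total)


def label(score, threshold=0.05):
    """Turn a compound score into a coarse positive/negative/neutral label."""
    if score >= threshold:
        return "positive"
    if score <= -threshold:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for tweet in ["Masks are great", "I am not safe", "Lockdown extended again"]:
        s = compound_score(tweet)
        print(f"{tweet!r}: {s:+.3f} ({label(s)})")
```

Daily averages of such per-tweet scores, grouped by city or by topic keyword, would give the kind of time series the abstract describes comparing against case counts and tweet volume.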
