Sentiment analysis and topic modeling for COVID-19 vaccine discussions

The outbreak of the novel Coronavirus Disease 2019 (COVID-19) has lasted for nearly two years and caused unprecedented impacts on people's daily life around the world. Even worse, the emergence of the COVID-19 Delta variant once again puts the world in danger. Fortunately, many countries and companies have started to develop coronavirus vaccines since the beginning of this disaster. Till now, more than 20 vaccines have been approved by the World Health Organization (WHO), bringing light to people besieged by the pandemic. The promotion of COVID-19 vaccination around the world also brings a lot of discussions on social media about different aspects of vaccines, such as efficacy and security. However, there does not exist much research work to systematically analyze public opinion towards COVID-19 vaccines. In this study, we conduct an in-depth analysis of tweets related to the coronavirus vaccine on Twitter to understand the trending topics and their corresponding sentimental polarities regarding the country and vaccine levels. The results show that a majority of people are confident in the effectiveness of vaccines and are willing to get vaccinated. In contrast, the negative tweets are often associated with the complaints of vaccine shortages, side effects after injections and possible death after being vaccinated. Overall, this study exploits popular NLP and topic modeling methods to mine people's opinions on the COVID-19 vaccines on social media and to analyse and visualise them objectively. Our findings can improve the readability of the noisy information on social media and provide effective data support for the government and policy makers.

[1]  Yanchun Zhang,et al.  Neural attention with character embeddings for hay fever detection from twitter , 2019, Health Information Science and Systems.

[2]  Erika Bonnevie,et al.  Quantifying the rise of vaccine opposition on Twitter during the COVID-19 pandemic , 2020, Journal of Communication in Healthcare.

[3]  M. De Domenico,et al.  Assessing the risks of ‘infodemics’ in response to COVID-19 epidemics , 2020, Nature Human Behaviour.

[4]  Guangyan Huang,et al.  Modeling User Preferences on Spatiotemporal Topics for Point-of-Interest Recommendation , 2017, 2017 IEEE International Conference on Services Computing (SCC).

[5]  Shuiqiao Yang,et al.  Detecting Topic and Sentiment Dynamics Due to COVID-19 Pandemic Using Social Media , 2020, ADMA.

[6]  Shui Yu,et al.  Analysis and Insights for Myths Circulating on Twitter During the COVID-19 Pandemic , 2020, IEEE Open Journal of the Computer Society.

[7]  Jianxin Li,et al.  Evidence-driven dubious decision making in online shopping , 2018, World Wide Web.

[8]  Fei Wu,et al.  Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions , 2020, ArXiv.

[9]  Jianlong Zhou,et al.  Detecting Community Depression Dynamics Due to COVID-19 Pandemic in Australia , 2020, IEEE Transactions on Computational Social Systems.

[10]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[11]  Yanchun Zhang,et al.  Sentence level topic models for associated topics extraction , 2018, World Wide Web.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Jianxin Li,et al.  Influence Propagation Model for Clique-Based Community Detection in Social Networks , 2018, IEEE Transactions on Computational Social Systems.

[14]  Yanchun Zhang,et al.  Automated detection of mild and multi-class diabetic eye diseases using deep learning , 2020, Health Information Science and Systems.

[15]  Yi Wang,et al.  Decision-based evasion attacks on tree ensemble classifiers , 2020, World Wide Web.

[16]  Christian S. Jensen,et al.  Efficient targeted influence minimization in big social networks , 2020, World Wide Web.

[17]  Garving K. Luli,et al.  COVID-19 Vaccine–Related Discussion on Twitter: Topic Modeling and Sentiment Analysis , 2021, Journal of Medical Internet Research.

[18]  Etsuko Ishii,et al.  Model Generalization on COVID-19 Fake News Detection , 2021, CONSTRAINT@AAAI.

[19]  M. Thelwall,et al.  Covid-19 vaccine hesitancy on English-language Twitter , 2021 .

[20]  Guanjin Wang,et al.  Tweet Topics and Sentiments Relating to COVID-19 Vaccination Among Australian Twitter Users: Machine Learning Analysis , 2021, Journal of Medical Internet Research.

[21]  Fang Chen,et al.  Examination of Community Sentiment Dynamics due to COVID-19 Pandemic: A Case Study from a State in Australia , 2020, SN Computer Science.

[22]  Wei Liu,et al.  Deep fusion of multimodal features for social media retweet time prediction , 2020, World Wide Web.

[23]  Shuiqiao Yang,et al.  Discovering Topic Representative Terms for Short Text Clustering , 2019, IEEE Access.

[24]  Jiao Yin,et al.  Vulnerability exploitation time prediction: an integrated framework for dynamic imbalanced learning , 2021, World Wide Web.