Sentiment analysis of messages on Twitter related to COVID-19 using deep learning approach

The widespread situation of the Coronavirus-19 (COVID-19) pandemic is a tangible and pressing concern. Many changes in terms lifestyle are necessary to reduce the chance of infection. While citizens have gone through different emotions, they would share their thought and interactions on social media, especially on Twitter. COVID-19 related messages can imply social emotion. This study performs sentiment analysis on tweets and annotated them into six classes of positive and negative feelings consist of anger, disgust, fear, sadness, joy, and surprise. We analyzed both textual information and historical data. We collected 120,642 unique tweets datasets between 1 January 2020 and 30 June 2021. We compared the performance of five neural network models which are multi-layer perceptron, RNN, LSTM, Bidirectional LSTM, and GRU with several metrics consists of accuracy, F1 score, precision, and recall. The results show that LSTM model has the highest accuracy score at 77.4% while GRU has the best F1 score at 77.13%. These models could be used to monitor the movement of negative emotions. In addition, we provide interesting insights from sentiment analysis with tweet data and historical reported of infected cases, and vaccination data.

[1]  Ramesh Singh,et al.  Applications of sentiment analysis and machine learning techniques in disease outbreak prediction – A review , 2021 .

[2]  Kitsuchart Pasupa,et al.  Hybrid Deep Learning Models for Thai Sentiment Analysis , 2021, Cognitive Computation.

[3]  Ali Al-Laith,et al.  Monitoring People's Emotions and Symptoms from Arabic Tweets during the COVID-19 Pandemic , 2021, Inf..

[4]  Peter W. Eklund,et al.  COVIDSenti: A Large-Scale Benchmark Twitter Data Set for COVID-19 Sentiment Analysis , 2021, IEEE Transactions on Computational Social Systems.

[5]  Sarana Nutanong,et al.  WangchanBERTa: Pretraining transformer-based Thai Language Models , 2021, ArXiv.

[6]  Klaifer Garcia,et al.  Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA , 2020, Applied Soft Computing.

[7]  T. Mackey,et al.  Machine Learning to Detect Self-Reporting of Symptoms, Testing Access, and Recovery Associated With COVID-19 on Twitter: Retrospective Big Data Infoveillance Study , 2020, JMIR public health and surveillance.

[8]  Purnima Kubde,et al.  Emotional Analysis using Twitter Data during Pandemic Situation: COVID-19 , 2020, 2020 5th International Conference on Communication and Electronics Systems (ICCES).

[9]  Marcel Salathé,et al.  COVID-Twitter-BERT: A natural language processing model to analyse COVID-19 content on Twitter , 2020, Frontiers in Artificial Intelligence.

[10]  Pattarawat Chormai,et al.  AttaCut: A Fast and Accurate Neural Thai Word Segmenter , 2019, ArXiv.

[11]  Kitsuchart Pasupa,et al.  Thai sentiment analysis with deep learning techniques: A comparative study based on word embedding, POS-tag, and sentic features , 2019, Sustainable Cities and Society.

[12]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[13]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[14]  Mohammad Abu Kausar,et al.  Public Sentiment Analysis on Twitter Data during COVID-19 Outbreak , 2021, International Journal of Advanced Computer Science and Applications.

[15]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.