Classifying vaccine sentiment tweets by modelling domain-specific representation and commonsense knowledge into context-aware attentive GRU

Vaccines are an important public health measure, but vaccine hesitancy and refusal can create clusters of low vaccine coverage and reduce the effectiveness of vaccination programs. Social media provides an opportunity to estimate emerging risks to vaccine acceptance by including geographical location and detailing vaccine-related concerns. Methods for classifying social media posts, such as vaccine-related tweets, use language models (LMs) trained on general domain text. However, challenges to measuring vaccine sentiment at scale arise from the absence of tonal stress and gestural cues and may not always have additional information about the user, e.g., past tweets or social connections. Another challenge in LMs is the lack of ‘commonsense’ knowledge that are apparent in users' metadata, i.e., emoticons, positive and negative words etc. In this study, to classify vaccine sentiment tweets with limited information, we present a novel end-to-end framework consisting of interconnected components that use domain-specific LM trained on vaccine-related tweets and models commonsense knowledge into a bidirectional gated recurrent network (CK-BiGRU) with context-aware attention. We further leverage syntactical, user metadata and sentiment information to capture the sentiment of a tweet. We experimented using two popular vaccine-related Twitter datasets and demonstrate that our proposed approach outperforms state-of-the-art models in identifying pro-vaccine, anti-vaccine and neutral tweets.

[1]  Marcel Salathé,et al.  Assessing Vaccination Sentiments with Online Social Media: Implications for Infectious Disease Dynamics and Control , 2011, PLoS Comput. Biol..

[2]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[3]  Imran Razzak,et al.  A survey of pre-processing techniques to improve short-text quality: a case study on hate speech detection on twitter , 2020, Multimedia Tools and Applications.

[4]  Thomas Wolf,et al.  DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter , 2019, ArXiv.

[5]  Kevin A Padrez,et al.  Twitter as a Tool for Health Research: A Systematic Review , 2017, American journal of public health.

[6]  Martin Mueller,et al.  Crowdbreaks: Tracking Health Trends Using Public Social Media Data and Crowdsourcing , 2018, Front. Public Health.

[7]  Saif Mohammad,et al.  Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words , 2018, ACL.

[8]  Ido Dagan,et al.  context2vec: Learning Generic Context Embedding with Bidirectional LSTM , 2016, CoNLL.

[9]  Peter W. Eklund,et al.  COVIDSenti: A Large-Scale Benchmark Twitter Data Set for COVID-19 Sentiment Analysis , 2021, IEEE Transactions on Computational Social Systems.

[10]  Gang Wang,et al.  Global Context-Aware Attention LSTM Networks for 3D Action Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Marcel Salathé,et al.  COVID-Twitter-BERT: A natural language processing model to analyse COVID-19 content on Twitter , 2020, Frontiers in Artificial Intelligence.

[12]  Imran Razzak,et al.  A Comprehensive Survey on Word Representation Models: From Classical to State-of-the-Art Word Representation Language Models , 2020, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[13]  Zhen Xu,et al.  Incorporating loose-structured knowledge into conversation modeling via recall-gate LSTM , 2016, 2017 International Joint Conference on Neural Networks (IJCNN).

[14]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[15]  J. Leask,et al.  How heuristics and cognitive biases affect vaccination decisions. , 2020, Cadernos de saude publica.

[16]  Luke S. Zettlemoyer,et al.  Dissecting Contextual Word Embeddings: Architecture and Representation , 2018, EMNLP.

[17]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[18]  A. Holton,et al.  Twitter as a source of vaccination information: content drivers and what they are saying. , 2013, American journal of infection control.

[19]  Erik Cambria,et al.  SenticNet 5: Discovering Conceptual Primitives for Sentiment Analysis by Means of Context Embeddings , 2018, AAAI.

[20]  Xujuan Zhou,et al.  Using social connection information to improve opinion mining: Identifying negative sentiment about HPV vaccines on Twitter , 2015, MedInfo.

[21]  Yoshua Bengio,et al.  A Neural Knowledge Language Model , 2016, ArXiv.

[22]  S. Golder,et al.  Systematic review on the prevalence, frequency and comparative value of adverse events data in social media. , 2015, British journal of clinical pharmacology.

[23]  K. Mandl,et al.  Associations Between Exposure to and Expression of Negative Opinions About Human Papillomavirus Vaccines on Social Media: An Observational Study , 2015, Journal of medical Internet research.

[24]  Jaewoo Kang,et al.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining , 2019, Bioinform..

[25]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[26]  Samarth Swarup,et al.  Semantic network analysis of vaccine sentiment in online social media. , 2017, Vaccine.

[27]  Yang Xiang,et al.  X-A-BiLSTM: a Deep Learning Approach for Depression Detection in Imbalanced Data , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[28]  M. Shigematsu,et al.  Using Social Media for Actionable Disease Surveillance and Outbreak Management: A Systematic Literature Review , 2015, PloS one.

[29]  Federica Fornaciari,et al.  Semantic Network Analysis , 2016 .

[30]  Guozheng Rao,et al.  Sentiment Analysis Methods for HPV Vaccines Related Tweets Based on Transfer Learning , 2020, Healthcare.

[31]  Jingcheng Du,et al.  Optimization on machine learning based approaches for sentiment analysis on HPV vaccines related tweets , 2017, Journal of Biomedical Semantics.

[32]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[33]  Zhiyong Feng,et al.  MGL-CNN: A Hierarchical Posts Representations Model for Identifying Depressed Individuals in Online Forums , 2020, IEEE Access.

[34]  Dat Quoc Nguyen,et al.  BERTweet: A pre-trained language model for English Tweets , 2020, EMNLP.

[35]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[36]  Rui Zhang,et al.  Mining Twitter to Assess the Determinants of Health Behavior towards Human Papillomavirus Vaccination in the United States , 2019, J. Am. Medical Informatics Assoc..

[37]  Matloob Khushi,et al.  BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition , 2020, 2021 International Joint Conference on Neural Networks (IJCNN).

[38]  E. Cvejic,et al.  Concerns and motivations about COVID-19 vaccination , 2020, The Lancet Infectious Diseases.

[39]  Erik Cambria,et al.  Targeted Aspect-Based Sentiment Analysis via Embedding Commonsense Knowledge into an Attentive LSTM , 2018, AAAI.

[40]  Kevin Gimpel,et al.  ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[41]  K. Mandl,et al.  Limited Role of Bots in Spreading Vaccine-Critical Information Among Active Twitter Users in the United States: 2017-2019. , 2020, American Journal of Public Health.

[42]  Yonatan Belinkov,et al.  Linguistic Knowledge and Transferability of Contextual Representations , 2019, NAACL.

[43]  Elise de Doncker,et al.  Twitter sentiment analysis with a deep neural network: An enhanced approach using user behavioral information , 2019, Cognitive Systems Research.

[44]  J. Leask,et al.  Using social media for vaccination promotion: Practices and challenges , 2020, Digital health.

[45]  Katarzyna Musial,et al.  Transformer based Deep Intelligent Contextual Embedding for Twitter sentiment analysis , 2020, Future Gener. Comput. Syst..

[46]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[47]  Katarzyna Musial,et al.  DICE: Deep Intelligent Contextual Embedding for Twitter Sentiment Analysis , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[48]  Brian Borsari,et al.  Systematic review of surveillance by social media platforms for illicit drug use , 2017, Journal of public health.

[49]  Zhen-Hua Ling,et al.  Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models , 2019, ArXiv.

[50]  Richard Socher,et al.  Learned in Translation: Contextualized Word Vectors , 2017, NIPS.

[51]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[52]  Marianthi Markatou,et al.  Text mining for the Vaccine Adverse Event Reporting System: medical text classification using informative feature selection , 2011, J. Am. Medical Informatics Assoc..

[53]  Marcel Salathé,et al.  The dynamics of health behavior sentiments on a large online social network , 2012, EPJ Data Science.

[54]  Joyce Y. Chai,et al.  Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches , 2019 .