Health, Psychosocial, and Social issues emanating from COVID-19 pandemic based on Social Media Comments using Natural Language Processing

The COVID-19 pandemic has caused a global health crisis that affects many aspects of human lives. In the absence of vaccines and antivirals, several behavioural change and policy initiatives, such as physical distancing, have been implemented to control the spread of the coronavirus. Social media data can reveal public perceptions toward how governments and health agencies across the globe are handling the pandemic, as well as the impact of the disease on people regardless of their geographic locations in line with various factors that hinder or facilitate the efforts to control the spread of the pandemic globally. This paper aims to investigate the impact of the COVID-19 pandemic on people globally using social media data. We apply natural language processing (NLP) and thematic analysis to understand public opinions, experiences, and issues with respect to the COVID-19 pandemic using social media data. First, we collect over 47 million COVID-19-related comments from Twitter, Facebook, YouTube, and three online discussion forums. Second, we perform data preprocessing which involves applying NLP techniques to clean and prepare the data for automated theme extraction. Third, we apply context-aware NLP approach to extract meaningful keyphrases or themes from over 1 million randomly selected comments, as well as compute sentiment scores for each theme and assign sentiment polarity based on the scores using lexicon-based technique. Fourth, we categorize related themes into broader themes. A total of 34 negative themes emerged, out of which 15 are health-related issues, psychosocial issues, and social issues related to the COVID-19 pandemic from the public perspective. In addition, 20 positive themes emerged from our results. Finally, we recommend interventions that can help address the negative issues based on the positive themes and other remedial ideas rooted in research.

[1]  Ezekiel J Emanuel,et al.  Fair Allocation of Scarce Medical Resources in the Time of Covid-19. , 2020, The New England journal of medicine.

[2]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[3]  Yiu Chung Lau,et al.  Temporal dynamics in viral shedding and transmissibility of COVID-19 , 2020, Nature Medicine.

[4]  M. Topaz,et al.  Mining social media data to assess the risk of skin and soft tissue infections from allergen immunotherapy. , 2019, The Journal of allergy and clinical immunology.

[5]  Daniel Dajun Zeng,et al.  Electronic cigarette usage patterns: a case study combining survey and social media data , 2018, J. Am. Medical Informatics Assoc..

[6]  Saeed Hassanpour,et al.  Identifying substance use risk based on deep neural networks and Instagram social media data , 2018, Neuropsychopharmacology.

[7]  Mike Conway,et al.  Recent Advances in Using Natural Language Processing to Address Public Health Research Questions Using Social Media and ConsumerGenerated Data , 2019, Yearbook of Medical Informatics.

[8]  D. Jamison,et al.  The Inclusive Cost of Pandemic Influenza Risk , 2016 .

[9]  M. Dockery,et al.  Working from home in the COVID-19 lockdown , 2020 .

[10]  Austin L. Wright,et al.  Poverty and economic dislocation reduce compliance with COVID-19 shelter-in-place protocols , 2020, Journal of Economic Behavior & Organization.

[11]  Ross Arena,et al.  A tale of two pandemics: How will COVID-19 and global trends in physical inactivity and sedentary behavior affect one another? , 2020, Progress in Cardiovascular Diseases.

[12]  Y. K. Tse,et al.  Examining customer perception and behaviour through social media research – An empirical study of the United Airlines overbooking crisis , 2019, Transportation Research Part E: Logistics and Transportation Review.

[13]  Yan Bai,et al.  Presumed Asymptomatic Carrier Transmission of COVID-19. , 2020, JAMA.

[14]  P. Washer Factors in the Emergence of Infectious Diseases , 2010 .

[15]  E. Wilkins COVID-19: Information and resources , 2020 .

[16]  Sandeep Soni,et al.  Racism is a Virus: Anti-Asian Hate and Counterhate in Social Media during the COVID-19 Crisis , 2020, ArXiv.

[17]  G. Onder,et al.  Case-Fatality Rate and Characteristics of Patients Dying in Relation to COVID-19 in Italy. , 2020, JAMA.

[18]  Ruifu Yang,et al.  An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China , 2020, Science.

[19]  Markus Poschke,et al.  Working from home across countries , 2020 .

[20]  Carlo Lai,et al.  Danger in danger: Interpersonal violence during COVID-19 quarantine , 2020, Psychiatry Research.

[21]  Kate E. Jones,et al.  Global trends in emerging infectious diseases , 2008, Nature.

[22]  Andrew R. McNeill,et al.  Twitter Influence on UK Vaccination and Antiviral Uptake during the 2009 H1N1 Pandemic , 2016, Front. Public Health.

[23]  Mohit Tyagi,et al.  Significant applications of virtual reality for COVID-19 pandemic , 2020, Diabetes & Metabolic Syndrome: Clinical Research & Reviews.

[24]  Johan Bos,et al.  Predicting the 2011 Dutch Senate Election Results with Twitter , 2012 .

[25]  Mike Conway,et al.  Tracking Health Related Discussions on Reddit for Public Health Applications , 2017, AMIA.

[26]  Aldo A. Faisal,et al.  Artificial Intelligence, Data Sensors and Interconnectivity: Future Opportunities for Heart Failure , 2019, Cardiac failure review.

[27]  R. Lynfield,et al.  Red Book: 2018-2021 report of the committee on infectious diseases. , 2018 .

[28]  Dongbo Wang,et al.  The influence of word normalization in English document clustering , 2012, 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE).

[29]  David E Bloom,et al.  Emerging infectious diseases: A proactive approach , 2017, Proceedings of the National Academy of Sciences.

[30]  D. Ganster,et al.  Impact of family-supportive work variables on work-family conflict and strain: A control perspective. , 1995 .

[31]  Huan Liu,et al.  Data Mining in Social Media , 2011, Social Network Data Analytics.

[32]  J. Gitaka,et al.  COVID-19: Are Africa’s diagnostic challenges blunting response effectiveness? , 2020, AAS open research.

[33]  Rita Orji,et al.  Social Media and Sentiment Analysis: The Nigeria Presidential Election 2019 , 2019, 2019 IEEE 10th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON).

[34]  George P. Yang,et al.  Anti-Asian sentiment in the United States – COVID-19 and history , 2020, The American Journal of Surgery.

[35]  H. Ashrafi-rizi,et al.  Information Typology in Coronavirus (COVID-19) Crisis; a Commentary , 2020, Archives of academic emergency medicine.

[36]  Chaoyang Li,et al.  Open Access Research Article Physical Activity and Optimal Self-rated Health of Adults with and without Diabetes , 2022 .

[37]  Bing Liu,et al.  Sentence Subjectivity and Sentiment Classification , 2015, Sentiment Analysis.

[38]  H. Zeilhofer,et al.  Social Media Surveillance of Multiple Sclerosis Medications Used During Pregnancy and Breastfeeding: Content Analysis , 2019, Journal of medical Internet research.

[39]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[40]  L. Festinger,et al.  A Theory of Cognitive Dissonance , 2017 .

[41]  S. Morse,et al.  Factors in the emergence of infectious diseases. , 1995, Emerging infectious diseases.

[42]  Cdc Covid- Response Team Preliminary estimates of the prevalence of selected underlying health conditions among patients with coronavirus disease 2019 — United States, February 12–March 28, 2020 , 2020 .

[43]  G. Rubin,et al.  The psychological impact of quarantine and how to reduce it: rapid review of the evidence , 2020, The Lancet.

[44]  Beatrice Santorini,et al.  The Penn Treebank: An Overview , 2003 .

[45]  Modeling compliance with COVID-19 prevention guidelines: the critical role of trust in science , 2020, Psychology, health & medicine.

[46]  Lyle H. Ungar,et al.  Understanding and Measuring Psychological Stress using Social Media , 2018, ICWSM.

[47]  Q. Nguyen,et al.  Census Tract Food Tweets and Chronic Disease Outcomes in the U.S., 2015–2018 , 2019, International journal of environmental research and public health.

[48]  A. Carrasco-Labra,et al.  Social Media Research Strategy to Understand Clinician and Public Perception of Health Care Messages , 2019, JDR clinical and translational research.

[49]  Canada needs to rapidly escalate public health interventions for its COVID-19 mitigation strategies , 2020, Infectious Disease Modelling.

[50]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[51]  Elisabeth Mahase,et al.  Coronavirus: covid-19 has killed more people than SARS and MERS combined, despite lower case fatality rate , 2020, BMJ.

[52]  A. Fauci,et al.  The challenge of emerging and re-emerging infectious diseases , 2004, Nature.

[53]  Jürgen Ziegler,et al.  Rating-based Preference Elicitation for Recommendation of Stress Intervention , 2019, UMAP.

[54]  Ling Zhang,et al.  Timely mental health care for the 2019 novel coronavirus outbreak is urgently needed , 2020, The Lancet Psychiatry.

[55]  David A. Hoffman,et al.  Increasing access to care: telehealth during COVID-19 , 2020, Journal of law and the biosciences.

[56]  Graeme Hirst,et al.  Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study , 2020, Journal of medical Internet research.

[57]  Widodo Budiharto,et al.  Prediction and analysis of Indonesia Presidential election from Twitter using sentiment analysis , 2018, Journal of Big Data.

[58]  Matteo Cella,et al.  Measuring attitudes towards mental health using social media: investigating stigma and trivialisation , 2018, Social Psychiatry and Psychiatric Epidemiology.

[59]  Jusub Kim,et al.  Designing a Scalable, Accessible, and Effective Mobile App Based Solution for Common Mental Health Problems , 2020, Int. J. Hum. Comput. Interact..

[60]  M. Mckenney,et al.  Alarming trends in US domestic violence during the COVID-19 pandemic , 2020, The American Journal of Emergency Medicine.

[61]  Vasudeva Varma,et al.  Pattern based keyword extraction for contextual advertising , 2010, CIKM '10.

[62]  Xuetao Cao COVID-19: immunopathology and its implications for therapy , 2020, Nature Reviews Immunology.

[63]  Heiko Spallek,et al.  Using Natural Language Processing to Enable In-depth Analysis of Clinical Messages Posted to an Internet Mailing List: A Feasibility Study , 2011, Journal of medical Internet research.

[64]  De-Min Han,et al.  Gender Differences in Patients With COVID-19: Focus on Severity and Mortality , 2020, Frontiers in Public Health.

[65]  Naved Iqbal,et al.  Intolerance of uncertainty, depression, and anxiety: Examining the indirect and moderating effects of worry. , 2017, Asian journal of psychiatry.

[66]  K. Peleg,et al.  Self-Isolation Compliance In The COVID-19 Era Influenced By Compensation: Findings From A Recent Survey In Israel. , 2020, Health affairs.

[67]  Isabell M. Welpe,et al.  Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment , 2010, ICWSM.

[68]  Carl Gutwin,et al.  KEA: practical automatic keyphrase extraction , 1999, DL '99.

[69]  Han‐Na Kim,et al.  Smartphone-Based Health Program for Improving Physical Activity and Tackling Obesity for Young Adults: A Systematic Review and Meta-Analysis , 2019, International journal of environmental research and public health.

[70]  Rita Orji,et al.  Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach , 2020, bioRxiv.

[71]  Alex R. Piquero,et al.  Staying Home, Staying Safe? A Short-Term Analysis of COVID-19 on Dallas Domestic Violence , 2020, American journal of criminal justice : AJCJ.

[72]  G. Eysenbach,et al.  Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak , 2010, PloS one.

[73]  Tamara Pilishvili,et al.  Geographic Differences in COVID-19 Cases, Deaths, and Incidence — United States, February 12–April 7, 2020 , 2020, MMWR. Morbidity and mortality weekly report.

[74]  Laura E. Barnes,et al.  "Is This an STD? Please Help!": Online Information Seeking for Sexually Transmitted Diseases on Reddit , 2018, ICWSM.

[75]  E. Holmes,et al.  A new coronavirus associated with human respiratory disease in China , 2020, Nature.

[76]  Jennifer A. Asmuth,et al.  Context Sensitivity of Relational Nouns , 2005 .

[77]  Gianluca Demartini,et al.  Novel insights into views towards H1N1 during the 2009 Pandemic: a thematic analysis of Twitter data , 2019, Health information and libraries journal.

[78]  Scott A. Hale,et al.  Does Campaigning on Social Media Make a Difference? Evidence From Candidate Use of Twitter During the 2015 and 2017 U.K. Elections , 2017, Communication Research.

[79]  P. Horby,et al.  Estimated global mortality associated with the first 12 months of 2009 pandemic influenza A H1N1 virus circulation: a modelling study. , 2012, The Lancet. Infectious diseases.

[80]  Jian-min Jin,et al.  Gender Differences in Patients With COVID-19: Focus on Severity and Mortality , 2020, Frontiers in Public Health.

[81]  Rohini K. Srihari,et al.  Using Verbs and Adjectives to Automatically Classify Blog Sentiment , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[82]  G. Eysenbach,et al.  Social Media: A Review and Tutorial of Applications in Medicine and Health Care , 2014, Journal of medical Internet research.

[83]  Louise Isham,et al.  The pandemic paradox: The consequences of COVID‐19 on domestic violence , 2020, Journal of clinical nursing.

[84]  Davy Weissenbacher,et al.  Pharmacoepidemiologic Evaluation of Birth Defects from Health-Related Postings in Social Media During Pregnancy , 2018, Drug Safety.

[85]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[86]  J. Xue,et al.  The Impact of COVID-19 Epidemic Declaration on Psychological Consequences: A Study on Active Weibo Users , 2020, International journal of environmental research and public health.

[87]  Minghui Li,et al.  Monitoring transmissibility and mortality of COVID-19 in Europe , 2020, International Journal of Infectious Diseases.

[88]  Sonya A. Grier,et al.  Race in the Marketplace and COVID-19 , 2021 .

[89]  Xiaojun Wang,et al.  Decoding the sentiment dynamics of online retailing customers: Time series analysis of social media , 2019, Comput. Hum. Behav..

[90]  Beatrice Santorini,et al.  Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision) , 1990 .

[91]  P. Biddinger,et al.  Novel Coronavirus and Old Lessons - Preparing the Health System for the Pandemic. , 2020, The New England journal of medicine.

[92]  Carmela Comito,et al.  Exploiting Social Media to enhance Clinical Decision Support , 2019, WI.

[93]  Yunpeng Ji,et al.  Potential association between COVID-19 mortality and health-care resource availability , 2020, The Lancet Global Health.

[94]  J. Xiang,et al.  Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study , 2020, The Lancet.

[95]  J. Rocklöv,et al.  The reproductive number of COVID-19 is higher compared to SARS coronavirus , 2020, Journal of travel medicine.

[96]  Xiang Xie,et al.  COVID-19 and the cardiovascular system , 2020, Nature Reviews Cardiology.

[97]  Gerald Nelson,et al.  English: An Essential Grammar , 2001 .

[98]  Rita Orji,et al.  Detecting Factors Responsible for Diabetes Prevalence in Nigeria using Social Media and Machine Learning , 2019, 2019 15th International Conference on Network and Service Management (CNSM).

[99]  Oladapo Oyebode,et al.  Using Machine Learning and Thematic Analysis Methods to Evaluate Mental Health Apps Based on User Reviews , 2020, IEEE Access.

[100]  Yi-Feng Xu,et al.  Patients with mental health disorders in the COVID-19 epidemic , 2020, The Lancet Psychiatry.

[101]  Jiun-Ruey Hu,et al.  COVID-19 and Asian American Pacific Islanders , 2020, Journal of General Internal Medicine.

[102]  M. Keshavan,et al.  COVID-19, mobile health and serious mental illness , 2020, Schizophrenia Research.

[103]  Sudha Seshadri,et al.  Physical inactivity, cardiometabolic disease, and risk of dementia: an individual-participant meta-analysis , 2019, BMJ.