Quantifying COVID-19 Content in the Online Health Opinion War Using Machine Learning

A huge amount of potentially dangerous COVID-19 misinformation is appearing online. Here we use machine learning to quantify COVID-19 content among online opponents of establishment health guidance, in particular vaccinations (“anti-vax”). We find that the anti-vax community is developing a less focused debate around COVID-19 than its counterpart, the pro-vaccination (“pro-vax”) community. However, the anti-vax community exhibits a broader range of “flavors” of COVID-19 topics, and hence can appeal to a broader cross-section of individuals seeking COVID-19 guidance online, e.g. individuals wary of a mandatory fast-tracked COVID-19 vaccine or those seeking alternative remedies. Hence the anti-vax community looks better positioned to attract fresh support going forward than the pro-vax community. This is concerning since a widespread lack of adoption of a COVID-19 vaccine will mean the world falls short of providing herd immunity, leaving countries open to future COVID-19 resurgences. We provide a mechanistic model that interprets these results and could help in assessing the likely efficacy of intervention strategies. Our approach is scalable and hence tackles the urgent problem facing social media platforms of having to analyze huge volumes of online health misinformation and disinformation.

[1]  Kenneth E. Shirley,et al.  LDAvis: A method for visualizing and interpreting topics , 2014 .

[2]  A. Kata A postmodern Pandora's box: anti-vaccination misinformation on the Internet. , 2010, Vaccine.

[3]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[4]  Michael Röder,et al.  Exploring the Space of Topic Coherence Measures , 2015, WSDM.

[5]  Zhenfeng Cao,et al.  Generalized Gelation Theory Describes Onset of Online Extremist Support. , 2018, Physical review letters.

[6]  David A. Broniatowski,et al.  Discordance Between Human Papillomavirus Twitter Images and Disparities in Human Papillomavirus Risk and Disease in the United States: Mixed-Methods Analysis , 2018, Journal of medical Internet research.

[7]  Y. Vorobyeva,et al.  New online ecology of adversarial aggregates: ISIS and beyond , 2016, Science.

[8]  H. Larson Blocking information on COVID-19 can fuel the spread of misinformation , 2020, Nature.

[9]  R. Leahy,et al.  Hidden resilience and adaptive dynamics of the global online hate ecology , 2019, Nature.

[10]  Tawfiq Ammari,et al.  “Thanks for your interest in our Facebook group, but it's only for dads”: Social Roles of Stay-at-Home Dads , 2016, CSCW.

[11]  Marco R. Spruit,et al.  Full-Text or Abstract? Examining Topic Coherence Scores Using Latent Dirichlet Allocation , 2017, 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[12]  David A. Broniatowski,et al.  Weaponized Health Communication: Twitter Bots and Russian Trolls Amplify the Vaccine Debate , 2018, American journal of public health.