Monitoring Real-time Spatial Public Health Discussions in the Context of Vaccine Hesitancy

Social media provide the potential to keep up with public discussions more quickly, at lower cost, and at potentially higher granularity and scope than do traditional surveys. This paper details a preliminary system of real-time geographical monitoring and analysis using the context of the vaccine-hesitancy discussion across the United States, a valuable backdrop for such a system because of the diverse and impactful nature of the vaccination discussions as they appear, change, and influence the public. We combine various methods in machine learning to geolocate, categorize, and classify vaccination discussions on Twitter. As a proof of concept, we show analyses with a prominent anti-vaccine discussion that validate the system with results from traditional surveys, yet also provide valuable spatial statistical power on top of such surveys on maps of the United States. We detail limitations and future work, yet still conclude that the system and the answers it enables are important because they will allow for more targeted and effective communication and reaction to the discussion as a first step towards monitoring people’s views.

[1]  N. MacDonald Vaccine hesitancy: Definition, scope and determinants. , 2015, Vaccine.

[2]  T. Murphy,et al.  Historical comparisons of morbidity and mortality for vaccine-preventable diseases in the United States. , 2007, JAMA.

[3]  Sandra Crouse Quinn,et al.  Public Acceptance of Peramivir During the 2009 H1N1 Influenza Pandemic: Implications for Other Drugs or Vaccines Under Emergency Use Authorizations , 2015, Disaster Medicine and Public Health Preparedness.

[4]  E. Cohen Vaccine Refusal, Mandatory Immunization, and the Risks of Vaccine-Preventable Diseases , 2010 .

[5]  Jacob Ratkiewicz,et al.  Predicting the Political Alignment of Twitter Users , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[8]  Sandra Crouse Quinn,et al.  Exploring communication, trust in government, and vaccination intention later in the 2009 H1N1 pandemic: results of a national survey. , 2013, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[9]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[10]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[11]  J. Brownstein,et al.  Influenza A (H7N9) and the importance of digital epidemiology. , 2013, The New England journal of medicine.

[12]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[13]  Gerjo Kok,et al.  Disease Detection or Public Opinion Reflection? Content Analysis of Tweets, Other Social Media, and Online Newspapers During the Measles Outbreak in the Netherlands in 2013 , 2015, Journal of medical Internet research.

[14]  Maarten Sap,et al.  Developing Age and Gender Predictive Lexica over Social Media , 2014, EMNLP.

[15]  È. Dubé,et al.  Strategies intended to address vaccine hesitancy: Review of published reviews. , 2015, Vaccine.

[16]  Jure Leskovec,et al.  Antisocial Behavior in Online Discussion Communities , 2015, ICWSM.

[17]  Mark Dredze,et al.  Sprite: Generalizing Topic Models with Structured Priors , 2015, TACL.

[18]  A. Getis The Analysis of Spatial Association by Use of Distance Statistics , 2010 .

[19]  David A. Broniatowski,et al.  Towards Real-Time Measurement of Public Epidemic Awareness : Monitoring Influenza Awareness through Twitter , 2015 .

[20]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[21]  Sandra Crouse Quinn,et al.  The anthrax vaccine and research: reactions from postal workers and public health professionals. , 2008, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[22]  Michael J. Paul,et al.  Twitter Improves Influenza Forecasting , 2014, PLoS currents.

[23]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[24]  Carol Glaser,et al.  Pertussis Epidemic — California, 2014 , 2014, MMWR. Morbidity and mortality weekly report.

[25]  Aron Culotta,et al.  Predicting Twitter User Demographics using Distant Supervision from Website Traffic Data , 2016, J. Artif. Intell. Res..

[26]  È. Dubé,et al.  Mapping vaccine hesitancy—Country-specific characteristics of a global phenomenon , 2014, Vaccine.

[27]  Michael J. Paul,et al.  Carmen: A Twitter Geolocation System with Applications to Public Health , 2013 .

[28]  Jeffrey Heer,et al.  Termite: visualization techniques for assessing textual topic models , 2012, AVI.

[29]  Daniel Salmon,et al.  Measles at Disneyland, a Problem for All Ages , 2015, Annals of Internal Medicine.

[30]  Sandra Crouse Quinn,et al.  Trust During the Early Stages of the 2009 H1N1 Pandemic , 2014, Journal of health communication.

[31]  Sandra Crouse Quinn,et al.  Public willingness to take a vaccine or drug under Emergency Use Authorization during the 2009 H1N1 pandemic. , 2009, Biosecurity and bioterrorism : biodefense strategy, practice, and science.

[32]  Mark Dredze,et al.  Understanding Vaccine Refusal: Why We Need Social Media Now. , 2016, American journal of preventive medicine.

[33]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.