Combining Participatory Influenza Surveillance with Modeling and Forecasting: Three Alternative Approaches

Background: Influenza outbreaks affect millions of people every year. In developed countries, surveillance is usually carried out through a network of sentinel doctors who report the weekly number of influenza-like illness (ILI) cases observed among the patients they visit. Monitoring and forecasting the evolution of these outbreaks supports decision makers in designing effective interventions and allocating resources to mitigate their impact.

Objective: Describe the existing participatory surveillance approaches that have been used for modeling and forecasting of the seasonal influenza epidemic, and how they can help strengthen real-time epidemic science and provide a more rigorous understanding of epidemic conditions.

Methods: We describe three participatory surveillance systems, WISDM (Widely Internet Sourced Distributed Monitoring), Influenzanet, and Flu Near You (FNY), and show how modeling and simulation can be, or have been, combined with participatory disease surveillance to: i) measure the non-response bias in a participatory surveillance sample (using WISDM); and ii) nowcast and forecast influenza activity in different parts of the world (using Influenzanet and Flu Near You).

Results: WISDM-based results measure the participatory and sample bias for three epidemic metrics (attack rate, peak infection rate, and time-to-peak) and find the participatory bias to be the largest component of the total bias. The Influenzanet platform shows that digital participatory surveillance data combined with a realistic data-driven epidemiological model can provide both short-term and long-term forecasts of epidemic intensities, with the ground truth data lying within the 95 percent confidence intervals for most weeks. The statistical accuracy of the ensemble forecasts increases as the season progresses. The Flu Near You platform shows that participatory surveillance data provide accurate short-term flu activity forecasts and influenza activity predictions. The correlation of the HealthMap Flu Trends estimates with the observed CDC ILI rates is 0.99 for 2013-2015. Additional data sources lead to an error reduction of about 40% when compared to the estimates of the model that only incorporates CDC historical information.

Conclusions: The advantages of participatory surveillance over traditional surveillance include its timeliness, lower costs, and broader reach, but it is limited by a lack of control over the characteristics of the population sample. Modeling and simulation can help overcome this limitation and can also provide real-time and long-term forecasting of influenza activity in data-poor parts of the world.
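The two headline figures in the Results (a correlation with observed CDC ILI rates and a relative error reduction against a baseline model) can be illustrated with a minimal sketch. The abstract does not state which error metric was used, so root-mean-square error is an assumption here, and the function names and weekly values below are hypothetical toy inputs, not data from the study.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length series."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def rmse(estimates, observed):
    """Root-mean-square error (assumed metric; not specified in the abstract)."""
    squared = [(e - o) ** 2 for e, o in zip(estimates, observed)]
    return math.sqrt(sum(squared) / len(observed))

def relative_error_reduction(baseline_error, augmented_error):
    """Fraction by which the augmented model lowers the baseline error."""
    return 1.0 - augmented_error / baseline_error

if __name__ == "__main__":
    # Hypothetical weekly ILI rates (percent); illustrative only.
    observed = [1.0, 1.5, 2.5, 4.0, 3.0, 2.0]
    baseline = [1.4, 1.1, 3.1, 3.2, 3.6, 1.5]   # historical data only
    augmented = [1.2, 1.3, 2.8, 3.6, 3.3, 1.8]  # with additional sources

    print("correlation:", pearson_r(augmented, observed))
    print("error reduction:",
          relative_error_reduction(rmse(baseline, observed),
                                   rmse(augmented, observed)))
```

With real data, `observed` would hold the reported ILI rates and the two estimate series would come from the models being compared, aligned week by week.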
