Using Participatory Web-based Surveillance Data to Improve Seasonal Influenza Forecasting in Italy

Traditional surveillance of seasonal influenza is generally affected by reporting lags of at least one week and by continuous revisions of the numbers initially released. As a consequence, influenza forecasts are often limited by the time required to collect new and accurate data. On the other hand, the availability of novel data streams for disease detection can help in overcoming these issues by capturing an additional surveillance signal that can be used to complement data collected by public health agencies. In this study, we investigate how combining both traditional and participatory Web-based surveillance data can provide accurate predictions for seasonal influenza in real-time fashion. To this aim, we use two data sources available in Italy from two different monitoring systems: traditional surveillance data based on sentinel doctors reports and digital surveillance data deriving from a participatory system that monitors the influenza activity through Internet-based surveys. We integrate such digital component in a linear autoregressive exogenous (ARX) model based on traditional surveillance data and evaluate its predictive ability over the course of four influenza seasons in Italy, from 2012-2013 to 2015-2016, for each of the four weekly time horizons. Our results show that by using data extracted from a Web-based participatory surveillance system, which are usually available one week in advance with respect to traditional surveillance, it is possible to obtain accurate weekly predictions of influenza activity at national level up to four weeks in advance. Compared to a model that is only based on data from sentinel doctors, our approach significantly improves real-time forecasts of influenza activity, by increasing the Pearson's correlation up to 30% and by reducing the Mean Absolute Error up to 43% for the four weekly time horizons.

[1]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[2]  Alessandro Vespignani,et al.  Determinants of Follow-Up Participation in the Internet-Based European Influenza Surveillance Platform Influenzanet , 2015 .

[3]  Rumi Chunara,et al.  Public health for the people: participatory infectious disease surveillance in the digital age , 2014, Emerging Themes in Epidemiology.

[4]  James M. Hyman,et al.  Forecasting the 2013–2014 Influenza Season Using Wikipedia , 2014, PLoS Comput. Biol..

[5]  Christel Faes,et al.  Eight Years of the Great Influenza Survey to Monitor Influenza-Like Illness in Flanders , 2013, PloS one.

[6]  Rumi Chunara,et al.  Flu Near You: Crowdsourced Symptom Reporting Spanning 2 Influenza Seasons. , 2015, American journal of public health.

[7]  N. Stocks,et al.  ASPREN surveillance system for influenza-like illness - A comparison with FluTracking and the National Notifiable Diseases Surveillance System. , 2009, Australian family physician.

[8]  David N Durrheim,et al.  Flutracking weekly online community survey of influenza-like illness: 2013 and 2014. , 2015, Communicable diseases intelligence quarterly report.

[9]  Madhav V. Marathe,et al.  Forecasting a Moving Target: Ensemble Models for ILI Case Count Predictions , 2014, SDM.

[10]  Alina Deshpande,et al.  Global Disease Monitoring and Forecasting with Wikipedia , 2014, PLoS Comput. Biol..

[11]  Utilizing syndromic surveillance data for estimating levels of influenza circulation. , 2014, American journal of epidemiology.

[12]  Craig Dalton,et al.  Flutracking: a weekly Australian community online survey of influenza-like illness in 2006, 2007 and 2008. , 2009, Communicable diseases intelligence quarterly report.

[13]  A Vespignani,et al.  Web‐based participatory surveillance of infectious diseases: the Influenzanet participatory surveillance experience , 2013, Clinical Microbiology and Infection.

[14]  Nedialko B. Dimitrov,et al.  Optimizing Provider Recruitment for Influenza Surveillance Networks , 2012, PLoS Comput. Biol..

[15]  References , 1971 .

[16]  David N. Durrheim,et al.  Building Influenza Surveillance Pyramids in Near Real Time, Australia , 2013, Emerging infectious diseases.

[17]  John S. Brownstein,et al.  Wikipedia Usage Estimates Prevalence of Influenza-Like Illness in the United States in Near Real-Time , 2014, PLoS Comput. Biol..

[18]  David N Durrheim,et al.  Flutracking provides a measure of field influenza vaccine effectiveness, Australia, 2007-2009. , 2010, Vaccine.

[19]  Sebastian Funk,et al.  Using the internet to estimate influenza vaccine effectiveness , 2012, Expert review of vaccines.

[20]  Paola Velardi,et al.  Results from the centers for disease control and prevention’s predict the 2013–2014 Influenza Season Challenge , 2016, BMC Infectious Diseases.

[21]  David M. Pennock,et al.  Using internet searches for influenza surveillance. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[22]  Ellen Brooks-Pollock,et al.  Using an online survey of healthcare-seeking behaviour to estimate the magnitude and severity of the 2009 H1N1v influenza epidemic in England , 2011, BMC infectious diseases.

[23]  D. Paolotti,et al.  Ten-year performance of Influenzanet: ILI time series, risks, vaccine effects, and care-seeking behaviour. , 2015, Epidemics.

[24]  Alicia Karspeck,et al.  Real-Time Influenza Forecasts during the 2012–2013 Season , 2013, Nature Communications.

[25]  David N. Durrheim,et al.  Online Flutracking Survey of Influenza-like Illness during Pandemic (H1N1) 2009, Australia , 2010, Emerging infectious diseases.

[26]  W. Edmunds,et al.  Incidence and risk factors for influenza-like-illness in the UK: online surveillance using Flusurvey , 2014, BMC Infectious Diseases.

[27]  Mark Dredze,et al.  Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance , 2015, PLoS Comput. Biol..

[28]  Nello Cristianini,et al.  Nowcasting Events from the Social Web with Statistical Learning , 2012, TIST.

[29]  Rumi Chunara,et al.  Estimating influenza attack rates in the United States using a participatory cohort , 2015, Scientific Reports.

[30]  M. Smolinski,et al.  Flu Near You: An Online Self-reported Influenza Surveillance System in the USA , 2013, Online Journal of Public Health Informatics.

[31]  Alberto Maria Segre,et al.  The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic , 2011, PloS one.

[32]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[33]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[34]  Naren Ramakrishnan,et al.  Flu Gone Viral: Syndromic Surveillance of Flu on Twitter Using Temporal Topic Models , 2014, 2014 IEEE International Conference on Data Mining.

[35]  Adam Wade Crawley Flu Near You: Comparing Crowdsourced Reports of Influenza-like Illness to the CDC Outpatient Influenza-like Illness Surveillance Network, October 2012 to March 2014 , 2014 .

[36]  M. Kretzschmar,et al.  An evidence synthesis approach to estimating the incidence of seasonal influenza in the Netherlands , 2013, Influenza and other respiratory viruses.

[37]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[38]  D. Paolotti,et al.  Participatory Online Surveillance as a Supplementary Tool to Sentinel Doctors for Influenza-Like Illness Surveillance in Italy , 2017, PloS one.

[39]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[40]  Caroline O. Buckee,et al.  Digital Epidemiology , 2012, PLoS Comput. Biol..

[41]  Daniela Paolotti,et al.  The representativeness of a European multi-center network for influenza-like-illness participatory surveillance , 2014, BMC Public Health.

[42]  Michael J. Paul,et al.  Twitter Improves Influenza Forecasting , 2014, PLoS currents.

[43]  Natasha L. Tilston,et al.  Internet-based surveillance of Influenza-like-illness in the UK during the 2009 H1N1 influenza pandemic , 2010, BMC public health.

[44]  Daniela Perrotta,et al.  Social Data Mining and Seasonal Influenza Forecasts: The FluOutlook Platform , 2015, ECML/PKDD.