Improved state-level influenza nowcasting in the United States leveraging Internet-based data and network approaches

In the presence of health threats, precision public health approaches aim to provide targeted, timely, and population-specific interventions. Accurate surveillance methodologies that can estimate infectious disease activity ahead of official healthcare-based reports, at relevant spatial resolutions, are important for achieving this goal. Here we introduce a methodological framework which dynamically combines two distinct influenza tracking techniques, using an ensemble machine learning approach, to achieve improved state-level influenza activity estimates in the United States. The two predictive techniques behind the ensemble utilize (1) a self-correcting statistical method combining influenza-related Google search frequencies, information from electronic health records, and historical flu trends within each state, and (2) a network-based approach leveraging spatio-temporal synchronicities observed in historical influenza activity across states. The ensemble considerably outperforms each component method in addition to previously proposed state-specific methods for influenza tracking, with higher correlations and lower prediction errors.Real-time disease surveillance can aid mitigation of outbreaks. Here, Lu et al. combine an approach using Google search and EHR data with an approach leveraging spatiotemporal synchronicities of influenza activity across states to improve state-level influenza activity estimates in the US.

[1]  Mark A. Miller,et al.  Synchrony, Waves, and Spatial Hierarchies in the Spread of Influenza , 2006, Science.

[2]  Andy Blackburn,et al.  Google Flu trends , 2008 .

[3]  A C C Gibbs,et al.  Data Analysis , 2009, Encyclopedia of Database Systems.

[4]  Pedro M. Valero-Mora,et al.  ggplot2: Elegant Graphics for Data Analysis , 2010 .

[5]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6]  Nicholas G. Polson,et al.  Tracking Epidemics With Google Flu Trends Data and a State-Space SEIR Model , 2012, Journal of the American Statistical Association.

[7]  O Noya,et al.  Trypanosoma cruzi genotyping supports a common source of infection in a school-related oral outbreak of acute Chagas disease in Venezuela , 2013, Epidemiology and Infection.

[8]  Cécile Viboud,et al.  Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales , 2013, PLoS Comput. Biol..

[9]  Cécile Viboud,et al.  Spatial Transmission of 2009 Pandemic Influenza in the US , 2014, PLoS Comput. Biol..

[10]  Alicia Karspeck,et al.  Comparison of Filtering Methods for the Modeling and Retrospective Forecasting of Influenza Epidemics , 2014, PLoS Comput. Biol..

[11]  Andrew C. Miller,et al.  Advances in nowcasting influenza-like illness rates using search query logs , 2015, Scientific Reports.

[12]  Dotan A. Haim,et al.  Using Networks to Combine “Big Data” and Traditional Surveillance to Improve Influenza Predictions , 2015, Scientific Reports.

[13]  Ronald Rosenfeld,et al.  Flexible Modeling of Epidemics with an Empirical Bayes Framework , 2014, PLoS Comput. Biol..

[14]  Mauricio Santillana,et al.  ARGO: a model for accurate estimation of influenza epidemics using Google search data , 2015, ArXiv.

[15]  Marc Lipsitch,et al.  Inference of seasonal and pandemic influenza transmission dynamics , 2015, Proceedings of the National Academy of Sciences.

[16]  Jeffrey Shaman,et al.  Forecasting Influenza Epidemics in Hong Kong , 2015, PLoS Comput. Biol..

[17]  Mauricio Santillana,et al.  Accurate estimation of influenza epidemics using Google search data via ARGO , 2015, Proceedings of the National Academy of Sciences.

[18]  F. Collins,et al.  A new initiative on precision medicine. , 2015, The New England journal of medicine.

[19]  J S Brownstein,et al.  Cloud-based Electronic Health Records for Real-time, Region-specific Influenza Surveillance , 2016, Scientific reports.

[20]  W. Riley,et al.  Precision Public Health for the Era of Precision Medicine. , 2016, American journal of preventive medicine.

[21]  Jeffrey Shaman,et al.  Forecasting Influenza Outbreaks in Boroughs and Neighborhoods of New York City , 2016, PLoS Comput. Biol..

[22]  J. McCaw,et al.  Retrospective forecasting of the 2010–2014 Melbourne influenza seasons using multiple surveillance systems , 2016, Epidemiology and Infection.

[23]  Samuel C. Kou,et al.  Advances in using Internet searches to track dengue , 2016, PLoS Comput. Biol..

[24]  Mauricio Santillana,et al.  Editorial Commentary: Perspectives on the Future of Internet Search Engines and Biosurveillance Systems. , 2017, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[25]  Cécile Viboud,et al.  Human mobility and the spatial transmission of influenza in the United States , 2017, PLoS Comput. Biol..

[26]  John S. Brownstein,et al.  Forecasting Zika Incidence in the 2016 Latin America Outbreak Combining Traditional Disease Surveillance with Search, Social Media, and News Report Data , 2017, PLoS neglected tropical diseases.

[27]  Sasikiran Kandula,et al.  Subregional Nowcasts of Seasonal Influenza Using Search Trends , 2017, Journal of medical Internet research.

[28]  John S. Brownstein,et al.  Using electronic health records and Internet search information for accurate influenza forecasting , 2017, BMC Infectious Diseases.

[29]  Christopher. Simons,et al.  Machine learning with Python , 2017 .

[30]  Shawn Dolley,et al.  Big Data’s Role in Precision Public Health , 2018, Front. Public Health.

[31]  Ingemar J. Cox,et al.  Multi-Task Learning Improves Disease Models from Web Search , 2018, WWW.

[32]  Rok Sosic,et al.  Accurate Influenza Monitoring and Forecasting Using Novel Internet Data Streams: A Case Study in the Boston Metropolis , 2018, JMIR public health and surveillance.

[33]  Peter Dawson,et al.  Epidemic forecasts as a tool for public health: interpretation and (re)calibration , 2018, Australian and New Zealand journal of public health.