Age-Related Differences in the Accuracy of Web Query-Based Predictions of Influenza-Like Illness

Background Web queries are now widely used for modeling, nowcasting and forecasting influenza-like illness (ILI). However, given that ILI attack rates vary significantly across ages, in terms of both magnitude and timing, little is known about whether the association between ILI morbidity and ILI-related queries is comparable across different age-groups. The present study aimed to investigate features of the association between ILI morbidity and ILI-related query volume from the perspective of age. Methods Since Google Flu Trends is unavailable in Italy, Google Trends was used to identify entry terms that correlated highly with official ILI surveillance data. All-age and age-class-specific modeling was performed by means of linear models with generalized least-square estimation. Hold-out validation was used to quantify prediction accuracy. For purposes of comparison, predictions generated by exponential smoothing were computed. Results Five search terms showed high correlation coefficients of > .6. In comparison with exponential smoothing, the all-age query-based model correctly predicted the peak time and yielded a higher correlation coefficient with observed ILI morbidity (.978 vs. .929). However, query-based prediction of ILI morbidity was associated with a greater error. Age-class-specific query-based models varied significantly in terms of prediction accuracy. In the 0–4 and 25–44-year age-groups, these did well and outperformed exponential smoothing predictions; in the 15–24 and ≥ 65-year age-classes, however, the query-based models were inaccurate and highly overestimated peak height. In all but one age-class, peak timing predicted by the query-based models coincided with observed timing. Conclusions The accuracy of web query-based models in predicting ILI morbidity rates could differ among ages. Greater age-specific detail may be useful in flu query-based studies in order to account for age-specific features of the epidemiology of ILI.

[1]  Jae Ho Lee,et al.  Correlation between National Influenza Surveillance Data and Google Trends in South Korea , 2013, PloS one.

[2]  J. Fox Time-Series Regression and Generalized Least Squares , 2002 .

[3]  Cécile Viboud,et al.  Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales , 2013, PLoS Comput. Biol..

[4]  Laura M. Glass,et al.  Social contact networks for the spread of pandemic influenza in children and teenagers , 2008, BMC public health.

[5]  L. Brammer,et al.  Estimating influenza incidence and rates of influenza‐like illness in the outpatient setting , 2012, Influenza and other respiratory viruses.

[6]  F. Babl,et al.  Health information seeking by parents in the Internet age , 2008, Journal of paediatrics and child health.

[7]  Ian Portelli,et al.  Attack Rates Assessment of the 2009 Pandemic H1N1 Influenza A in Children and Their Contacts: A Systematic Review and Meta-Analysis , 2012, PloS one.

[8]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[9]  G. Eysenbach Infodemiology and Infoveillance: Framework for an Emerging Set of Public Health Informatics Methods to Analyze Search, Communication and Publication Behavior on the Internet , 2009, Journal of medical Internet research.

[10]  Louise Pelletier,et al.  Age-specific Differences in Influenza A Epidemic Curves: Do Children Drive the Spread of Influenza Epidemics? , 2011, American journal of epidemiology.

[11]  Sérgio Matos,et al.  Analysing Twitter and web queries for flu trend prediction , 2014, Theoretical Biology and Medical Modelling.

[12]  Aj Elliot Syndromic surveillance: the next phase of public health monitoring during the H1N1 influenza pandemic? , 2009, Euro surveillance : bulletin Europeen sur les maladies transmissibles = European communicable disease bulletin.

[13]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[14]  Lewis Spitz,et al.  Information on the World Wide Web--how useful is it for parents? , 2007, Journal of pediatric surgery.

[15]  C. Goss,et al.  Monitoring Influenza Activity in the United States: A Comparison of Traditional Surveillance Systems with Google Flu Trends , 2011, PloS one.

[16]  A. Tricco,et al.  Natural attack rate of influenza in unvaccinated children and adults: a meta-regression analysis , 2014, BMC Infectious Diseases.

[17]  Alexander Domnich,et al.  An overview of current and potential use of information and communication technologies for immunization promotion among adolescents , 2013, Human vaccines & immunotherapeutics.

[18]  John S. Brownstein,et al.  Wikipedia Usage Estimates Prevalence of Influenza-Like Illness in the United States in Near Real-Time , 2014, PLoS Comput. Biol..

[19]  Alexander Domnich,et al.  Demand-based web surveillance of sexually transmitted infections in Russia , 2014, International Journal of Public Health.

[20]  Farzad Mostashari,et al.  Monitoring the Impact of Influenza by Age: Emergency Department Fever and Respiratory Complaint Surveillance in New York City , 2007, PLoS medicine.

[21]  J. Brownstein,et al.  A Case Study of the New York City 2012-2013 Influenza Season With Daily Geocoded Twitter Data From Temporal and Spatiotemporal Perspectives , 2014, Journal of medical Internet research.

[22]  David M. Pennock,et al.  Using internet searches for influenza surveillance. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[23]  C. Koppeschaar,et al.  Comparison of five influenza surveillance systems during the 2009 pandemic and their association with media attention , 2013, BMC Public Health.

[24]  E. Nsoesie,et al.  Monitoring Influenza Epidemics in China with Search Query from Baidu , 2013, PloS one.

[25]  Jay M Bernhardt,et al.  Online Pediatric Information Seeking Among Mothers of Young Children: Results From a Qualitative Study Using Focus Groups , 2004, Journal of medical Internet research.

[26]  C. Signorelli,et al.  Deaths after Fluad flu vaccine and the epidemic of panic in Italy , 2015, BMJ : British Medical Journal.

[27]  A. Dugas,et al.  Google Flu Trends: correlation with emergency department influenza rates and crowding metrics. , 2011, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[28]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[29]  A. Hulth,et al.  Web Queries as a Source for Syndromic Surveillance , 2009, PloS one.

[30]  Gunther Eysenbach,et al.  Infodemiology and infoveillance tracking online health information and cyberbehavior for public health. , 2011, American journal of preventive medicine.

[31]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[32]  D. Cummings,et al.  Prediction of Dengue Incidence Using Search Query Surveillance , 2011, PLoS neglected tropical diseases.

[33]  M. L. Cristina,et al.  Influenza epidemiology in Italy two years after the 2009–2010 pandemic , 2013, Human vaccines & immunotherapeutics.

[34]  Cécile Viboud,et al.  Risk factors of influenza transmission in households. , 2004, The British journal of general practice : the journal of the Royal College of General Practitioners.

[35]  Alessandro Vespignani,et al.  Google Flu Trends Still Appears Sick: An Evaluation of the 2013-2014 Flu Season , 2014 .

[36]  T. Bernardo,et al.  Scoping Review on Search Queries and Social Media for Disease Surveillance: A Chronology of Innovation , 2013, Journal of medical Internet research.

[37]  S. Rutherford,et al.  Using Google Trends for Influenza Surveillance in South China , 2013, PloS one.

[38]  Vittoria Colizza,et al.  Evaluating the Feasibility and Participants’ Representativeness of an Online Nationwide Surveillance System for Influenza in France , 2013, PloS one.

[39]  Jonathan Taitz,et al.  Use of the Internet by parents of paediatric patients , 2006, Journal of paediatrics and child health.

[40]  Gail M Williams,et al.  Internet-based surveillance systems for monitoring emerging infectious diseases , 2013, The Lancet Infectious Diseases.

[41]  J. Aucott,et al.  The utility of "Google Trends" for epidemiological research: Lyme disease as an example. , 2010, Geospatial health.

[42]  Ken P Kleinman,et al.  Identifying pediatric age groups for influenza vaccination using a real-time regional surveillance system. , 2005, American journal of epidemiology.

[43]  R. Gasparini,et al.  A pharmacoeconomic appraisal of the strategy to tackle the H1N1v (A/California/07/09) pandemic in Italy: relevance of the CIRI-IV surveillance system. , 2011, Journal of preventive medicine and hygiene.

[44]  Alex R. Cook,et al.  Internet Search Limitations and Pandemic Influenza, Singapore , 2010, Emerging infectious diseases.

[45]  F. Ansaldi,et al.  Emergency department syndromic surveillance system for early detection of 5 syndromes: a pilot project in a reference teaching hospital in Genoa, Italy. , 2008, Journal of preventive medicine and hygiene.

[46]  Jon Skranes,et al.  Internet use among mothers of young children in Norway—a survey of Internet habits and perceived parental competence when caring for a sick child , 2014, Journal of Public Health.

[47]  Gunther Eysenbach,et al.  Infodemiology: Tracking Flu-Related Searches on the Web for Syndromic Surveillance , 2006, AMIA.

[48]  J. Fox Nonparametric Regression Appendix to An R and S-PLUS Companion to Applied Regression , 2002 .

[49]  Xi-chuan Zhou,et al.  Notifiable infectious disease surveillance with data collected by search engine , 2010, Journal of Zhejiang University SCIENCE C.

[50]  A. Fauci Seasonal and pandemic influenza preparedness: science and countermeasures. , 2006, The Journal of infectious diseases.

[51]  Gerardo Chowell,et al.  Severe respiratory disease concurrent with the circulation of H1N1 influenza. , 2009, The New England journal of medicine.

[52]  M. Santillana,et al.  What can digital disease detection learn from (an external revision to) Google Flu Trends? , 2014, American journal of preventive medicine.

[53]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.