Syndromic Surveillance Models Using Web Data: The Case of Influenza in Greece and Italy Using Google Trends

Background An extended discussion and research has been performed in recent years using data collected through search queries submitted via the Internet. It has been shown that the overall activity on the Internet is related to the number of cases of an infectious disease outbreak. Objective The aim of the study was to define a similar correlation between data from Google Trends and data collected by the official authorities of Greece and Europe by examining the development and the spread of seasonal influenza in Greece and Italy. Methods We used multiple regressions of the terms submitted in the Google search engine related to influenza for the period from 2011 to 2012 in Greece and Italy (sample data for 104 weeks for each country). We then used the autoregressive integrated moving average statistical model to determine the correlation between the Google search data and the real influenza cases confirmed by the aforementioned authorities. Two methods were used: (1) a flu score was created for the case of Greece and (2) comparison of data from a neighboring country of Greece, which is Italy. Results The results showed that there is a significant correlation that can help the prediction of the spread and the peak of the seasonal influenza using data from Google searches. The correlation for Greece for 2011 and 2012 was .909 and .831, respectively, and correlation for Italy for 2011 and 2012 was .979 and .933, respectively. The prediction of the peak was quite precise, providing a forecast before it arrives to population. Conclusions We can create an Internet surveillance system based on Google searches to track influenza in Greece and Italy.

[1]  Asep Saepudin,et al.  MUNCULNYA RESOLUSI WHA ( WORLD HEALTH ASSEMBLY )NO. 64/56 DALAM KERJASAMA PENANGANAN PANDEMI INFLUENZA DENGAN KERANGKA GISRS( GLOBAL INFLUENZA SURVEILLANCE AND RESPON SYSTEM ) WHO , 2013 .

[2]  Y. Gel,et al.  Influenza Forecasting with Google Flu Trends , 2013, PloS one.

[3]  Mehdi Jalalpour,et al.  Google Flu Trends Spatial Variability Validated Against Emergency Department Influenza-Related Visits , 2016, Journal of medical Internet research.

[4]  Miguel-Angel Sicilia,et al.  Syndromic surveillance models using Web data: The case of scarlet fever in the UK , 2012, Informatics for health & social care.

[5]  Cécile Viboud,et al.  Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales , 2013, PLoS Comput. Biol..

[6]  Gunther Eysenbach,et al.  Infodemiology: Tracking Flu-Related Searches on the Web for Syndromic Surveillance , 2006, AMIA.

[7]  G. Eysenbach Infodemiology: The epidemiology of (mis)information. , 2002, The American journal of medicine.

[8]  Scott Levin,et al.  Google flu trends: correlation with emergency department influenza rates and crowding metrics , 2011 .

[9]  Chang-Gun Lee,et al.  Estimating Influenza Outbreaks Using Both Search Engine Query Data and Social Media Data in South Korea , 2016, Journal of medical Internet research.

[10]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[11]  Tobias Preis,et al.  Adaptive nowcasting of influenza outbreaks using Google searches , 2014, Royal Society Open Science.

[12]  Hsinchun Chen,et al.  Syndromic surveillance systems , 2008, Annu. Rev. Inf. Sci. Technol..