Correlation between National Influenza Surveillance Data and Search Queries from Mobile Devices and Desktops in South Korea

Background: Digital surveillance using internet search queries can improve both the sensitivity and the timeliness of detecting a health event such as an influenza outbreak. Although mobile search volume has recently been estimated to surpass desktop search volume, and mobile search patterns differ from desktop search patterns, previous digital surveillance systems did not distinguish between mobile and desktop search queries. The purpose of this study was to compare the performance of mobile and desktop search queries for digital influenza surveillance.

Methods and Results: The study period ran from September 6, 2010 through August 30, 2014, covering four epidemiological years. Influenza-like illness (ILI) and virologic surveillance data from the Korea Centers for Disease Control and Prevention were used. A total of 210 combined queries from our previous survey work were used for this study. Weekly mobile and desktop search data were extracted from Naver, the largest search engine in Korea. Spearman's correlation analysis was used to examine the correlation of the mobile and desktop data with the ILI and virologic data in Korea. We also performed lag correlation analysis. The influenza surveillance performance of mobile search queries matched or exceeded that of desktop search queries over time: both the mean correlation coefficient of mobile search queries and the number of mobile queries with a correlation coefficient of ≥ 0.7 came to equal or exceed the corresponding desktop values over the four epidemiological years. A lag correlation analysis of up to two weeks showed similar trends.

Conclusion: Our study shows that the influenza surveillance performance of mobile search queries has come to equal or exceed that of desktop search queries over time. Future development of influenza surveillance based on search queries may need to account for the changing trend in mobile search data.
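As an illustration of the analysis described above, the following is a minimal sketch of Spearman's rank correlation and a lag correlation of up to two weeks, written in plain Python. It is not the authors' code. The function names, the direction of the lag (search volume in week t paired with ILI in week t + lag), and the short weekly series are assumptions made purely for illustration and are not data from the study.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the value at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def lagged_spearman(search, ili, max_lag=2):
    """Correlate search volume in week t with ILI in week t + lag.

    Assumption: a positive lag means the search series leads the
    surveillance series; the abstract does not state the direction used.
    """
    result = {}
    for lag in range(max_lag + 1):
        shifted_search = search[: len(search) - lag] if lag else search
        result[lag] = spearman(shifted_search, ili[lag:])
    return result


# Hypothetical weekly series (not study data): the search volume is built
# so that it leads the ILI rate by one week.
ili_rate = [2, 3, 5, 9, 14, 11, 7, 4, 3, 2]
mobile_volume = [3, 5, 9, 14, 11, 7, 4, 3, 2, 1]

for lag, rho in lagged_spearman(mobile_volume, ili_rate).items():
    print(f"lag {lag} weeks: rho = {rho:.3f}")
```

In a comparison like the one described in the abstract, the same function would be applied separately to the mobile and desktop series for each query and each epidemiological year, after which one could count the queries whose coefficient is at least 0.7. The sketch does not handle constant series, for which the correlation is undefined.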
