The Application of Internet-Based Sources for Public Health Surveillance (Infoveillance): Systematic Review

Background Public health surveillance is based on the continuous and systematic collection, analysis, and interpretation of data. This informs the development of early warning systems to monitor epidemics and documents the impact of intervention measures. The introduction of digital data sources, and specifically sources available on the internet, has impacted the field of public health surveillance. New opportunities enabled by the underlying availability and scale of internet-based sources (IBSs) have paved the way for novel approaches for disease surveillance, exploration of health communities, and the study of epidemic dynamics. This field and approach is also known as infodemiology or infoveillance. Objective This review aimed to assess research findings regarding the application of IBSs for public health surveillance (infodemiology or infoveillance). To achieve this, we have presented a comprehensive systematic literature review with a focus on these sources and their limitations, the diseases targeted, and commonly applied methods. Methods A systematic literature review was conducted targeting publications between 2012 and 2018 that leveraged IBSs for public health surveillance, outbreak forecasting, disease characterization, diagnosis prediction, content analysis, and health-topic identification. The search results were filtered according to previously defined inclusion and exclusion criteria. Results Spanning a total of 162 publications, we determined infectious diseases to be the preferred case study (108/162, 66.7%). Of the eight categories of IBSs (search queries, social media, news, discussion forums, websites, web encyclopedia, and online obituaries), search queries and social media were applied in 95.1% (154/162) of the reviewed publications. We also identified limitations in representativeness and biased user age groups, as well as high susceptibility to media events by search queries, social media, and web encyclopedias. Conclusions IBSs are a valuable proxy to study illnesses affecting the general population; however, it is important to characterize which diseases are best suited for the available sources; the literature shows that the level of engagement among online platforms can be a potential indicator. There is a necessity to understand the population’s online behavior; in addition, the exploration of health information dissemination and its content is significantly unexplored. With this information, we can understand how the population communicates about illnesses online and, in the process, benefit public health.

[1]  Eric Horvitz,et al.  Social media as a measurement tool of depression in populations , 2013, WebSci.

[2]  Yong Huang,et al.  Dynamic Forecasting of Zika Epidemics Using Google Trends , 2016, bioRxiv.

[3]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[4]  Hsiao-Wei Yu,et al.  Forecasting the Incidence of Dementia and Dementia-Related Outpatient Visits With Google Trends: Evidence From Taiwan , 2015, Journal of medical Internet research.

[5]  T. Bernardo,et al.  Scoping Review on Search Queries and Social Media for Disease Surveillance: A Chronology of Innovation , 2013, Journal of medical Internet research.

[6]  K. Hirayama,et al.  Prediction of Zika-confirmed cases in Brazil and Colombia using Google Trends , 2018, Epidemiology and Infection.

[7]  Martin Szomszor,et al.  #swineflu , 2014, ACM Trans. Manag. Inf. Syst..

[8]  Robert V. Tauxe,et al.  Public Health Surveillance: A Tool for Targeting and Monitoring Interventions , 2006 .

[9]  Avare Stewart,et al.  Epidemic Intelligence for the Crowd, by the Crowd , 2012, ICWSM.

[10]  Ralf Reintjes,et al.  Google dengue trends: An indicator of epidemic behavior. The Venezuelan Case , 2017, Int. J. Medical Informatics.

[11]  Avinash R. Patwardhan,et al.  Comparison: Flu Prescription Sales Data from a Retail Pharmacy in the US with Google Flu Trends and US ILINet (CDC) Data as Flu Activity Indicator , 2012, PloS one.

[12]  James M. McCaw,et al.  Forecasting influenza outbreak dynamics in Melbourne from Internet search query surveillance data , 2016, Influenza and other respiratory viruses.

[13]  John S. Brownstein,et al.  Evaluation of Internet-Based Dengue Query Data: Google Dengue Trends , 2014, PLoS neglected tropical diseases.

[14]  Ingemar J. Cox,et al.  Enhancing Feature Selection Using Word Embeddings: The Case of Flu Surveillance , 2017, WWW.

[15]  Lex van Velsen,et al.  Should Health Organizations Use Web 2.0 Media in Times of an Infectious Disease Crisis? An In-depth Qualitative Study of Citizens’ Information Behavior During an EHEC Outbreak , 2012, Journal of medical Internet research.

[16]  A. Hulth,et al.  Syndromic surveillance for local outbreak detection and awareness: evaluating outbreak signals of acute gastroenteritis in telephone triage, web-based queries and over-the-counter pharmacy sales , 2013, Epidemiology and Infection.

[17]  Wagner Meira,et al.  Dengue prediction by the web: Tweets are a useful tool for estimating and forecasting Dengue at country and city level , 2017, PLoS neglected tropical diseases.

[18]  M. Martinez-Bakker,et al.  Digital epidemiology reveals global childhood disease seasonality and the effects of immunization , 2016, Proceedings of the National Academy of Sciences.

[19]  T. Kass-Hout,et al.  Social media in public health. , 2013, British medical bulletin.

[20]  Miguel-Angel Sicilia,et al.  Syndromic surveillance models using Web data: The case of scarlet fever in the UK , 2012, Informatics for health & social care.

[21]  Ozgur M. Araz,et al.  Using Google Flu Trends data in forecasting influenza-like-illness related ED visits in Omaha, Nebraska. , 2014, The American journal of emergency medicine.

[22]  O. Horstick,et al.  Dengue disease surveillance: an updated systematic literature review , 2014, Tropical medicine & international health : TM & IH.

[23]  Mark Dredze,et al.  Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance , 2015, PLoS Comput. Biol..

[24]  Hong-Jun Yoon,et al.  A novel web informatics approach for automated surveillance of cancer mortality trends , 2016, J. Biomed. Informatics.

[25]  W. John Boscardin,et al.  Validating the Use of Google Trends to Enhance Pertussis Surveillance in California , 2015, PLoS currents.

[26]  Ingemar J. Cox,et al.  On Infectious Intestinal Disease Surveillance using Social Media Content , 2016, Digital Health.

[27]  Michael J. Paul,et al.  Discovering Health Topics in Social Media Using Topic Models , 2014, PloS one.

[28]  Dietrich Rebholz-Schuhmann,et al.  Disease mentions in airport and hospital geolocations expose dominance of news events for disease concerns , 2018, J. Biomed. Semant..

[29]  Anthony N. Nguyen,et al.  Automatic detection of tweets reporting cases of influenza like illnesses in Australia , 2015, Health Inf. Sci. Syst..

[30]  Soon Ae Chun,et al.  Knowledge-Based Tweet Classification for Disease Sentiment Monitoring , 2016, Sentiment Analysis and Ontology Engineering.

[31]  Mauricio Santillana,et al.  Accurate estimation of influenza epidemics using Google search data via ARGO , 2015, Proceedings of the National Academy of Sciences.

[32]  Atsushi Hozawa,et al.  Cancer patients on Twitter: a novel patient community on social media , 2012, BMC Research Notes.

[33]  Ophir Frieder,et al.  Health-related hypothesis generation using social media data , 2015, Social Network Analysis and Mining.

[34]  R. Buchbinder,et al.  Investigating Subjective Experience and the Influence of Weather Among Individuals With Fibromyalgia: A Content Analysis of Twitter , 2017, JMIR public health and surveillance.

[35]  Jesse O'Shea,et al.  Digital disease detection: A systematic review of event-based internet biosurveillance systems , 2017, International Journal of Medical Informatics.

[36]  Chang-Gun Lee,et al.  Estimating Influenza Outbreaks Using Both Search Engine Query Data and Social Media Data in South Korea , 2016, Journal of medical Internet research.

[37]  Robert L Cook,et al.  Evaluating Google, Twitter, and Wikipedia as Tools for Influenza Surveillance Using Bayesian Change Point Analysis: A Comparative Analysis , 2016, JMIR public health and surveillance.

[38]  J. Brownstein,et al.  Evaluation of Epidemic Intelligence Systems Integrated in the Early Alerting and Reporting Project for the Detection of A/H5N1 Influenza Events , 2013, PloS one.

[39]  Amod Amritphale,et al.  Analysis of YouTube as a Source of Information for West Nile Virus Infection , 2014, Clinical Medicine & Research.

[40]  Alina Deshpande,et al.  Global Disease Monitoring and Forecasting with Wikipedia , 2014, PLoS Comput. Biol..

[41]  R. Phalkey,et al.  Challenges to the surveillance of non-communicable diseases – a review of selected approaches , 2015, BMC Public Health.

[42]  Y. Gel,et al.  Influenza Forecasting with Google Flu Trends , 2013, PloS one.

[43]  Daihai He,et al.  News trends and web search query of HIV/AIDS in Hong Kong , 2017, PloS one.

[44]  Kamal Kishore,et al.  Google Search Trends Predicting Disease Outbreaks: An Analysis from India , 2018, Healthcare informatics research.

[45]  Ophir Frieder,et al.  Coughing, sneezing, and aching online: Twitter and the volume of influenza-like illness in a pediatric hospital , 2017, PloS one.

[46]  Gerardo Chowell,et al.  Characterizing Ebola Transmission Patterns Based on Internet News Reports. , 2016, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[47]  Alok N. Choudhary,et al.  Mining social media streams to improve public health allergy surveillance , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[48]  Cécile Viboud,et al.  Reassessing Google Flu Trends Data for Detection of Seasonal and Pandemic Influenza: A Comparative Epidemiological Study at Three Geographic Scales , 2013, PLoS Comput. Biol..

[49]  James A Gillespie,et al.  Internet Search Patterns of Human Immunodeficiency Virus and the Digital Divide in the Russian Federation: Infoveillance Study , 2013, Journal of medical Internet research.

[50]  K. Gold,et al.  Internet message boards for pregnancy loss: who's on-line and why? , 2012, Women's health issues : official publication of the Jacobs Institute of Women's Health.

[51]  Marwan Bikdash,et al.  From social media to public health surveillance: Word embedding based clustering method for twitter classification , 2017, SoutheastCon 2017.

[52]  David M. Pennock,et al.  Using internet searches for influenza surveillance. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[53]  Ashlynn R. Daughton,et al.  Measuring Global Disease with Wikipedia: Success, Failure, and a Research Agenda , 2017, CSCW.

[54]  Miguel-Angel Sicilia,et al.  Syndromic Surveillance Models Using Web Data: The Case of Influenza in Greece and Italy Using Google Trends , 2017, JMIR public health and surveillance.

[55]  Chengliang Chai,et al.  Identifying Potential Norovirus Epidemics in China via Internet Surveillance , 2017, Journal of medical Internet research.

[56]  Michael J. Paul,et al.  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic , 2013, PloS one.

[57]  Mauricio Santillana,et al.  Utilizing Nontraditional Data Sources for Near Real-Time Estimation of Transmission Dynamics During the 2015-2016 Colombian Zika Virus Disease Outbreak , 2016, JMIR public health and surveillance.

[58]  Diana Borsa,et al.  Automatic Identification of Web-Based Risk Markers for Health Events , 2015, Journal of medical Internet research.

[59]  A. Gumel,et al.  Emergency department and ‘Google flu trends’ data as syndromic surveillance indicators for seasonal influenza , 2014, Epidemiology and Infection.

[60]  Max L. Wilson,et al.  Finding information about mental health in microblogging platforms: a case study of depression , 2014, IIiX.

[61]  Marcel Salathé,et al.  On the ground validation of online diagnosis with Twitter and medical records , 2014, WWW.

[62]  Anthony A. Donato,et al.  YouTube as a Source of Information on Ebola Virus Disease , 2015, North American journal of medical sciences.

[63]  Sérgio Matos,et al.  Analysing Twitter and web queries for flu trend prediction , 2014, Theoretical Biology and Medical Modelling.

[64]  Brian H. Spitzberg,et al.  The Reliability of Tweets as a Supplementary Method of Seasonal Influenza Surveillance , 2014, Journal of medical Internet research.

[65]  Sung-Hoon Kim,et al.  Correlation between National Influenza Surveillance Data and Search Queries from Mobile Devices and Desktops in South Korea , 2016, PloS one.

[66]  Ravikiran Vatrapu,et al.  A supervised machine learning study of online discussion forums about type-2 diabetes , 2017, 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom).

[67]  Olga Baysal,et al.  Mining Twitter Data for Influenza Detection and Surveillance , 2016, 2016 IEEE/ACM International Workshop on Software Engineering in Healthcare Systems (SEHS).

[68]  Han Zhao,et al.  Monitoring Epidemic Alert Levels by Analyzing Internet Search Volume , 2013, IEEE Transactions on Biomedical Engineering.

[69]  J S Brownstein,et al.  An overview of internet biosurveillance. , 2013, Clinical microbiology and infection : the official publication of the European Society of Clinical Microbiology and Infectious Diseases.

[70]  Naren Ramakrishnan,et al.  Syndromic surveillance of Flu on Twitter using weakly supervised temporal topic models , 2016, Data Mining and Knowledge Discovery.

[71]  Muhammad Imran,et al.  Classifying Information from Microblogs during Epidemics , 2017, DH.

[72]  S. Lattanzi,et al.  Why do people search Wikipedia for information on multiple sclerosis? , 2018, Multiple sclerosis and related disorders.

[73]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[74]  Carol A Gotway Crawford,et al.  A New Source of Data for Public Health Surveillance: Facebook Likes , 2015, Journal of medical Internet research.

[75]  Alexander Domnich,et al.  Age-Related Differences in the Accuracy of Web Query-Based Predictions of Influenza-Like Illness , 2015, PloS one.

[76]  Shaoyong Chen,et al.  Comparision of microblogging service between Sina Weibo and Twitter , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[77]  Alok N. Choudhary,et al.  Forecasting Influenza Levels Using Real-Time Social Media Streams , 2017, 2017 IEEE International Conference on Healthcare Informatics (ICHI).

[78]  Andrew Crooks,et al.  The Measles Vaccination Narrative in Twitter: A Quantitative Analysis , 2016, JMIR public health and surveillance.

[79]  B. Lewis,et al.  Methods of using real-time social media technologies for detection and remote monitoring of HIV outcomes. , 2014, Preventive medicine.

[80]  David A Asch,et al.  Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. , 2013, Resuscitation.

[81]  John S. Brownstein,et al.  Wikipedia Usage Estimates Prevalence of Influenza-Like Illness in the United States in Near Real-Time , 2014, PLoS Comput. Biol..

[82]  Dotan A. Haim,et al.  Using Networks to Combine “Big Data” and Traditional Surveillance to Improve Influenza Predictions , 2015, Scientific Reports.

[83]  Wangjian Zhang,et al.  Predicting the hand, foot, and mouth disease incidence using search engine query data and climate variables: an ecological study in Guangdong, China , 2017, BMJ Open.

[84]  D A Asch,et al.  The content of social media's shared images about Ebola: a retrospective study. , 2015, Public health.

[85]  Joon Lee,et al.  Disease Monitoring and Health Campaign Evaluation Using Google Search Activities for HIV and AIDS, Stroke, Colorectal Cancer, and Marijuana Use in Canada: A Retrospective Observational Study , 2016, JMIR public health and surveillance.

[86]  Mizuki Morita,et al.  Twitter Catches The Flu: Detecting Influenza Epidemics using Twitter , 2011, EMNLP.

[87]  Gil-Young Song,et al.  Predicting National Suicide Numbers with Social Media Data , 2013, PloS one.

[88]  Alberto Maria Segre,et al.  Eliciting Disease Data from Wikipedia Articles , 2015, Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media.

[89]  James M. Hyman,et al.  Forecasting the 2013–2014 Influenza Season Using Wikipedia , 2014, PLoS Comput. Biol..

[90]  Ming Zhang,et al.  Automatic classification of diseases from free-text death certificates for real-time surveillance , 2015, BMC Medical Informatics and Decision Making.

[91]  Benjamin A Lopman,et al.  Use of Internet Search Data to Monitor Rotavirus Vaccine Impact in the United States, United Kingdom, and Mexico , 2018, Journal of the Pediatric Infectious Diseases Society.

[92]  Wenli Zhang,et al.  Extracting Signals from Social Media for Chronic Disease Surveillance , 2016, Digital Health.

[93]  John S. Brownstein,et al.  Using electronic health records and Internet search information for accurate influenza forecasting , 2017, BMC Infectious Diseases.

[94]  Jang Seok Oh,et al.  Use of Hangeul Twitter to Track and Predict Human Influenza Infection , 2013, PloS one.

[95]  Thomas Wetter,et al.  Screening Internet forum participants for depression symptoms by assembling and enhancing multiple NLP methods , 2015, Comput. Methods Programs Biomed..

[96]  Craig Dalton,et al.  Syndromic surveillance: is it a useful tool for local outbreak detection? , 2006, Journal of Epidemiology and Community Health.

[97]  E. Nsoesie,et al.  Monitoring Influenza Epidemics in China with Search Query from Baidu , 2013, PloS one.

[98]  Sihem Amer-Yahia,et al.  Health Monitoring on Social Media over Time , 2016, IEEE Transactions on Knowledge and Data Engineering.

[99]  Declan Butler,et al.  When Google got flu wrong , 2013, Nature.

[100]  Gisele L. Pappa,et al.  Enhancement of Epidemiological Models for Dengue Fever Based on Twi er Data , 2018 .

[101]  Wenjun Ma,et al.  Dengue Baidu Search Index data can improve the prediction of local dengue epidemic: A case study in Guangzhou, China , 2017, PLoS neglected tropical diseases.

[102]  ChenHsinchun,et al.  Text mining self-disclosing health information for public health service , 2014 .

[103]  Savino Sciascia,et al.  What can Google and Wikipedia can tell us about a disease? Big Data trends analysis in Systemic Lupus Erythematosus , 2017, Int. J. Medical Informatics.

[104]  Raja Jurdak,et al.  Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets , 2017, WWW.

[105]  Elad Yom-Tov Ebola data from the Internet: An Opportunity for Syndromic Surveillance or a News Event? , 2015, Digital Health.

[106]  J. Brownstein,et al.  A Case Study of the New York City 2012-2013 Influenza Season With Daily Geocoded Twitter Data From Temporal and Spatiotemporal Perspectives , 2014, Journal of medical Internet research.

[107]  Aron Culotta,et al.  Estimating county health statistics with twitter , 2014, CHI.

[108]  John Robert Bautista,et al.  Using Google Trends to Examine the Spatio-Temporal Incidence and Behavioral Patterns of Dengue Disease: A Case Study in Metropolitan Manila, Philippines , 2018, bioRxiv.

[109]  J. Brownstein,et al.  Early detection of disease outbreaks using the Internet , 2009, Canadian Medical Association Journal.

[110]  S. T. Rosenbloom,et al.  A Scalable Framework to Detect Personal Health Mentions on Twitter , 2015, Journal of Medical Internet Research.

[111]  Paola Velardi,et al.  Can Twitter Be a Source of Information on Allergy? Correlation of Pollen Counts with Tweets Reporting Symptoms of Allergic Rhinoconjunctivitis and Names of Antihistamine Drugs , 2015, PloS one.

[112]  Weisi Guo,et al.  Google Trends can improve surveillance of Type 2 diabetes , 2017, Scientific Reports.

[113]  Gerjo Kok,et al.  Disease Detection or Public Opinion Reflection? Content Analysis of Tweets, Other Social Media, and Online Newspapers During the Measles Outbreak in the Netherlands in 2013 , 2015, Journal of medical Internet research.

[114]  Ophir Frieder,et al.  A framework for detecting public health trends with Twitter , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[115]  Ye Wen,et al.  Monitoring seasonal influenza epidemics by using internet search data with an ensemble penalized regression model , 2017, Scientific Reports.

[116]  Naren Ramakrishnan,et al.  Flu Gone Viral: Syndromic Surveillance of Flu on Twitter Using Temporal Topic Models , 2014, 2014 IEEE International Conference on Data Mining.

[117]  W. John Boscardin,et al.  Evaluating Google Flu Trends in Latin America: Important Lessons for the Next Phase of Digital Disease Detection , 2017, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[118]  Gabriela Ochoa,et al.  Infoveillance of infectious diseases in USA: STDs, tuberculosis, and hepatitis , 2018, J. Big Data.

[119]  Mark Dredze,et al.  Separating Fact from Fear: Tracking Flu Infections on Twitter , 2013, NAACL.

[120]  Jae Ho Lee,et al.  Correlation between National Influenza Surveillance Data and Google Trends in South Korea , 2013, PloS one.

[121]  Kui Liu,et al.  Does Eating Chicken Feet With Pickled Peppers Cause Avian Influenza? Observational Case Study on Chinese Social Media During the Avian Influenza A (H7N9) Outbreak , 2018, JMIR public health and surveillance.

[122]  Umar Saif,et al.  FluBreaks: early epidemic detection from Google flu trends. , 2012, Journal of medical Internet research.

[123]  Glen A. Coppersmith,et al.  Understanding Depressive Symptoms and Psychosocial Stressors on Twitter: A Corpus-Based Study , 2017, Journal of medical Internet research.

[124]  J. Eastwood,et al.  What Are Young Adults Saying About Mental Health? An Analysis of Internet Blogs , 2012, Journal of medical Internet research.

[125]  Travis C Porco,et al.  Surveillance Tools Emerging From Search Engines and Social Media Data for Determining Eye Disease Patterns. , 2016, JAMA ophthalmology.

[126]  Natalie Dixon,et al.  You are what you tweet , 2013, INTR.

[127]  Sunmoo Yoon,et al.  What can we learn about the Ebola outbreak from tweets? , 2015, American journal of infection control.

[128]  Zhiwei Xu,et al.  Monitoring Pertussis Infections Using Internet Search Queries , 2017, Scientific Reports.

[129]  N. Bragazzi,et al.  Monitoring public interest toward pertussis outbreaks: an extensive Google Trends-based analysis. , 2018, Public health.

[130]  Víctor M. Prieto,et al.  Twitter: A Good Place to Detect Health Conditions , 2014, PloS one.

[131]  Mehdi Jalalpour,et al.  Google Flu Trends Spatial Variability Validated Against Emergency Department Influenza-Related Visits , 2016, Journal of medical Internet research.

[132]  Yongqing Nan,et al.  A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends , 2018, PloS one.

[133]  Elad Yom-Tov,et al.  Detecting Disease Outbreaks in Mass Gatherings Using Internet Data Monitoring , 2015 .

[134]  Aron Culotta,et al.  Towards detecting influenza epidemics by analyzing Twitter messages , 2010, SOMA '10.

[135]  Evan Dennison Livelo,et al.  Intelligent Dengue Infoveillance Using Gated Recurrent Neural Learning and Cross-Label Frequencies , 2018, 2018 IEEE International Conference on Agents (ICA).

[136]  Toomas Timpka,et al.  Importance of Internet Surveillance in Public Health Emergency Control and Prevention: Evidence From a Digital Epidemiologic Study During Avian Influenza A H7N9 Outbreaks , 2014, Journal of medical Internet research.

[137]  Shilu Tong,et al.  Using internet search queries for infectious disease surveillance: screening diseases for suitability , 2014, BMC Infectious Diseases.

[138]  John S. Brownstein,et al.  Forecasting Zika Incidence in the 2016 Latin America Outbreak Combining Traditional Disease Surveillance with Search, Social Media, and News Report Data , 2017, PLoS neglected tropical diseases.

[139]  Nicola Luigi Bragazzi,et al.  Infodemiology and Infoveillance of Multiple Sclerosis in Italy , 2013, Multiple sclerosis international.

[140]  Brian H. Spitzberg,et al.  The Complex Relationship of Realspace Events and Messages in Cyberspace: Case Study of Influenza and Pertussis Using Tweets , 2013, Journal of medical Internet research.

[141]  Kwok-Leung Tsui,et al.  Forecasting influenza in Hong Kong with Google search queries and statistical model fusion , 2017, PloS one.

[142]  A. Hulth,et al.  Web Queries as a Source for Syndromic Surveillance , 2009, PloS one.

[143]  Gunther Eysenbach,et al.  Infodemiology and infoveillance tracking online health information and cyberbehavior for public health. , 2011, American journal of preventive medicine.

[144]  Jiann-Liang Chen,et al.  Flu trend prediction based on massive data analysis , 2018, 2018 IEEE 3rd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).

[145]  Yi Hao,et al.  Chinese Social Media Reaction to Information about 42 Notifiable Infectious Diseases , 2015, PloS one.

[146]  M. Noll-Hussong,et al.  Whiplash Syndrome Reloaded: Digital Echoes of Whiplash Syndrome in the European Internet Search Engine Context , 2017, JMIR public health and surveillance.

[147]  Anette Hulth,et al.  Detecting the Norovirus Season in Sweden Using Search Engine Data – Meeting the Needs of Hospital Infection Control Teams , 2014, PloS one.

[148]  S. Rutherford,et al.  Using Google Trends for Influenza Surveillance in South China , 2013, PloS one.

[149]  Rick Siow Mong Goh,et al.  Leveraging social networking sites for disease surveillance and public sensing: the case of the 2013 avian influenza A(H7N9) outbreak in China. , 2015, Western Pacific surveillance and response journal : WPSAR.

[150]  Marco Roccetti,et al.  Inside chronic autoimmune disease communities: A social networks perspective to Crohn's patient behavior and medical information , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[151]  H. Sueki,et al.  The association of suicide-related Twitter use with suicidal behaviour: a cross-sectional study of young internet users in Japan. , 2015, Journal of affective disorders.

[152]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[153]  Ireneus Kagashe,et al.  Enhancing Seasonal Influenza Surveillance: Topic Analysis of Widely Used Medicinal Drugs Using Twitter Data , 2017, Journal of medical Internet research.

[154]  Gregory J. Park,et al.  Psychological Language on Twitter Predicts County-Level Heart Disease Mortality , 2015, Psychological science.

[155]  S. Willard,et al.  Internet search trends analysis tools can provide real-time data on kidney stone disease in the United States. , 2013, Urology.

[156]  Zion Tsz Ho Tse,et al.  Twitter and Middle East respiratory syndrome, South Korea, 2015: A multi-lingual study , 2018, Infection, Disease & Health.

[157]  Benyuan Liu,et al.  Predicting Flu Trends using Twitter data , 2011, 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[158]  Daniela Perrotta,et al.  Forecasting Seasonal Influenza Fusing Digital Indicators and a Mechanistic Disease Model , 2017, WWW.

[159]  R. Guha,et al.  What are we ‘tweeting’ about obesity? Mapping tweets with topic modeling and Geographic Information System , 2013, Cartography and geographic information science.

[160]  Anand Krishnan,et al.  Finding the Patient’s Voice Using Big Data: Analysis of Users’ Health-Related Concerns in the ChaCha Question-and-Answer Service (2009–2012) , 2016, Journal of medical Internet research.

[161]  A. Flahault,et al.  More Diseases Tracked by Using Google Trends , 2009, Emerging infectious diseases.

[162]  Michael J. Paul,et al.  Using Social Media to Perform Local Influenza Surveillance in an Inner-City Hospital: A Retrospective Observational Study , 2015, JMIR public health and surveillance.

[163]  L Charles Bailey,et al.  Relationship Between State-Level Google Online Search Volume and Cancer Incidence in the United States: Retrospective Study , 2018, Journal of medical Internet research.

[164]  Ming-Hsiang Tsou,et al.  Applying GIS and Machine Learning Methods to Twitter Data for Multiscale Surveillance of Influenza , 2016, PloS one.

[165]  L. Madoff ProMED-mail: an early warning system for emerging diseases. , 2004, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[166]  Tao Liu,et al.  Early detection of an epidemic erythromelalgia outbreak using Baidu search data , 2015, Scientific Reports.

[167]  Pengzhu Zhang,et al.  Health-Related Hot Topic Detection in Online Communities Using Text Clustering , 2013, PloS one.

[168]  Wenli Zhang,et al.  Predicting Asthma-Related Emergency Department Visits Using Big Data , 2015, IEEE Journal of Biomedical and Health Informatics.

[169]  Paola Velardi,et al.  Influenza-Like Illness Surveillance on Twitter through Automated Learning of Naïve Language , 2013, PloS one.

[170]  Naren Ramakrishnan,et al.  Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks , 2016, Scientific Reports.

[171]  Eugene Agichtein,et al.  Did You Really Just Have a Heart Attack?: Towards Robust Detection of Personal Health Mentions in Social Media , 2018, WWW.

[172]  Soo-Yong Shin,et al.  Cumulative Query Method for Influenza Surveillance Using Search Engine Data , 2014, Journal of medical Internet research.

[173]  Kaoru Sezaki,et al.  A robust and scalable framework for detecting self-reported illness from twitter , 2012, 2012 IEEE 14th International Conference on e-Health Networking, Applications and Services (Healthcom).

[174]  Hsinchun Chen,et al.  Text mining self‐disclosing health information for public health service , 2014, J. Assoc. Inf. Sci. Technol..

[175]  N. Deutsch,et al.  How To Do It. , 2022, California state journal of medicine.

[176]  Nicola Luigi Bragazzi,et al.  Harnessing Big Data for Communicable Tropical and Sub-Tropical Disorders: Implications From a Systematic Review of the Literature , 2018, Front. Public Health.

[177]  Sean D. Young,et al.  Using Search Engine Data as a Tool to Predict Syphilis , 2018, Epidemiology.

[178]  Yukiko Kawai,et al.  Twitter-Based Influenza Detection After Flu Peak via Tweets With Indirect Information: Text Mining Study , 2018, JMIR public health and surveillance.

[179]  Y. Yasui,et al.  Google Flu Trends in Canada: a comparison of digital disease surveillance data with physician consultations and respiratory virus surveillance data, 2010–2014 , 2015, Epidemiology and Infection.

[180]  Richard O. Sinnott,et al.  Identification and Validation of Real-Time Health Events through Social Media , 2015, 2015 IEEE International Conference on Data Science and Data Intensive Systems.

[181]  F. Bazzoli,et al.  Attitudes of Crohn’s Disease Patients: Infodemiology Case Study and Sentiment Analysis of Facebook and Twitter Posts , 2017, JMIR public health and surveillance.

[182]  Hannah G Dahlen,et al.  A search for hope and understanding: an analysis of threatened miscarriage internet forums. , 2014, Midwifery.

[183]  Nadir Weibel,et al.  Analyzing social media to characterize local HIV at-risk populations , 2015, Wireless Health.

[184]  Naren Ramakrishnan,et al.  Characterizing Diseases from Unstructured Text: A Vocabulary Driven Word2vec Approach , 2016, CIKM.

[185]  Sasikiran Kandula,et al.  Subregional Nowcasts of Seasonal Influenza Using Search Trends , 2017, Journal of medical Internet research.

[186]  J. Brownstein,et al.  Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. , 2012, The American journal of tropical medicine and hygiene.

[187]  Daniela Amicizia,et al.  Assessing Ebola-related web search behaviour: insights and implications from an analytical study of Google Trends-based query volumes , 2015 .

[188]  Ingemar J. Cox,et al.  Multi-Task Learning Improves Disease Models from Web Search , 2018, WWW.

[189]  Paola Velardi,et al.  Twitter mining for fine-grained syndromic surveillance , 2014, Artif. Intell. Medicine.

[190]  A. Dugas,et al.  Google Flu Trends: correlation with emergency department influenza rates and crowding metrics. , 2011, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.