Google Trends in Infodemiology and Infoveillance: Methodology Framework

Internet data are increasingly integrated into health informatics research and are becoming a useful tool for exploring human behavior. The most popular tool for examining online behavior is Google Trends, an open tool that provides information on trends and variations in online interest in selected keywords and topics over time. Online search traffic data from Google have been shown to be useful in analyzing human behavior toward health topics and in predicting disease occurrence and outbreaks. Despite the large number of Google Trends studies published during the last decade, the literature lacks a specific methodology framework. This article aims to provide an overview of the tool and its data and to present the first methodology framework for using Google Trends in infodemiology and infoveillance, including the main factors that need to be taken into account for a strong methodological basis. We provide a step-by-step guide to the methodology that should be followed when using Google Trends and to the essential aspects required for valid results in this line of research. First, an overview of the tool and the data is presented, followed by an analysis of the key methodological points for ensuring the validity of the results: selecting the appropriate keyword(s), region(s), period, and category. Overall, this article presents and analyzes the key points that need to be considered to achieve a strong methodological basis for using Google Trends data, which is crucial for ensuring the value and validity of the results, as the analysis of online queries is extensively integrated into health research in the big data era.
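The four selection choices named above (keyword, region, period, and category) can be recorded explicitly before any data are retrieved, so that a study's query is reproducible. The following is a minimal sketch under stated assumptions, not the authors' implementation: the class name `TrendsQuerySpec`, its fields, and the `summary` method are hypothetical, and the two limits it checks (data beginning in January 2004 and at most five terms compared per request) are assumptions about the tool that the abstract itself does not state. It does not call Google Trends.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional, Tuple

# Assumptions, not taken from the abstract: Trends data begin on 2004-01-01
# and at most five terms can be compared in a single request.
EARLIEST_DATE = date(2004, 1, 1)
MAX_TERMS = 5


@dataclass(frozen=True)
class TrendsQuerySpec:
    """Hypothetical record of the four selection choices for one query."""

    keywords: Tuple[str, ...]
    regions: Tuple[str, ...]
    start: date
    end: date
    category: Optional[str] = None  # None is treated as "all categories"

    def __post_init__(self) -> None:
        # Reject specifications that could not describe a valid query.
        if not self.keywords or any(not k.strip() for k in self.keywords):
            raise ValueError("at least one non-empty keyword is required")
        if len(self.keywords) > MAX_TERMS:
            raise ValueError(f"at most {MAX_TERMS} keywords per query")
        if not self.regions:
            raise ValueError("at least one region is required")
        if self.start < EARLIEST_DATE:
            raise ValueError(f"period cannot start before {EARLIEST_DATE}")
        if self.start >= self.end:
            raise ValueError("period start must precede period end")

    def summary(self) -> str:
        """Return a one-line description suitable for a methods section."""
        return " | ".join([
            "keywords=" + ", ".join(self.keywords),
            "regions=" + ", ".join(self.regions),
            f"period={self.start.isoformat()}..{self.end.isoformat()}",
            "category=" + (self.category or "all"),
        ])


spec = TrendsQuerySpec(
    keywords=("asthma",),
    regions=("US",),
    start=date(2004, 1, 1),
    end=date(2018, 12, 31),
    category="Health",
)
print(spec.summary())
```

Validating the specification up front means an invalid period or an empty keyword list is caught before any retrieval step, and the summary line can be reported alongside the results.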
