Analysis of Online Information Searching for Cardiovascular Diseases on a Consumer Health Information Portal

Since the early 2000's, Internet usage for health information searching has increased significantly. Studying search queries can help us to understand users "information need" and how do they formulate search queries ("expression of information need"). Although cardiovascular diseases (CVD) affect a large percentage of the population, few studies have investigated how and what users search for CVD. We address this knowledge gap in the community by analyzing a large corpus of 10 million CVD related search queries from MayoClinic.com. Using UMLS MetaMap and UMLS semantic types/concepts, we developed a rule-based approach to categorize the queries into 14 health categories. We analyzed structural properties, types (keyword-based/Wh-questions/Yes-No questions) and linguistic structure of the queries. Our results show that the most searched health categories are 'Diseases/Conditions', 'Vital-Sings', 'Symptoms' and 'Living-with'. CVD queries are longer and are predominantly keyword-based. This study extends our knowledge about online health information searching and provides useful insights for Web search engines and health websites.

[1]  Yin Yang,et al.  A study of medical and health queries to web search engines. , 2004, Health information and libraries journal.

[2]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[3]  Margaret M. Barry,et al.  A literature review on health information-seeking behaviour on the web: a health consumer and health , 2011 .

[4]  S. Cotten,et al.  THE ASSOCIATION AMONG GENDER, COMPUTER USE AND ONLINE HEALTH SEARCHING, AND MENTAL HEALTH , 2008 .

[5]  Nigel H. Lovell,et al.  Using information technology to improve the management of chronic disease , 2003, The Medical journal of Australia.

[6]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[7]  J. Bernhardt,et al.  Health information-seeking behaviors, health indicators, and health risks. , 2010, American journal of public health.

[8]  Elmer V. Bernstam,et al.  A day in the life of PubMed: analysis of a typical day's query log. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[9]  Ryen W. White,et al.  Cyberchondria: Studies of the escalation of medical concerns in Web search , 2009, TOIS.

[10]  Sandra L Saperstein,et al.  Using the Internet for Health-Related Activities: Findings From a National Probability Sample , 2009, Journal of medical Internet research.

[11]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[12]  S. Fortmann,et al.  Socioeconomic status and health: how education, income, and occupation contribute to risk factors for cardiovascular disease. , 1992, American journal of public health.

[13]  Ryen W. White,et al.  From health search to healthcare: explorations of intention and utilization via query logs and user surveys , 2014, J. Am. Medical Informatics Assoc..

[14]  J. Kronenfeld,et al.  Chronic illness and health-seeking information on the Internet , 2007, Health.

[15]  Stephen Wu,et al.  Comparative Analysis of Online Health Queries Originating From Personal Computers and Smart Devices on a Consumer Health Information Portal , 2014, Journal of medical Internet research.

[16]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[17]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[18]  Marc-Allen Cartright,et al.  Intentions and attention in exploratory health search , 2011, SIGIR.

[19]  K. Lorig,et al.  Internet-Based Chronic Disease Self-Management: A Randomized Trial , 2006, Medical care.

[20]  Michael A. Zarro,et al.  A study of user queries leading to a health information website: AfterTheInjury.org , 2011, iConference.