Captions and biases in diagnostic search

People frequently turn to the Web with the goal of diagnosing medical symptoms. Studies have shown that diagnostic search can often lead to anxiety about the possibility that symptoms are explained by the presence of rare, serious medical disorders, rather than far more common benign syndromes. We study the influence of the appearance of potentially-alarming content, such as severe illnesses or serious treatment options associated with the queried for symptoms, in captions comprising titles, snippets, and URLs. We explore whether users are drawn to results with potentially-alarming caption content, and if so, the implications of such attraction for the design of search engines. We specifically study the influence of the content of search result captions shown in response to symptom searches on search-result click-through behavior. We show that users are significantly more likely to examine and click on captions containing potentially-alarming medical terminology such as “heart attack” or “medical emergency” independent of result rank position and well-known positional biases in users' search examination behaviors. The findings provide insights about the possible effects of displaying implicit correlates of searchers' goals in search-result captions, such as unexpressed concerns and fears. As an illustration of the potential utility of these results, we developed and evaluated an enhanced click prediction model that incorporates potentially-alarming caption features and show that it significantly outperforms models that ignore caption content. Beyond providing additional understanding of the effects of Web content on medical concerns, the methods and findings have implications for search engine design. As part of our discussion on the implications of this research, we propose procedures for generating more representative captions that may be less likely to cause alarm, as well as methods for learning to more appropriately rank search results from logged search behavior, for examples, by also considering the presence of potentially-alarming content in the captions that motivate observed clicks and down-weighting clicks seemingly driven by searchers' health anxieties.

[1]  Zhihua Zhang,et al.  Learning click models via probit bayesian inference , 2010, CIKM.

[2]  Filip Radlinski,et al.  Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[3]  Steven Taylor,et al.  Understanding and treating health anxiety: A cognitive-behavioral approach , 2004 .

[4]  Charles L. A. Clarke,et al.  The influence of caption features on clickthrough patterns in web search , 2007, SIGIR.

[5]  Christian Köhler,et al.  How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews , 2002, BMJ : British Medical Journal.

[6]  J. Kronenfeld,et al.  Chronic illness and health-seeking information on the Internet , 2007, Health.

[7]  Ryen W. White,et al.  Large-scale analysis of individual and task differences in search result page examination strategies , 2012, WSDM '12.

[8]  Abbe Mowshowitz,et al.  Bias on the web , 2002, CACM.

[9]  Susan T. Dumais,et al.  Learning user interaction models for predicting web search result preferences , 2006, SIGIR.

[10]  Junghoo Cho,et al.  Impact of search engines on page popularity , 2004, WWW '04.

[11]  Suresh K. Bhavnani,et al.  Exploring the distribution of online healthcare information , 2003, CHI Extended Abstracts.

[12]  Ryen W. White,et al.  Studies of the onset and persistence of medical concerns in search logs , 2012, SIGIR '12.

[13]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[14]  Steven Taylor,et al.  Health Anxiety : clinical and research perspectives on hypochondriasis and related conditions , 2001 .

[15]  Susan T. Dumais,et al.  The good, the bad, and the random: an eye-tracking study of ad quality in web search , 2010, SIGIR.

[16]  Ciya Liao,et al.  A model to estimate intrinsic document relevance from the clickthrough logs of a web search engine , 2010, WSDM '10.

[17]  Benjamin Piwowarski,et al.  A user browsing model to predict search engine click data from past observations. , 2008, SIGIR '08.

[18]  Nina Mishra,et al.  Domain bias in web search , 2012, WSDM '12.

[19]  Abbe Mowshowitz,et al.  Assessing bias in search engines , 2002, Inf. Process. Manag..

[20]  Peter S. Hussey,et al.  Comparing Health System Performance In OECD Countries , 2001 .

[21]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[22]  Edward Cutrell,et al.  What are you looking for?: an eye-tracking study of information usage in web search , 2007, CHI.

[23]  Yin Yang,et al.  A study of medical and health queries to web search engines. , 2004, Health information and libraries journal.

[24]  Rukmini Iyer,et al.  Data-driven text features for sponsored search click prediction , 2009, KDD Workshop on Data Mining and Audience Intelligence for Advertising.

[25]  Xiaolong Li,et al.  Inferring search behaviors using partially observable Markov (POM) model , 2010, WSDM '10.

[26]  Ryen W. White,et al.  Finding relevant documents using top ranking sentences: an evaluation of two alternative schemes , 2002, SIGIR '02.

[27]  Marc-Allen Cartright,et al.  Intentions and attention in exploratory health search , 2011, SIGIR.

[28]  Susan T. Dumais,et al.  Evaluating implicit measures to improve the search experiences , 2003 .

[29]  Ryen W. White,et al.  Cyberchondria: Studies of the escalation of medical concerns in Web search , 2009, TOIS.

[30]  Ryen W. White,et al.  Characterizing the influence of domain expertise on web search behavior , 2009, WSDM '09.

[31]  Ryen W. White,et al.  Predicting escalations of medical queries based on web page structure and content , 2010, SIGIR '10.

[32]  Steven Taylor,et al.  Treating Health Anxiety: A Cognitive-Behavioral Approach , 2004 .

[33]  Susan L. Gerhart,et al.  Do Web search engines suppress controversy? , 2004, First Monday.

[34]  Ryen W. White,et al.  Leaving so soon?: understanding and predicting web search abandonment rationales , 2012, CIKM.

[35]  Chao Liu,et al.  Click chain model in web search , 2009, WWW '09.

[36]  Ryen W. White,et al.  No clicks, no problem: using cursor movements to understand and improve search , 2011, CHI.

[37]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[38]  Gary Hsieh,et al.  The presentation of health-related search results and its impact on negative emotional outcomes , 2013, CHI.

[39]  Yisong Yue,et al.  Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data , 2010, WWW '10.

[40]  Filip Radlinski,et al.  Minimally Invasive Randomization for Collecting Unbiased Preferences from Clickthrough Logs , 2006, AAAI 2006.

[41]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[42]  Ryoji Kataoka,et al.  Query-page intention matching using clicked titles and snippets to boost search rankings , 2009, JCDL '09.

[43]  Kerry Rodden,et al.  Eye-mouse coordination patterns on web search results pages , 2008, CHI Extended Abstracts.

[44]  Chao Liu,et al.  Efficient multiple-click models in web search , 2009, WSDM '09.

[45]  Ryen W. White,et al.  Why searchers switch: understanding and predicting engine switching rationales , 2011, SIGIR.

[46]  Shahm Martini,et al.  Treating Health Anxiety: A Cognitive-Behavioral Approach , 2005 .

[47]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[48]  Peter R. Harris,et al.  How do patients evaluate and make use of online health information? , 2007, Social science & medicine.

[49]  Meredith Ringel Morris,et al.  Augmenting web pages and search results to support credibility assessment , 2011, CHI.

[50]  Meredith Ringel Morris,et al.  Augmenting Web Pages and Search Results to Help People Find Trustworthy Information Online , 2011 .

[51]  Ryen W. White,et al.  Text selections as implicit relevance feedback , 2012, SIGIR '12.

[52]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[53]  Mark Sanderson,et al.  Advantages of query biased summaries in information retrieval , 1998, SIGIR '98.

[54]  Ingmar Weber,et al.  The demographics of web search , 2010, SIGIR.

[55]  A Vespignani,et al.  Topical interests and the mitigation of search engine bias , 2006, Proceedings of the National Academy of Sciences.

[56]  Eugene Agichtein,et al.  Ready to buy or just browsing?: detecting web searcher goals from interaction data , 2010, SIGIR.

[57]  Yi Chang,et al.  Modeling Perceived Relevance for Tail Queries without Click-Through Data , 2011, ArXiv.

[58]  Ryen W. White,et al.  Experiences with Web Search on Medical Concerns and Self Diagnosis , 2009, AMIA.

[59]  Eric Goldman,et al.  Search Engine Bias and the Demise of Search Engine Utopianism , 2006 .

[60]  Eric Brill,et al.  Improving web search ranking by incorporating user behavior information , 2006, SIGIR.

[61]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[62]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.