Content Bias in Online Health Search

Search engines help people answer consequential questions. Biases in retrieved and indexed content (e.g., skew toward erroneous outcomes that represent deviations from reality), coupled with searchers' biases in how they examine and interpret search results, can lead people to incorrect answers. In this article, we seek to better understand biases in search and retrieval, and in particular those affecting the accuracy of content in search results, including the search engine index, features used for ranking, and the formulation of search queries. Focusing on the important domain of online health search, this research broadens previous work on biases in search to examine the role of search systems in contributing to biases. To assess bias, we focus on questions about medical interventions and employ reliable ground truth data from authoritative medical sources. In the course of our study, we utilize large-scale log analysis using data from a popular Web search engine, deep probes of result lists on that search engine, and crowdsourced human judgments of search result captions and landing pages. Our findings reveal bias in results, amplifying searchers' existing biases that appear evident in their search activity. We also highlight significant bias in indexed content and show that specific ranking signals and specific query terms support bias. Both of these can degrade result accuracy and increase skewness in search results. Our analysis has implications for bias mitigation strategies in online search systems, and we offer recommendations for search providers based on our findings.

[1]  Gerhard Weikum,et al.  Harmony and dissonance: organizing the people's voices on political controversies , 2012, WSDM '12.

[2]  Meredith Ringel Morris,et al.  Augmenting Web Pages and Search Results to Help People Find Trustworthy Information Online , 2011 .

[3]  Susan T. Dumais,et al.  Improving Web Search Ranking by Incorporating User Behavior Information , 2019, SIGIR Forum.

[4]  Daniele Fanelli,et al.  Negative results are disappearing from most disciplines and countries , 2011, Scientometrics.

[5]  Filip Radlinski,et al.  Large-scale validation and analysis of interleaved search evaluation , 2012, TOIS.

[6]  Ryen W. White,et al.  Web-scale pharmacovigilance: listening to signals from the crowd , 2013, J. Am. Medical Informatics Assoc..

[7]  Nina Mishra,et al.  Domain bias in web search , 2012, WSDM '12.

[8]  Cynna Selvy,et al.  Unified Medical Language System (UMLS) , 2015 .

[9]  Ke Wang,et al.  Bias and controversy: beyond the statistical deviation , 2006, KDD '06.

[10]  Abbe Mowshowitz,et al.  Assessing bias in search engines , 2002, Inf. Process. Manag..

[11]  Pierre Pluye,et al.  Shortcomings of health information on the Internet. , 2003, Health promotion international.

[12]  Joe Carthy,et al.  Exploring the effectiveness of medical entity recognition for clinical information retrieval , 2013, DTMBIO '13.

[13]  Ryen W. White,et al.  Predicting escalations of medical queries based on web page structure and content , 2010, SIGIR '10.

[14]  P. Easterbrook,et al.  Publication bias in clinical research , 1991, The Lancet.

[15]  Bernd Faust Information Retrieval A Health And Biomedical Perspective , 2016 .

[16]  Matthew J. Lindberg,et al.  Feeling validated versus being correct: a meta-analysis of selective exposure to information. , 2009, Psychological bulletin.

[17]  Suresh K. Bhavnani,et al.  Exploring the distribution of online healthcare information , 2003, CHI Extended Abstracts.

[18]  D. Sackett,et al.  Evidence based medicine: what it is and what it isn't , 1996, BMJ.

[19]  M C Weinstein,et al.  Gains in life expectancy from medical interventions--standardizing data on outcomes. , 1998, The New England journal of medicine.

[20]  A R Jadad,et al.  Methodology and reports of systematic reviews and meta-analyses: a comparison of Cochrane reviews with articles published in paper-based journals. , 1998, JAMA.

[21]  John D. Lafferty,et al.  Beyond independent relevance: methods and evaluation metrics for subtopic retrieval , 2003, SIGIR.

[22]  Lydia B. Chilton,et al.  Addressing people's information needs directly in a web search result page , 2011, WWW.

[23]  Ryen W. White,et al.  Mining the search trails of surfing crowds: identifying relevant websites from user activity , 2008, WWW.

[24]  Nick Craswell,et al.  An experimental comparison of click position-bias models , 2008, WSDM '08.

[25]  Yindalon Aphinyanagphongs,et al.  Text Categorization Models for Identifying Unproven Cancer Treatments on the Web , 2007, MedInfo.

[26]  Eli Pariser,et al.  The Filter Bubble: What the Internet Is Hiding from You , 2011 .

[27]  Peter Ingwersen,et al.  Polyrepresentation of information needs and semantic entities: elements of a cognitive theory for information retrieval interaction , 1994, SIGIR '94.

[28]  E. Horvitz,et al.  Toward Enhanced Pharmacovigilance Using Patient-Generated Data on the Internet , 2014, Clinical pharmacology and therapeutics.

[29]  Robert H. Baud,et al.  Health search engine with e-document analysis for reliable search results , 2006, Int. J. Medical Informatics.

[30]  A Vespignani,et al.  Topical interests and the mitigation of search engine bias , 2006, Proceedings of the National Academy of Sciences.

[31]  Ryen W. White,et al.  Experiences with Web Search on Medical Concerns and Self Diagnosis , 2009, AMIA.

[32]  Johann Gamper,et al.  MOUNA: mining opinions to unveil neglected arguments , 2012, CIKM '12.

[33]  Junghoo Cho,et al.  Impact of search engines on page popularity , 2004, WWW '04.

[34]  Filip Radlinski,et al.  Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[35]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[36]  Barbara M. Wildemuth,et al.  The effects of domain knowledge on search tactic formulation , 2004, J. Assoc. Inf. Sci. Technol..

[37]  Ryen W. White Beliefs and biases in web search , 2013, SIGIR.

[38]  John Mark Agosta,et al.  Highlighting disputed claims on the web , 2010, WWW '10.

[39]  Steve Fox,et al.  Evaluating implicit measures to improve web search , 2005, TOIS.

[40]  G. Smith,et al.  Bias in meta-analysis detected by a simple, graphical test , 1997, BMJ.

[41]  Chris Buckley,et al.  Pivoted Document Length Normalization , 1996, SIGIR Forum.

[42]  Nicholas J. Belkin,et al.  The effect multiple query representations on information retrieval system performance , 1993, SIGIR.

[43]  Michael S. Bernstein,et al.  Direct answers for search queries in the long tail , 2012, CHI.

[44]  R. J. Cline,et al.  Consumer health information seeking on the Internet: the state of the art. , 2001, Health education research.

[45]  Peter C Gøtzsche,et al.  Cochrane reviews compared with industry supported meta-analyses and other meta-analyses of the same drugs: systematic review , 2006, BMJ : British Medical Journal.

[46]  Shiri Dori-Hacohen,et al.  Detecting controversy on the web , 2013, CIKM.

[47]  S. Dumais,et al.  Promoting Civil Discourse Through Search Engine Diversity , 2014 .

[48]  Ryen W. White,et al.  From cookies to cooks: insights on dietary patterns via analysis of web usage logs , 2013, WWW.

[49]  Abbe Mowshowitz,et al.  Bias on the web , 2002, CACM.

[50]  Daniel Kahneman,et al.  Availability: A heuristic for judging frequency and probability , 1973 .

[51]  Charles B. Inlander Good Operations, Bad Operations: The People's Medical Society's Guide to Surgery , 1993 .

[52]  Michele Tarsilla Cochrane Handbook for Systematic Reviews of Interventions , 2010, Journal of MultiDisciplinary Evaluation.

[53]  Guido Zuccon,et al.  An evaluation of corpus-driven measures of medical concept similarity for information retrieval , 2012, CIKM '12.

[54]  E. Gabrilovich,et al.  Postmarket Drug Surveillance Without Trial Costs: Discovery of Adverse Drug Reactions Through Large-Scale Analysis of Web Search Queries , 2013, Journal of medical Internet research.

[55]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[56]  Yin Yang,et al.  A study of medical and health queries to web search engines. , 2004, Health information and libraries journal.

[57]  Jimmy J. Lin,et al.  Web question answering: is more always better? , 2002, SIGIR '02.

[58]  Charles L. A. Clarke,et al.  The influence of caption features on clickthrough patterns in web search , 2007, SIGIR.

[59]  Suresh K. Bhavnani,et al.  Domain-specific search strategies for the effective retrieval of healthcare and shopping information , 2002, CHI Extended Abstracts.

[60]  Linda A. Watson,et al.  Information Retrieval: A Health and Biomedical Perspective. , 2005 .

[61]  Craig MacDonald,et al.  Exploiting term dependence while handling negation in medical search , 2012, SIGIR '12.

[62]  Ravi Kumar,et al.  Search in the Lost Sense of "Query": Question Formulation in Web Search Queries and its Temporal Changes , 2011, ACL.

[63]  Elizabeth Sillence,et al.  Trust and mistrust of online health sites , 2004, CHI.

[64]  Eric Brill,et al.  Beyond PageRank: machine learning for static ranking , 2006, WWW '06.

[65]  David Hawking,et al.  Overview of the TREC 2003 Web Track , 2003, TREC.

[66]  Donna K. Harman,et al.  Overview of the Reliable Information Access Workshop , 2009, Information Retrieval.

[67]  Leif Azzopardi,et al.  Search engine predilection towards news media providers , 2009, SIGIR.

[68]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[69]  Eric Goldman,et al.  Search Engine Bias and the Demise of Search Engine Utopianism , 2006 .

[70]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[71]  Charles P. Friedman,et al.  Research Paper: Factors Associated with Success in Searching MEDLINE and Applying Evidence to Answer Clinical Questions , 2002, J. Am. Medical Informatics Assoc..

[72]  Anirban Dasgupta,et al.  The discoverability of the web , 2007, WWW '07.

[73]  Susan L. Gerhart Do Web search engines suppress controversy? , 2004, First Monday.

[74]  Christian Köhler,et al.  How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews , 2002, BMJ : British Medical Journal.

[75]  J. Kronenfeld,et al.  Chronic illness and health-seeking information on the Internet , 2007, Health.

[76]  Robert S. Taylor Question-Negotiation and Information Seeking in Libraries , 1968, Coll. Res. Libr..

[77]  Ryen W. White,et al.  Captions and biases in diagnostic search , 2013, TWEB.

[78]  Dan Roth,et al.  BiasTrust: teaching biased users about controversial topics , 2012, CIKM '12.

[79]  Leif Azzopardi,et al.  Retrievability: an evaluation measure for higher order information access tasks , 2008, CIKM '08.

[80]  Leif Azzopardi,et al.  Best and Fairest: An Empirical Analysis of Retrieval System Bias , 2014, ECIR.

[81]  Charles L. A. Clarke,et al.  The effect of document retrieval quality on factoid question answering performance , 2004, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[82]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[83]  Sandeep Pandey,et al.  WIC: A General-Purpose Algorithm for Monitoring Web Information Sources , 2004, VLDB.

[84]  Nicholas J. Belkin,et al.  Features of documents relevant to task- and fact- oriented questions , 2002, CIKM '02.

[85]  Johann Gamper,et al.  Diversifying search results of controversial queries , 2011, CIKM '11.

[86]  H. Simon Bounded Rationality and Organizational Learning , 1991 .

[87]  Ryen W. White,et al.  Web to world: predicting transitions from self-diagnosis to the pursuit of local medical assistance in web search. , 2010, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[88]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 2 , 2000, Inf. Process. Manag..

[89]  Doug Beeferman,et al.  Agglomerative clustering of a search engine query log , 2000, KDD '00.

[90]  David Hawking,et al.  Focused crawling for both topical relevance and quality of medical information , 2005, CIKM '05.

[91]  Yisong Yue,et al.  Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data , 2010, WWW '10.

[92]  Filip Radlinski,et al.  Minimally Invasive Randomization for Collecting Unbiased Preferences from Clickthrough Logs , 2006, AAAI 2006.

[93]  W. Hersh,et al.  Factors associated with successful answering of clinical questions using an information retrieval system. , 2002, Bulletin of the Medical Library Association.

[94]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[95]  Michael Collins,et al.  Answer Extraction , 2000, ANLP.

[96]  D. Pelleg,et al.  Patterns of Information-Seeking for Cancer on the Internet: An Analysis of Real World Data , 2012, PloS one.

[97]  W WhiteRyen,et al.  Content Bias in Online Health Search , 2014 .

[98]  W R Hersh,et al.  How well do physicians use electronic information retrieval systems? A framework for investigation and systematic review. , 1998, JAMA.

[99]  Fujian Song,et al.  Quality of Cochrane reviews , 2002, BMJ : British Medical Journal.

[100]  Paul K J Han,et al.  Communicating the Uncertainty of Harms and Benefits of Medical Interventions , 2007, Medical decision making : an international journal of the Society for Medical Decision Making.

[101]  Ian Ruthven,et al.  Re-examining the potential effectiveness of interactive query expansion , 2003, SIGIR.

[102]  Meredith Ringel Morris,et al.  Augmenting web pages and search results to support credibility assessment , 2011, CHI.

[103]  P. Wason On the Failure to Eliminate Hypotheses in a Conceptual Task , 1960 .

[104]  K. Dickersin The existence of publication bias and risk factors for its occurrence. , 1990, JAMA.

[105]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[106]  David Hawking,et al.  Buying bestsellers online: A case study in Search & Searchability , 2002, ADCS.

[107]  Dale J. Prediger,et al.  Coefficient Kappa: Some Uses, Misuses, and Alternatives , 1981 .

[108]  Ryen W. White,et al.  Cyberchondria: Studies of the escalation of medical concerns in Web search , 2009, TOIS.

[109]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[110]  Eli Pariser FILTER BUBBLE: Wie wir im Internet entmündigt werden , 2012 .

[111]  Gary Hsieh,et al.  The presentation of health-related search results and its impact on negative emotional outcomes , 2013, CHI.

[112]  Eugene Agichtein,et al.  Beyond dwell time: estimating document relevance from cursor movements and other post-click searcher behavior , 2012, WWW.