Boolean versus ranked querying for biomedical systematic reviews

BackgroundThe process of constructing a systematic review, a document that compiles the published evidence pertaining to a specified medical topic, is intensely time-consuming, often taking a team of researchers over a year, with the identification of relevant published research comprising a substantial portion of the effort. The standard paradigm for this information-seeking task is to use Boolean search; however, this leaves the user(s) the requirement of examining every returned result. Further, our experience is that effective Boolean queries for this specific task are extremely difficult to formulate and typically require multiple iterations of refinement before being finalized.MethodsWe explore the effectiveness of using ranked retrieval as compared to Boolean querying for the purpose of constructing a systematic review. We conduct a series of experiments involving ranked retrieval, using queries defined methodologically, in an effort to understand the practicalities of incorporating ranked retrieval into the systematic search task.ResultsOur results show that ranked retrieval by itself is not viable for this search task requiring high recall. However, we describe a refinement of the standard Boolean search process and show that ranking within a Boolean result set can improve the overall search performance by providing early indication of the quality of the results, thereby speeding up the iterative query-refinement process.ConclusionsOutcomes of experiments suggest that an interactive query-development process using a hybrid ranked and Boolean retrieval system has the potential for significant time-savings over the current search process in the systematic reviewing.

[1]  Alistair Moffat,et al.  Against recall: is it persistence, cardinality, density, coverage, or totality? , 2009, SIGF.

[2]  K. A. McKibbon,et al.  Optimal search strategies for retrieving scientifically strong studies of treatment from Medline: analytical survey , 2005, BMJ : British Medical Journal.

[3]  Jun'ichi Tsujii,et al.  Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data , 2005, HLT.

[4]  Ian H. Witten,et al.  Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .

[5]  H. Handoll,et al.  Lessons for search strategies from a systematic review, in The Cochrane Library, of nutritional supplementation trials in patients after hip fracture. , 2001, The American journal of clinical nutrition.

[6]  William R. Hersh,et al.  Reducing workload in systematic review preparation using automated citation classification. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[7]  Edward A. Fox,et al.  Research Contributions , 2014 .

[8]  William R. Hersh,et al.  TREC GENOMICS Track Overview , 2003, TREC.

[9]  JUSTIN ZOBEL,et al.  Inverted files for text search engines , 2006, CSUR.

[10]  Elmer V. Bernstam,et al.  A day in the life of PubMed: analysis of a typical day's query log. , 2007, Journal of the American Medical Informatics Association : JAMIA.

[11]  Li Zhang,et al.  Optimizing search strategies to identify randomized controlled trials in MEDLINE , 2006, BMC medical research methodology.

[12]  Robert B. Allen,et al.  Using UMLS-based Re-Weighting Terms as a Query Expansion Strategy , 2006, 2006 IEEE International Conference on Granular Computing.

[13]  J. McGowan,et al.  Errors in search strategies were identified by type and frequency. , 2006, Journal of clinical epidemiology.

[14]  J. McGowan,et al.  Systematic reviews need systematic searchers. , 2005, Journal of the Medical Library Association : JMLA.

[15]  Michele Tarsilla Cochrane Handbook for Systematic Reviews of Interventions , 2010, Journal of MultiDisciplinary Evaluation.

[16]  Adriana Yoshii,et al.  Analysis of the reporting of search strategies in Cochrane systematic reviews. , 2009, Journal of the Medical Library Association : JMLA.

[17]  Zhiyong Lu,et al.  Evaluation of query expansion using MeSH in PubMed , 2009, Information Retrieval.

[18]  Alistair Moffat,et al.  Has adhoc retrieval improved since 1994? , 2009, SIGIR.

[19]  William S. Cooper,et al.  Getting beyond Boole , 1988, Inf. Process. Manag..

[20]  Jacques Savoy,et al.  Searching in Medline: Query expansion and manual indexing evaluation , 2008, Inf. Process. Manag..

[21]  Andrew Turpin,et al.  Challenging conventional assumptions of automated information retrieval with real users: Boolean searching and batch retrieval evaluations , 2001, Inf. Process. Manag..

[22]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[23]  Alistair Moffat,et al.  Rank-biased precision for measurement of retrieval effectiveness , 2008, TOIS.

[24]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[25]  Andrew Trotman,et al.  Sound and complete relevance assessment for XML retrieval , 2008, TOIS.

[26]  D. Moher,et al.  Guides for reading and interpreting systematic reviews: I. Getting started. , 1998, Archives of pediatrics & adolescent medicine.

[27]  David Moher,et al.  No consensus exists on search reporting methods for systematic reviews. , 2008, Journal of clinical epidemiology.

[28]  D. Moher,et al.  An alternative to the hand searching gold standard: validating methodological search filters using relative recall , 2006, BMC medical research methodology.

[29]  Stephen E. Robertson,et al.  Okapi at TREC-6 Automatic ad hoc, VLC, routing, filtering and QSDR , 1997, TREC.

[30]  Carol Lefebvre,et al.  How to identify randomized controlled trials in MEDLINE: ten years on. , 2006, Journal of the Medical Library Association : JMLA.

[31]  Howard R. Turtle Natural language vs. Boolean query evaluation: a comparison of retrieval performance , 1994, SIGIR '94.

[32]  S. Golder,et al.  Poor reporting and inadequate searches were apparent in systematic reviews of adverse effects. , 2008, Journal of clinical epidemiology.

[33]  K. Dickersin,et al.  Systematic Reviews: Identifying relevant studies for systematic reviews , 1994 .