Extended Boolean retrieval for systematic biomedical reviews

Searching for relevant documents is a laborious task involved in preparing systematic reviews of biomedical literature. Currently, complex Boolean queries are iteratively developed, and then each document of the final query result is assessed for relevance. However, the result set sizes of these queries are hard to control, and in practice it is difficult to balance the competing desires to keep result sets to a manageable volume, and yet not exclude relevant documents from consideration. Ranking overcomes these problems by allowing the user to choose the number of documents to be inspected. However, previous work did not show significant improvements over the Boolean approach when ranked keyword queries based on terms in the Boolean queries, review title, research question or inclusion criteria were used. The extended Boolean retrieval model also provides ranked output, but existing complex Boolean queries can be directly used as formal description of the complex information needs occurring in this domain. In this paper we show that extended Boolean retrieval is able to find a larger quantity of relevant documents than previous approaches when comparable (or greater) numbers of documents are inspected for relevance.

[1]  C. Paice Soft evaluation of Boolean search queries in information retrieval systems , 1984 .

[2]  David Moher,et al.  No consensus exists on search reporting methods for systematic reviews. , 2008, Journal of clinical epidemiology.

[3]  Tadeusz Radecki,et al.  Fuzzy set theoretical approach to document retrieval , 1979, Inf. Process. Manag..

[4]  Michele Tarsilla Cochrane Handbook for Systematic Reviews of Interventions , 2010, Journal of MultiDisciplinary Evaluation.

[5]  William R Hersh,et al.  Enhancing access to the Bibliome: the TREC 2004 Genomics Track , 2006, Journal of biomedical discovery and collaboration.

[6]  William R. Hersh,et al.  Information Retrieval: A Health and Biomedical Perspective , 2002 .

[7]  F. Davidoff,et al.  Evidence based medicine. , 2006, BMJ.

[8]  Falk Scholer,et al.  The challenge of high recall in biomedical systematic search , 2009, DTMBIO.

[9]  Alistair Moffat,et al.  Rank-biased precision for measurement of retrieval effectiveness , 2008, TOIS.

[10]  C. Beahler,et al.  Information retrieval in systematic reviews: challenges in the public health arena. , 2000, American journal of preventive medicine.

[11]  W. Bruce Croft,et al.  Discovering key concepts in verbose queries , 2008, SIGIR '08.

[12]  Timothy Baldwin,et al.  Facilitating biomedical systematic reviews using ranked text retrieval and classification , 2008 .

[13]  K. Dickersin,et al.  Systematic Reviews: Identifying relevant studies for systematic reviews , 1994 .

[14]  Steve Renals,et al.  Proceedings of the Ninth Text REtrieval Conference , 2001 .

[15]  J. Lee Analyzing the Effectiveness of Extended Boolean Models in Information Retrieval , 1995 .

[16]  H. Handoll,et al.  Lessons for search strategies from a systematic review, in The Cochrane Library, of nutritional supplementation trials in patients after hip fracture. , 2001, The American journal of clinical nutrition.

[17]  William R. Hersh,et al.  Reducing workload in systematic review preparation using automated citation classification. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[18]  J. McGowan,et al.  Systematic reviews need systematic searchers. , 2005, Journal of the Medical Library Association : JMLA.

[19]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[20]  Li Zhang,et al.  Optimizing search strategies to identify randomized controlled trials in MEDLINE , 2006, BMC medical research methodology.

[21]  Edward A. Fox,et al.  Research Contributions , 2014 .

[22]  Donald H. Kraft,et al.  A mathematical model of a weighted boolean retrieval system , 1979, Inf. Process. Manag..

[23]  Vladimir G. Voiskunskii,et al.  Boolean Search: Current State and Perspectives , 1999, J. Am. Soc. Inf. Sci..

[24]  Alistair Moffat,et al.  Against recall: is it persistence, cardinality, density, coverage, or totality? , 2009, SIGF.

[25]  William Hersh,et al.  Comprar Information Retrieval: A Health And Biomedical Perspective | Hersh, William | 9780387787022 | Springer , 2009 .

[26]  Maria Elena Smith,et al.  Aspects of the P-Norm Model of Information Retrieval: Syntactic Query Generation, Efficiency, And Theoretical , 1990 .

[27]  Joon Ho Lee,et al.  Properties of extended Boolean models in information retrieval , 1994, SIGIR '94.

[28]  J. McGowan,et al.  Errors in search strategies were identified by type and frequency. , 2006, Journal of clinical epidemiology.

[29]  K. Shojania,et al.  How Quickly Do Systematic Reviews Go Out of Date? A Survival Analysis , 2007, Annals of Internal Medicine.

[30]  Stephen E. Robertson,et al.  Okapi at TREC-4 , 1995, TREC.