A2A: a platform for research in biomedical literature search

Background Finding relevant literature is crucial for many biomedical research activities and in the practice of evidence-based medicine. Search engines such as PubMed provide a means to search and retrieve published literature, given a query. However, they are limited in how users can control the processing of queries and articles—or as we call them documents—by the search engine. To give this control to both biomedical researchers and computer scientists working in biomedical information retrieval, we introduce a public online tool for searching over biomedical literature. Our setup is guided by the NIST setup of the relevant TREC evaluation tasks in genomics, clinical decision support, and precision medicine. Results To provide benchmark results for some of the most common biomedical information retrieval strategies, such as querying MeSH subject headings with a specific weight or querying over the title of the articles only, we present our evaluations on public datasets. Our experiments report well-known information retrieval metrics such as precision at a cutoff of ranked documents. Conclusions We introduce the A2A search and benchmarking tool which is publicly available for the researchers who want to explore different search strategies over published biomedical literature. We outline several query formulation strategies and present their evaluations with known human judgements for a large pool of topics, from genomics to precision medicine.

[1]  Ellen M. Voorhees The TREC Medical Records Track , 2013, BCB.

[2]  Sarvnaz Karimi,et al.  An Experimentation Platform for Precision Medicine , 2019, SIGIR.

[3]  Alexander Kotov,et al.  Optimization Method for Weighting Explicit and Latent Concepts in Clinical Decision Support Queries , 2016, ICTIR.

[4]  Giorgio Maria Di Nunzio,et al.  The University of Padua IMS Research Group at TREC 2018 Precision Medicine Track , 2018, TREC.

[5]  Craig MacDonald,et al.  Terrier Information Retrieval Platform , 2005, ECIR.

[6]  Guido Zuccon,et al.  Generating Clinical Queries from Patient Narratives: A Comparison between Machines and Humans , 2017, SIGIR.

[7]  Erik Faessler,et al.  JULIE Lab & Med Uni Graz @ TREC 2019 Precision Medicine Track , 2019, TREC.

[8]  Ellen M. Voorhees,et al.  Overview of the TREC 2020 Precision Medicine Track , 2017, TREC.

[9]  Kirk Roberts,et al.  TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19 , 2020, J. Am. Medical Informatics Assoc..

[10]  Jimmy J. Lin,et al.  Anserini: Enabling the Use of Lucene for Information Retrieval Research , 2017, SIGIR.

[11]  Alistair Moffat,et al.  Improvements that don't add up: ad-hoc retrieval results since 1998 , 2009, CIKM.

[12]  Ben Carterette,et al.  Information retrieval evaluation using test collections , 2016, Information Retrieval Journal.

[13]  Falk Scholer,et al.  Quantifying the impact of concept recognition on biomedical information retrieval , 2012, Inf. Process. Manag..

[14]  Gerald J. Kowalski,et al.  Information Retrieval Systems , 1997, The Information Retrieval Series.

[15]  Ellen M. Voorhees,et al.  TREC genomics special issue overview , 2009, Information Retrieval.

[16]  Czeslaw Jedrzejek,et al.  Baseline and extensions approach to information retrieval of complex medical data: Poznan's approach to the bioCADDIE 2016 , 2018, Database J. Biol. Databases Curation.

[17]  Marc-Allen Cartright,et al.  Galago: A Modular Distributed Processing and Retrieval System , 2012, OSIR@SIGIR.

[18]  Luca Toldo,et al.  Semi-Supervised Information Retrieval System for Clinical Decision Support , 2016, TREC.

[19]  William R. Hersh,et al.  Phrases, Boosting, and Query Expansion Using External Knowledge Resources for Genomic Information Retrieval , 2003, TREC.

[20]  Falk Scholer,et al.  A2A: Benchmark Your Clinical Decision Support Search , 2018, SIGIR.

[21]  Shoubin Dong,et al.  SCUT-CCNL at TREC 2019 Precision Medicine Track , 2019, TREC.

[22]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[23]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[24]  Stephen Wan,et al.  CSIRO at 2019 TREC Precision Medicine Track , 2019, TREC.

[25]  Emine Yilmaz,et al.  A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[26]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[27]  Jimmy J. Lin,et al.  Critically Examining the "Neural Hype": Weak Baselines and the Additivity of Effectiveness Gains from Neural Ranking Models , 2019, SIGIR.

[28]  Julio Gonzalo,et al.  EvALL: Open Access Evaluation for Information Access Systems , 2017, SIGIR.

[29]  Ellen M. Voorhees,et al.  Overview of the TREC 2014 Clinical Decision Support Track , 2014, TREC.

[30]  Giorgio Maria Di Nunzio,et al.  An Analysis of Query Reformulation Techniques for Precision Medicine , 2019, SIGIR.

[31]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 1 , 2000, Inf. Process. Manag..