Improving Medical Cases Retrieval Using an Online Fact Database

This paper presents an approach for retrieval of medical cases using a novel query expansion method. The approach relies purely on the text data in the medical cases. The cases are indexed with Terrier IR search engine based on their text content including the caption of the figure contained within them. Furthermore, in the retrieval phase there is an input consisted of a long text query in a narrative form. The input query is expanded by using on-line fact databases, such as Freebase, with the aim that this will add more terms relevant to the concepts mentioned in the text. The goal is to provide a way of query expansion, so that the query is more defined, which should provide more narrowed and precise results in the retrieval. The retrieval is done with the BM25 weighting model. Our approach shows that expanding the input text query in this fashion can provide a boost in the retrieval performance.

[1]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[2]  C. Dye,et al.  Research for Universal Health Coverage , 2013, Science Translational Medicine.

[3]  Jeongeun Lee,et al.  SNUMedinfo at ImageCLEF 2013: Medical Retrieval Task , 2013, CLEF.

[4]  Adil Alpkocak,et al.  DEMIR at ImageCLEFMed 2012: Inter-modality and Intra-Modality Integrated Combination Retrieval , 2012 .

[5]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[6]  Roberto Navigli,et al.  An analysis of ontology-based query expansion strategies , 2003 .

[7]  Craig MacDonald,et al.  University of Glasgow at WebCLEF 2005: Experiments in per-field Normalisation and Language Specific Stemming , 2005, CLEF.

[8]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.

[9]  Fabio A. González,et al.  Bioingenium at ImageCLEF 2012: Text and Visual Indexing for Medical Images , 2012, CLEF.

[10]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[11]  João Magalhães,et al.  Multimodal medical information retrieval with unsupervised rank fusion , 2015, Comput. Medical Imaging Graph..

[12]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[13]  Ivan Kitanovski,et al.  FCSE at Medical Tasks of ImageCLEF 2013 , 2013, CLEF.

[14]  Daekeun You,et al.  ITI's Participation in the 2013 Medical Track of ImageCLEF , 2013, CLEF.

[15]  Praveen Paritosh,et al.  Freebase: a collaboratively created graph database for structuring human knowledge , 2008, SIGMOD Conference.

[16]  Henning Müller,et al.  Overview of the ImageCLEF 2013 Medical Tasks , 2013, CLEF.

[17]  Mounir Errami,et al.  eTBLAST: a web server to identify expert reviewers, appropriate journals and similar publications , 2007, Nucleic Acids Res..

[18]  Ivan Kitanovski,et al.  Multimodal Medical Image Retrieval , 2012, ICT Innovations.

[19]  Ivan Kitanovski,et al.  FCSE at ImageCLEF 2012: Evaluating Techniques for Medical Image Retrieval , 2012, CLEF.

[20]  William R. Hersh,et al.  Assessing thesaurus-based query expansion using the UMLS Metathesaurus , 2000, AMIA.