Combining Thesauri-Based Methods for Biomedical Retrieval

This paper describes our participation in the TREC 2005 Genomics track. We took part in the ad hoc retrieval task and aimed at integrat- ing thesauri in the retrieval model. We developed three thesauri-based methods, two of which made use of the existing MeSH thesaurus. One method uses blind relevance feedback on MeSH terms, the second uses an index of the MeSH thesaurus for query expansion. The third method makes use of a dynamically generated lookup list, by which acronyms and synonyms could be inferred. We show that, despite the relatively minor improve- ments in retrieval performance of individually ap- plied methods, a combination works best and is able to deliver significant improvements over the baseline.

[1]  Mark Sanderson,et al.  Information retrieval system evaluation: effort, sensitivity, and reliability , 2005, SIGIR '05.

[2]  Jay Ponte,et al.  LANGUAGE MODELS FOR RELEVANCE FEEDBACK , 2002 .

[3]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[4]  Nicholas J. Belkin,et al.  Relevance Feedback versus Local Context Analysis as Term Suggestion Devices: Rutgers' TREC-8 Interactive Track Experience , 1999, TREC.

[5]  Edward A. Fox,et al.  Combination of Multiple Searches , 1993, TREC.

[6]  Jaap Kamps,et al.  Improving Retrieval Effectiveness by Reranking Documents Based on Controlled Vocabulary , 2004, ECIR.

[7]  Wessel Kraaij,et al.  MeSH Based Feedback, Concept Recognition and Stacked Classification for Curation Tasks , 2004, TREC.

[8]  M. de Rijke,et al.  Biomedical Retrieval: How Can a Thesaurus Help? , 2005, OTM Conferences.

[9]  Dennis Wollersheim,et al.  Using Medical Test Collection Relevance Judgements to Identify Ontological Relationships Useful for Query Expansion , 2005, 21st International Conference on Data Engineering Workshops (ICDEW'05).

[10]  Marti A. Hearst,et al.  TREC 2007 Genomics Track Overview , 2007, TREC.

[11]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[12]  Xie Kanglin Lucene Search Engine , 2007 .

[13]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[14]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[15]  M. de Rijke,et al.  The effectiveness of combining information retrieval strategies for European languages , 2004, SAC '04.