Improving image retrieval effectiveness via query expansion using MeSH hierarchical structure.

OBJECTIVE We explored two strategies for query expansion utilizing medical subject headings (MeSH) ontology to improve the effectiveness of medical image retrieval systems. In order to achieve greater effectiveness in the expansion, the search text was analyzed to identify which terms were most amenable to being expanded. DESIGN To perform the expansions we utilized the hierarchical structure by which the MeSH descriptors are organized. Two strategies for selecting the terms to be expanded in each query were studied. The first consisted of identifying the medical concepts using the unified medical language system metathesaurus. In the second strategy the text of the query was divided into n-grams, resulting in sequences corresponding to MeSH descriptors. MEASUREMENTS For the evaluation of the system, we used the collection made available by the ImageCLEF organization in its 2011 medical image retrieval task. The main measure of efficiency employed for evaluating the techniques developed was the mean average precision (MAP). RESULTS Both strategies exceeded the average MAP score in the ImageCLEF 2011 competition (0.1644). The n-gram expansion strategy achieved a MAP of 0.2004, which represents an improvement of 21.89% over the average MAP score in the competition. On the other hand, the medical concepts expansion strategy scored 0.2172 in the MAP, representing a 32.11% improvement. This run won the text-based medical image retrieval task in 2011. CONCLUSIONS Query expansion exploiting the hierarchical structure of the MeSH descriptors achieved a significant improvement in image retrieval systems.

[1]  Luis Alfonso Ureña López,et al.  Query expansion with a medical ontology to improve a multimodal information retrieval system , 2009, Comput. Biol. Medicine.

[2]  Sophia Ananiadou,et al.  Text Mining for Biology And Biomedicine , 2005 .

[3]  Dietrich Rebholz-Schuhmann,et al.  Ontology refinement for improved information retrieval , 2010, Inf. Process. Manag..

[4]  Michael Krauthammer,et al.  Yale Image Finder (YIF): a new search engine for retrieving biomedical images , 2008, Bioinform..

[5]  Patrick Ruch,et al.  Taking Benefit of Query and Document Expansion using MeSH Descriptors in Medical ImageCLEF 2009 , 2009, CLEF.

[6]  George R. Thoma,et al.  Biomedical Image Retrieval Using Multimodal Context and Concept Feature Spaces , 2011, MCBR-CDS.

[7]  Henning Müller,et al.  Overview of the CLEF 2011 Medical Image Classification and Retrieval Tasks , 2011, CLEF.

[8]  Patrick Ruch,et al.  Query and Document Expansion with Medical Subject Headings Terms at Medical Imageclef 2008 , 2008, CLEF.

[9]  Zhiyong Lu,et al.  Evaluation of query expansion using MeSH in PubMed , 2009, Information Retrieval.

[10]  Preslav Nakov,et al.  BioText Search Engine: beyond abstract search , 2007, Bioinform..

[11]  Miguel Ángel García Cumbreras,et al.  Integrating MeSH Ontology to Improve Medical Information Retrieval , 2007, CLEF.

[12]  Carole A. Goble,et al.  Ontology-based Knowledge Representation for Bioinformatics , 2000, Briefings Bioinform..

[13]  Cheng Thao,et al.  GoldMiner: a radiology image search engine. , 2007, AJR. American journal of roentgenology.

[14]  Miguel Ángel García Cumbreras,et al.  Query Expansion on Medical Image Retrieval: MeSH vs. UMLS , 2008, CLEF.

[15]  Zhiyong Lu,et al.  Recommending MeSH terms for annotating biomedical articles , 2011, J. Am. Medical Informatics Assoc..

[16]  Eugene Kim,et al.  Overview of the ImageCLEFmed 2006 Medical Retrieval and Annotation Tasks , 2006, CLEF.

[17]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[18]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[19]  George R. Thoma,et al.  Design and Development of a Multimodal Biomedical Information Retrieval System , 2012, J. Comput. Sci. Eng..

[20]  Charles E. Kahn,et al.  Automated semantic indexing of figure captions to improve radiology image retrieval. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[21]  K P Pfeiffer,et al.  A MeSH based intelligent search intermediary for Consumer Health Information Systems. , 2001 .

[22]  Betsy L. Humphreys,et al.  Relationships in Medical Subject Headings (MeSH) , 2001 .

[23]  Ralph Grishman,et al.  A Maximum Entropy Approach to Named Entity Recognition , 1999 .

[24]  Stuart J. Nelson,et al.  The MeSH Translation Maintenance System: Structure, Interface Design, and Implementation , 2004, MedInfo.

[25]  Eugene Kim,et al.  Overview of the ImageCLEFmed 2006 Medical Retrieval and Medical Annotation Tasks , 2006, CLEF.