Multi-terminology indexing for the assignment of MeSH descriptors to medical abstracts in French

BACKGROUND To facilitate information retrieval in the biomedical domain, a system for the automatic assignment of Medical Subject Headings to documents curated by an online quality-controlled health gateway was implemented. The French Multi-Terminology Indexer (F-MTI) implements a multiterminology approach using nine main medical terminologies in French and the mappings between them. OBJECTIVE This paper presents recent efforts to assess the added value of (a) integrating four new terminologies (Orphanet, ATC, drug names, MeSH supplementary concepts) into F-MTI's knowledge sources and (b) performing the automatic indexing on the titles and abstracts (vs. title only) of the online health resources. METHODS F-MTI was evaluated on a CISMeF corpus comprising 18,161 manually indexed resources. RESULTS The performance of F-MTI including nine health terminologies on CISMeF resources with Title only was 27.9% precision and 19.7% recall, while the performance on CISMeF resources with Title and Abstract is 14.9 % precision (-13.0%) and 25.9% recall (+6.2%). CONCLUSION In a few weeks, CISMeF will launch the indexing of resources based on title and abstract, using nine terminologies.

[1]  Alan R. Aronson,et al.  Semi-Automatic Indexing of Full Text Biomedical Articles , 2005, AMIA.

[2]  Olivier Bodenreider,et al.  Beyond synonymy: exploiting the UMLS semantics in mapping vocabularies , 1998, AMIA.

[3]  Stéfan Jacques Darmoni,et al.  Automatic indexing of online health resources for a French quality controlled gateway , 2006, Inf. Process. Manag..

[4]  Fiammetta Namer FLEMM : Un analyseur flexionnel du français à base de règles , 2000 .

[5]  Stuart Weibel The State of the Dublin Core Metadata Initiative , 1999 .

[6]  Aurélie Névéol,et al.  Enhancing the MeSH thesaurus to retrieve French online health resources in a quality-controlled gateway. , 2004, Health information and libraries journal.

[7]  Stéfan Jacques Darmoni,et al.  Using multi-terminology indexing for the assignment of MeSH descriptors to health resources in a French online catalogue , 2008, AMIA.

[8]  Stéfan Jacques Darmoni,et al.  Exploring Multi-terminology Indexing of Discharge Summaries , 2008 .

[9]  Stuart Weibel,et al.  State of the Dublin Core Metadata Initiative, April 2003 , 2003, D Lib Mag..

[10]  Marius Fieschi,et al.  SMTS®: un serveur multiterminologies en santé , 2009 .

[11]  Susanne M. Humphrey,et al.  The NLM Indexing Initiative's Medical Text Indexer , 2004, MedInfo.

[12]  Pierre Nugues,et al.  The Integral Dictionary: A Lexical Network Based on Componential Semantics , 2003, ICCSA.

[13]  Stéfan Jacques Darmoni,et al.  Evaluation of a Simple Method for the Automatic Assignment of MeSH Descriptors to Health Resources in a French Online Catalogue , 2007, MedInfo.