Enrichment of French Biomedical Ontologies with UMLS Concepts and Semantic Types for Biomedical Named Entity Recognition Though Ontological Semantic Annotation

Medical terminologies and ontologies are a crucial resource for semantic annotation of biomedical text. In French, there are considerably less resources and tools to use them than in English. Some terminologies from the Unified Medical Language System have been translated but often the identifiers used in the UMLS Metathesaurus, that make its huge integrated value, have been lost during the process. In this work, we present our method and results in enriching seven French versions of UMLS sources with UMLS Concept Unique Identifiers and Semantic Types based on information extracted from class labels, multilingual translation mappings and codes. We then measure the impact of the enrichment through the application of the SIFR Annotator, a service to identify ontology concepts in free text deployed within the SIFR BioPortal, a repository for French biomedical ontologies and terminologies. We use the Quaero Corpus to evaluate.

[1]  et al.,et al.  NCBO Technology: Powering semantically aware applications , 2013, Journal of Biomedical Semantics.

[2]  Olivier Bodenreider,et al.  Aggregating UMLS Semantic Types for Reducing Conceptual Complexity , 2001, MedInfo.

[3]  Julien Grosjean,et al.  Multiterminology cross-lingual model to create the European Health Terminology/Ontology Portal , 2011 .

[4]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[5]  Christopher G. Chute,et al.  BioPortal: ontologies and integrated data resources at the click of a mouse , 2009, Nucleic Acids Res..

[6]  Robert H. Baud,et al.  Towards a Unified Medical Lexicon for French , 2003, MIE.

[7]  Lina Fatima Soualmia,et al.  Aligning Biomedical Terminologies in French: Towards Semantic Interoperability in Medical Applications , 2012 .

[8]  Danushka Bollegala,et al.  A Cross-Lingual Similarity Measure for Detecting Biomedical Term Translations , 2015, PloS one.

[9]  Pierre Zweigenbaum,et al.  The Quaero French Medical Corpus : A Ressource for Medical Entity Recognition and Normalization , 2014 .

[10]  I N Sarkar,et al.  Linking biomedical language information and knowledge resources: GO and UMLS. , 2002, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[11]  Clement Jonquet,et al.  SIFR BioPortal : Un portail ouvert et générique d’ontologies et de terminologies biomédicales françaises au service de l’annotation sémantique , 2016 .

[12]  Olivier Bodenreider,et al.  Beyond synonymy: exploiting the UMLS semantics in mapping vocabularies , 1998, AMIA.

[13]  Robert H. Baud,et al.  VUMeF: Extending the French Involvement in the UMLS metathesaurus , 2003, AMIA.

[14]  Mark A. Musen,et al.  Roadmap for a Multilingual BioPortal , 2015, MSW@ESWC.

[15]  Mark A. Musen,et al.  The Open Biomedical Annotator , 2009, Summit on translational bioinformatics.

[16]  Abdul Mateen Rajput,et al.  Semi-automatic Approach for Ontology Enrichment Using UMLS , 2013 .

[17]  Amina Annane,et al.  Multilingual Mapping Reconciliation between English-French Biomedical Ontologies , 2016, WIMS.