Ontology integration: Experience with medical terminologies

To build a common controlled vocabulary is a formidable challenge in medical informatics. Due to vast scale and multiplicity in interpretation of medical data, it is natural to face overlapping terminologies in the process of practicing medical informatics [A. Rector, Clinical terminology: why is it so hard? Methods Inf. Med. 38 (1999) 239-252]. A major concern lies in the integration of seemingly overlapping terminologies in the medical domain and this issue has not been well addressed. In this paper, we describe a novel approach for medical ontology integration that relies on the theory of Algorithmic Semantic Refinement we previously developed. Our approach simplifies the task of matching pairs of corresponding concepts derived from a pair of ontologies, which is vital to terminology mapping. A formal theory and algorithm for our approach have been devised and the application of this method to two medical terminologies has been developed. The result of our work is an integrated medical terminology and a methodology and implementation ready to use for other ontology integration tasks.

[1]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[2]  D. Lindberg,et al.  Building the Unified Medical Language System , 1989 .

[3]  James Geller,et al.  Auditing the UMLS for redundant classifications , 2002, AMIA.

[4]  D. Lindberg,et al.  The Unified Medical Language System , 1993, Methods of Information in Medicine.

[5]  Desmond D'Souza,et al.  Objects, Components, and Frameworks with UML: The Catalysis Approach , 1998 .

[6]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[7]  A. Rector Clinical Terminology: Why Is it so Hard? , 1999, Methods of Information in Medicine.

[8]  Myoung-Ho Kim,et al.  Information Retrieval Based on Conceptual Distance in is-a Hierarchies , 1993, J. Documentation.

[9]  Prasenjit Mitra,et al.  Semi-automatic Integration of Knowledge Sources , 1999 .

[10]  Michael Sussna,et al.  Word sense disambiguation for free-text indexing using a massive semantic network , 1993, CIKM '93.

[11]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[12]  Alexa T. McCray,et al.  Representing biomedical knowledge in the UMLS semantic network , 1993 .

[13]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[14]  Ching-Lai Hwang,et al.  Multiple Attribute Decision Making: Methods and Applications - A State-of-the-Art Survey , 1981, Lecture Notes in Economics and Mathematical Systems.

[15]  Kevin Knight,et al.  Toward Distributed Use of Large-Scale Ontologies t , 1997 .

[16]  A. McCray The UMLS Semantic Network. , 1989 .

[17]  Ian Horrocks,et al.  The GRAIL concept modelling language for medical terminology , 1997, Artif. Intell. Medicine.

[18]  P L Schuyler,et al.  The UMLS Metathesaurus: representing different views of biomedical concepts. , 1993, Bulletin of the Medical Library Association.

[19]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[20]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[21]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[22]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[23]  D. Skuce,et al.  How We Might Reach Agreement on Shared Ontologies: A Fundamental Approach , 1997 .

[24]  Betsy L. Humphreys,et al.  Technical Milestone: The Unified Medical Language System: An Informatics Research Collaboration , 1998, J. Am. Medical Informatics Assoc..

[25]  A. Tversky Features of Similarity , 1977 .

[26]  Naomi C. Broering,et al.  High performance medical libraries: Advances in information management for the virtual era , 1993 .

[27]  Mark A. Musen,et al.  Anchor-PROMPT: Using Non-Local Context for Semantic Matching , 2001, OIS@IJCAI.

[28]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[29]  James Geller,et al.  Research Paper: Representing the UMLS as an Object-oriented Database: Modeling Issues and Advantages , 2000, J. Am. Medical Informatics Assoc..

[30]  Diego Calvanese,et al.  Ontology of Integration and Integration of Ontologies , 2001, Description Logics.

[31]  Georg Groh,et al.  Facilitating the Exchange of Explicit Knowledge through Ontology Mappings , 2001, FLAIRS.

[32]  James Geller,et al.  Semantic refinement and error correction in large terminological knowledge bases , 2003, Data Knowl. Eng..

[33]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[34]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[35]  A. L. Rector Clinical terminology : Why is it so hard? : Challenges to Progresses , 1999 .

[36]  Mark A. Musen,et al.  PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment , 2000, AAAI/IAAI.

[37]  Eneko Agirre,et al.  Word Sense Disambiguation using Conceptual Density , 1996, COLING.

[38]  Alexa T. McCray,et al.  Concepts, Issues, and Standards. Current Status of the NLM's Umls Project: The Scope and Structure of the First Version of the UMLS Seoantic Network , 1990 .

[39]  P. Resnik Selection and information: a class-based approach to lexical relationships , 1993 .

[40]  Amihai Motro,et al.  Database Schema Matching Using Machine Learning with Feature Selection , 2002, CAiSE.

[41]  Gerd Stumme,et al.  FCA-MERGE: Bottom-Up Merging of Ontologies , 2001, IJCAI.

[42]  H. Sofia Pinto,et al.  A methodology for ontology integration , 2001, K-CAP '01.

[43]  Max J. Egenhofer,et al.  Determining Semantic Similarity among Entity Classes from Different Ontologies , 2003, IEEE Trans. Knowl. Data Eng..

[44]  Aldo Gangemi,et al.  ONIONS: An Ontological Methodology for Taxonomic Knowledge Integration , 2007 .

[45]  Hans Chalupsky,et al.  OntoMorph: A Translation System for Symbolic Knowledge , 2000, KR.

[46]  Mark S. Tuttle,et al.  Concepts, Issues, and Standards. Current Status of the NLM's Umls Project: Using Meta-1-The 1st Version of the UMLS Metathesaurus , 1990 .

[47]  Michael Hoey,et al.  Patterns of Lexis In Text , 1991 .

[48]  Jin H. Kim,et al.  A Model of Knowledge Based Information Retrieval with Hierarchical Concept Graph , 1990, J. Documentation.

[49]  Ryutaro Ichise,et al.  Rule Induction for Concept Hierarchy Alignment , 2001, Workshop on Ontology Learning.

[50]  Deborah L. McGuinness,et al.  The Chimaera Ontology Environment , 2000, AAAI/IAAI.