NEO: Systematic Non-Lattice Embedding of Ontologies for Comparing the Subsumption Relationship in SNOMED CT and in FMA Using MapReduce

A structural disparity of the subsumption relationship between FMA and SNOMED CT’s Body Structure sub-hierarchy is that while the is-a relation in FMA has a tree structure, the corresponding relation in Body Structure is not even a lattice. This paper introduces a method called NEO, for non-lattice embedding of FMA fragments into the Body Structure sub-hierarchy to understand (1) this structural disparity, and (2) its potential utility in analyzing non-lattice fragments in SNOMED CT. NEO consists of four steps. First, transitive, upper- and down-closures are computed for FMA and SNOMED CT using MapReduce, a modern scalable distributed computing technique. Secondly, UMLS mappings between FMA and SNOMED CT concepts are used to identify equivalent concepts in non-lattice fragments from Body Structure. Then, non-lattice fragments in the Body Structure sub-hierarchy are extracted, and FMA concepts matching those in the non-lattice fragments are used as the seeds to generate the corresponding FMA fragments. Lastly, the corresponding FMA fragments are embedded to the non-lattice fragments for comparative visualization and analysis. After identifying 8,428 equivalent concepts between the collection of over 30,000 concepts in Body Structure and the collection of over 83,000 concepts in FMA using UMLS equivalent concept mappings, 2,117 shared is-a relations and 5,715 mismatched relations were found. Among Body Structure’s 90,465 non-lattice fragments, 65,968 (73%) contained one or more is-a relations that are in SNOMED CT but not in FMA, even though they have equivalent source and target concepts. This shows that SNOMED CT may be more liberal in classifying a relation as is-a, a potential explanation for the fragments not conforming to the lattice property.

[1]  Mark A. Musen,et al.  Ontology Quality Assurance with the Crowd , 2013, HCOMP.

[2]  Olivier Bodenreider,et al.  Mining Relation Reversals in the Evolution of SNOMED CT Using MapReduce , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[3]  Guo-Qiang Zhang,et al.  MEDCIS: Multi-Modality Epilepsy Data Capture and Integration System , 2014, AMIA.

[4]  Olivier Bodenreider,et al.  Using SPARQL to Test for Lattices: Application to Quality Assurance in Biomedical Ontologies , 2010, International Semantic Web Conference.

[5]  Olivier Bodenreider,et al.  Comparing the Representation of Anatomy in the FMA and SNOMED CT , 2006, AMIA.

[6]  Guo-Qiang Zhang,et al.  A Semantic-based Approach for Exploring Consumer Health Questions Using UMLS , 2014, AMIA.

[7]  O Bodenreider,et al.  Biomedical ontologies in action: role in knowledge management, data integration and decision support. , 2008, Yearbook of medical informatics.

[8]  Chien-Hung Chen,et al.  Domain Ontology As Conceptual Model for Big Data Management: Application in Biomedical Informatics , 2014, ER.

[9]  Guo-Qiang Zhang Large-Scale, Exhaustive Lattice-Based Structural Auditing of SNOMED CT , 2010, KSEM.

[10]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[11]  Kevin Donnelly,et al.  SNOMED-CT: The advanced terminology and coding system for eHealth. , 2006, Studies in health technology and informatics.

[12]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[13]  Licong Cui,et al.  Ontology-guided Health Information Extraction, Organization, and Exploration , 2014 .

[14]  Lingyun Luo,et al.  An analysis of FMA using structural self-bisimilarity , 2013, J. Biomed. Informatics.