Semantic Mapping of Clinical Model Data to Biomedical Terminologies to Facilitate Data Interoperability

The lack of use of a single terminology across all health systems has led to issues of patient data interoperability. This requires data models to be independent of the terminologies when formalising data representations. The paper discusses the principles to base a middleware system to enable semantic mapping of data in the models to formal biomedical terminologies. The principles have been tested against the MoST system, which maps Archetype data to SNOMED CT codes. At first, contextual and non-contextual methods were applied using lexical and semantic procedures to obtain matches. These automated matches were then presented as candidate mappings to clinical modelers to choose from. The aim of the research is to enable clinical modelers to quickly and efficiently codify data at the time of modeling the information through automated processes. The research intends to simplify the task of mapping thereby encouraging standardising data at source by alleviating tedious manual lookups performed traditionally by clinicians. Introduction An unusual feature of health informatics is that biomedical terminology models and clinical data models are developed by separate groups that work independently of each other. Therefore, there are inherent differences in the principles on which each of them is based as well disparities in the semantics of representation. However, there is increasing government recognition for the need to integrate the functioning of all complementing models in health information systems to achieve data interoperability and shared care. To achieve data interoperability it is important that data conforms to some standard, which is adopted by all conforming systems. In health care, formal biomedical terminologies provide such standards to record patient data. Patient data is increasingly being recorded using formal clinical information systems. Data models, such as Archetype Models, form part of such systems by providing structured and constrained representations of clinical recording scenarios. To facilitate data interoperability it is necessary to integrate the data in the models to standard terminologies like Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The paper discusses a methodology used to perform the first level of data standardisation i.e., data mapping. The Model Standardisation Using Terminology (MoST) system was developed to test the methodology using openEHR Archetype Models and SNOMED CT. Context and non-context methods using lexical and semantic procedures were employed to find matches. The most appropriate matches resulting from application of filtering rules were presented as candidates to the clinical modeler for mapping. A clinical modeler is a person with medical knowledge who is engaged in the task of modeling clinical data for use in information systems. The enhancement in the precision of the results by applying semantic and filtering techniques will be demonstrated. The time, speed, and quality of the results will then be evaluated against traditional manual term matching and lexical processes.

[1]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[2]  J. Fleiss Statistical methods for rates and proportions , 1974 .

[3]  Mark S. Tuttle,et al.  Intervocabulary Mapping Within the UMLS: The Role of Lexical Matching , 1988 .

[4]  Alan L. Rector Helping with a Humanly Impossible Task: Integrating Knowledge Based Systems into Clinical Care , 1989, SCAI.

[5]  John F. Sowa,et al.  Principles of semantic networks , 1991 .

[6]  A. Rector,et al.  Foundations for an Electronic Medical Record , 1991, Methods of Information in Medicine.

[7]  S Kay,et al.  A Framework for Modelling the Electronic Medical Record , 1993, Methods of Information in Medicine.

[8]  B L Humphreys,et al.  The UMLS project: making the conceptual connection between users and the information they need. , 1993, Bulletin of the Medical Library Association.

[9]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[10]  Dean F. Sittig Grand challenges in medical informatics? , 1994, J. Am. Medical Informatics Assoc..

[11]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[12]  A. Rector,et al.  A Terminology Server for Medical Language and Medical Information Systems , 1995, Methods of Information in Medicine.

[13]  Rogers P Pole The GALEN High Level Ontology , 1996 .

[14]  Kent A. Spackman,et al.  SNOMED RT: a reference terminology for health care , 1997, AMIA.

[15]  J. Cimino Desiderata for Controlled Medical Vocabularies in the Twenty-First Century , 1998, Methods of Information in Medicine.

[16]  Torben Bach Pedersen,et al.  Research issues in clinical data warehousing , 1998, Proceedings. Tenth International Conference on Scientific and Statistical Database Management (Cat. No.98TB100243).

[17]  A. Rector Thesauri and Formal Classifications: Terminologies for People and Machines , 1998, Methods of Information in Medicine.

[18]  K. I. Jantunen,et al.  Guide to Medical Informatics, the Internet, and Telemedicine , 1998 .

[19]  Olivier Bodenreider,et al.  Beyond synonymy: exploiting the UMLS semantics in mapping vocabularies , 1998, AMIA.

[20]  A. Rector Clinical Terminology: Why Is it so Hard? , 1999, Methods of Information in Medicine.

[21]  Stephen B. Johnson,et al.  A semantic lexicon for medical language processing. , 1999, Journal of the American Medical Informatics Association : JAMIA.

[22]  Les Smith,et al.  Information management and technology strategy in healthcare: local timescales and national requirements , 2000, Inf. Res..

[23]  Robert H. Dolin,et al.  An update on HL7's XML-based document representation standards , 2000, AMIA.

[24]  Kate Johnson,et al.  A method for the automated mapping of laboratory results to LOINC , 2000, AMIA.

[25]  R. Gottlieb,et al.  Structured clinical observations: a method to teach clinical skills with limited time and financial resources. , 2000, Pediatrics.

[26]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[27]  T Beale,et al.  Archetypes: Constraint-based Domain Models for Future-proof Information Systems , 2000, OOPSLA 2000.

[28]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[29]  P. V. Biron,et al.  The HL7 Clinical Document Architecture. , 2001, Journal of the American Medical Informatics Association : JAMIA.

[30]  N. F. Noy,et al.  Ontology Development 101: A Guide to Creating Your First Ontology , 2001 .

[31]  Alan L. Rector,et al.  The Interface between Information, Terminology, and Inference Models , 2001, MedInfo.

[32]  Kilem Gwet Computing Inter-Rater Reliability With the SAS System Kilem Gwet , Ph . D . Sr , 2002 .

[33]  P. Maurette,et al.  [To err is human: building a safer health system]. , 2002, Annales francaises d'anesthesie et de reanimation.

[34]  George Stricker,et al.  Criteria for evaluating treatment guidelines. , 2002, The American psychologist.

[35]  William T. Hole,et al.  Finding UMLS Metathesaurus concepts in MEDLINE , 2002, AMIA.

[36]  Kent A. Spackman,et al.  Role grouping as an extension to the description logic of Ontylog, motivated by concept modeling in SNOMED , 2002, AMIA.

[37]  Alan L. Rector Analysis of propagation along transitive roles: Formalisation of the GALEN experience with Medical Ontologies , 2002, Description Logics.

[38]  Sam Heard,et al.  Archetype Definitions and Principles , 2003 .

[39]  Mansour Sarr Improving precision and recall using a spell checker in a search engine , 2003 .

[40]  D Kalra,et al.  Clinical foundations and information architecture for the implementation of a federated health record service , 2003 .

[41]  Stephanie W. Haas,et al.  Using nurses' natural language entries to build a concept-oriented terminology for patients' chief complaints in the emergency department , 2003, J. Biomed. Informatics.

[42]  M. Maloof Learning When Data Sets are Imbalanced and When Costs are Unequal and Unknown , 2003 .

[43]  C. McDonald,et al.  LOINC, a universal standard for identifying laboratory observations: a 5-year update. , 2003, Clinical chemistry.

[44]  Volker Haarslev,et al.  Racer: An OWL Reasoning Agent for the Semantic Web , 2003 .

[45]  Tze-Wey Loong,et al.  Understanding sensitivity and specificity with the right side of the brain , 2003, BMJ : British Medical Journal.

[46]  Cen Tc,et al.  CEN/TC 251: European Standardization of Health Informatics , 2004 .

[47]  J. Sim,et al.  The kappa statistic in reliability studies: use, interpretation, and sample size requirements. , 2005, Physical therapy.

[48]  Olivier Bodenreider,et al.  Linking the Gene Ontology to other biological ontologies , 2005 .

[49]  Thomas K. Houston,et al.  Data Quality in the Outpatient Setting: Impact on Clinical Decision Support Systems , 2005, AMIA.

[50]  Alan L. Rector,et al.  Binding Ontologies & Coding Systems to Electronic Health Records and Messages , 2006, KR-MED.

[51]  Jennifer Y. Sun,et al.  A system for automated lexical mapping. , 2006, Journal of the American Medical Informatics Association : JAMIA.

[52]  Ian Horrocks,et al.  FaCT++ Description Logic Reasoner: System Description , 2006, IJCAR.

[53]  Andrew Nugent,et al.  Emergency medicine. , 2007, Clinical privilege white paper.

[54]  Alan L. Rector,et al.  Unambiguous data modeling to ensure higher accuracy term binding to clinical terminologies , 2007, AMIA.

[55]  A. Rector,et al.  Medical Informatics , 2003, Description Logic Handbook.

[56]  Alan L. Rector,et al.  What's in a code? Towards a Formal Account of the Relation of Ontologies and Coding Systems , 2007, MedInfo.

[57]  Alan L. Rector,et al.  Semantic Issues in Integrating Data from Different Models to Achieve Data Interoperability , 2007, MedInfo.

[58]  S. Trent Rosenbloom,et al.  Experiences mapping a legacy interface terminology to SNOMED CT , 2008, BMC Medical Informatics Decis. Mak..

[59]  John Wright,et al.  Better information for better health , 2008 .

[60]  Aldo Gangemi,et al.  The GALEN CORE Model Schemata for Anatomy: Towards a Re-usable Application-Independent Model of Medical Concepts , 2008 .

[61]  Erik Sundvall,et al.  Integration of tools for binding archetypes to SNOMED CT , 2008, BMC Medical Informatics Decis. Mak..