Aggregating the syntactic and semantic similarity of healthcare data towards their transformation to HL7 FHIR through ontology matching

BACKGROUND AND OBJECTIVE Healthcare systems deal with multiple challenges in releasing information from data silos, finding it almost impossible to be implemented, maintained and upgraded, with difficulties ranging in the technical, security and human interaction fields. Currently, the increasing availability of health data is demanding data-driven approaches, bringing the opportunities to automate healthcare related tasks, providing better disease detection, more accurate prognosis, faster clinical research advance and better fit for patient management. In order to share data with as many stakeholders as possible, interoperability is the only sustainable way for letting systems to talk with one another and getting the complete image of a patient. Thus, it becomes clear that an efficient solution in the data exchange incompatibility is of extreme importance. Consequently, interoperability can develop a communication framework between non-communicable systems, which can be achieved through transforming healthcare data into ontologies. However, the multidimensionality of healthcare domain and the way that is conceptualized, results in the creation of different ontologies with contradicting or overlapping parts. Thus, an effective solution to this problem is the development of methods for finding matches among the various components of ontologies in healthcare, in order to facilitate semantic interoperability. METHODS The proposed mechanism promises healthcare interoperability through the transformation of healthcare data into the corresponding HL7 FHIR structure. In more detail, it aims at building ontologies of healthcare data, which are later stored into a triplestore. Afterwards, for each constructed ontology the syntactic and semantic similarities with the various HL7 FHIR Resources ontologies are calculated, based on their Levenshtein distance and their semantic fingerprints accordingly. Henceforth, after the aggregation of these results, the matching to the HL7 FHIR Resources takes place, translating the healthcare data into a widely adopted medical standard. RESULTS Through the derived results it can be seen that there exist cases that an ontology has been matched to a specific HL7 FHIR Resource due to its syntactic similarity, whereas the same ontology has been matched to a different HL7 FHIR Resource due to its semantic similarity. Nevertheless, the developed mechanism performed well since its matching results had exact match with the manual ontology matching results, which are considered as a reference value of high quality and accuracy. Moreover, in order to furtherly investigate the quality of the developed mechanism, it was also evaluated through its comparison with the Alignment API, as well as the non-dominated sorting genetic algorithm (NSGA-III) which provide ontology alignment. In both cases, the results of all the different implementations were almost identical, proving the developed mechanism's high efficiency, whereas through the comparison with the NSGA-III algorithm, it was observed that the developed mechanism needs additional improvements, through a potential adoption of the NSGA-III technique. CONCLUSIONS The developed mechanism creates new opportunities in conquering the field of healthcare interoperability. However, according to the mechanism's evaluation results, it is almost impossible to create syntactic or semantic patterns for understanding the nature of a healthcare dataset. Hence, additional work should be performed in evaluating the developed mechanism, and updating it with respect to the results that will derive from its comparison with similar ontology matching mechanisms and data of multiple nature.

[1]  José F. Aldana-Montes,et al.  Review: an overview of current ontology meta-matching solutions , 2012 .

[2]  Susannah G. Tringe,et al.  FOAM (Functional Ontology Assignments for Metagenomes): a Hidden Markov Model (HMM) database with environmental focus , 2014, Nucleic acids research.

[3]  José F. Aldana-Montes,et al.  Evaluation of two heuristic approaches to solve the ontology meta-matching problem , 2009, Knowledge and Information Systems.

[4]  Yi Li,et al.  RiMOM: A Dynamic Multistrategy Ontology Alignment Framework , 2009, IEEE Transactions on Knowledge and Data Engineering.

[5]  Christophe Cruz,et al.  Transforming XML documents to OWL ontologies: A survey , 2015, J. Inf. Sci..

[6]  Dimosthenis Kyriazis,et al.  FHIR Ontology Mapper (FOM): Aggregating Structural and Semantic Similarities of Ontologies towards their Alignment to HL7 FHIR , 2018, 2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom).

[7]  Yildiray Kabak,et al.  Artemis message exchange framework: semantic interoperability of exchanged messages in the healthcare domain , 2005, SGMD.

[8]  Chakib Tadj,et al.  A modification of Wu and Palmer Semantic Similarity Measure , 2016 .

[9]  Dimosthenis Kyriazis,et al.  The Road to the Future of Healthcare: Transmitting Interoperable Healthcare Data Through a 5G Based Communication Platform , 2018, EMCIS.

[10]  Stefanos D. Kollias,et al.  A String Metric for Ontology Alignment , 2005, SEMWEB.

[11]  Diego Martínez Hernández,et al.  Experiences in reusing knowledge sources using Protégé and PROMPT , 2005, Int. J. Hum. Comput. Stud..

[12]  Charles N Mead,et al.  Data interchange standards in healthcare IT--computable semantic interoperability: now possible but still difficult, do we really need a better mousetrap? , 2006, Journal of healthcare information management : JHIM.

[13]  Dimosthenis Kyriazis,et al.  Internet of Medical Things (IoMT): Acquiring and Transforming Data into HL7 FHIR through 5G Network Slicing , 2019, Emerging Science Journal.

[14]  José Francisco Aldana Montes,et al.  MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring , 2009, OTM Workshops.

[15]  Barbara A. Gylys,et al.  MEDICAL TERMINOLOGY SYSTEMS: A BODY SYSTEMS APPROACH , 2004 .

[16]  Dimosthenis Kyriazis,et al.  A String Similarity Evaluation for Healthcare Ontologies Alignment to HL7 FHIR Resources , 2019 .

[17]  Erhard Rahm,et al.  Schema and ontology matching with COMA++ , 2005, SIGMOD '05.

[18]  J. Kacprzyk,et al.  The Ordered Weighted Averaging Operators: Theory and Applications , 1997 .

[19]  Junfeng Chen,et al.  Using NSGA-III for optimising biomedical ontology alignment , 2019, CAAI Trans. Intell. Technol..

[20]  Thabet Slimani,et al.  Description and Evaluation of Semantic Similarity Measures Approaches , 2013, ArXiv.

[21]  David Sánchez,et al.  Using ontologies for structuring organizational knowledge in Home Care assistance , 2010, Int. J. Medical Informatics.

[22]  Grzegorz Kondrak,et al.  N-Gram Similarity and Distance , 2005, SPIRE.

[23]  Dimosthenis Kyriazis,et al.  Structurally Mapping Healthcare Data to HL7 FHIR through Ontology Alignment , 2019, Journal of Medical Systems.

[24]  Guoqian Jiang,et al.  Modeling and validating HL7 FHIR profiles using semantic web Shape Expressions (ShEx) , 2017, J. Biomed. Informatics.

[25]  Alireza Osareh,et al.  ONTOLOGY ALIGNMENT USING MACHINE LEARNING TECHNIQUES , 2011 .

[26]  Jérôme David,et al.  The Alignment API 4.0 , 2011, Semantic Web.

[27]  Sungyoung Lee,et al.  Achieving interoperability among healthcare standards: building semantic mappings at models level , 2012, ICUIMC.

[28]  Pradeep Kumar Ray,et al.  Towards an ontology for data quality in integrated chronic disease management: A realist review of the literature , 2013, Int. J. Medical Informatics.

[29]  Dimosthenis Kyriazis,et al.  Matching Ontologies to HL7 FHIR Towards Their Syntactic and Semantic Similarity , 2018, ICIMTH.

[30]  Dimosthenis Kyriazis,et al.  A Semantic Similarity Evaluation for Healthcare Ontologies Matching to HL7 FHIR Resources , 2020, MIE.

[31]  Manish M. Potey,et al.  Semantic Search based on Ontology Alignment for Information Retrieval , 2014 .

[32]  A. Sunitha,et al.  Ontology-Driven Knowledge-Based Health-Care System, An Emerging Area – Challenges And Opportunities – Indian Scenario , 2014 .

[33]  Xingsi Xue,et al.  Optimizing ontology alignments through a Memetic Algorithm using both MatchFmeasure and Unanimous Improvement Ratio , 2015, Artif. Intell..

[34]  Jeng-Shyang Pan,et al.  A Compact Co-Evolutionary Algorithm for sensor ontology meta-matching , 2017, Knowledge and Information Systems.

[35]  Somjit Arch-int,et al.  A semantic interoperability approach to health‐care data: Resolving data‐level conflicts , 2016, Expert Syst. J. Knowl. Eng..

[36]  Dimosthenis Kyriazis,et al.  Towards a Secure Semantic Knowledge of Healthcare Data Through Structural Ontological Transformations , 2018, JCKBSE.