EHR2CCAS: A framework for mapping EHR to disease knowledge presenting causal chain of disorders - chronic kidney disease example

OBJECTIVE The goal of this work was to capture diseases in patients by comprehending the fine-grained medical conditions and disease progression manifested by transitions in medical conditions. We realize this by introducing our earlier work on a state-of-the-art knowledge presentation, which defines a disease as a causal chain of abnormal states (CCAS). Here, we propose a framework, EHR2CCAS, for constructing a system to map electronic health record (EHR) data to CCAS. MATERIALS AND METHODS EHR2CCAS is a framework consisting of modules that access heterogeneous EHR to estimate the presence of abnormal states in a CCAS for a patient in a given time window. EHR2CCAS applies expert-driven (rule-based) and data-driven (machine learning) methods to identify abnormal states from structured and unstructured EHR data. It features data-driven approaches for unlocking clinical texts and imputations based on the EHR temporal properties and the causal CCAS structure. This study presents the CCAS of chronic kidney disease as an example. A mapping system between the EHR from the University of Tokyo Hospital and CCAS of chronic kidney disease was constructed and evaluated against expert annotation. RESULTS The system achieved high prediction performance in identifying abnormal states that had strong agreement among annotators. Our handling of narrative varieties in texts and our imputation of the presence of an abnormal state markedly improved the prediction performance. EHR2CCAS presents patient data describing the temporal presence of abnormal states in CCAS, which is useful in individual disease progression management. Further analysis of the differentiation of transition among abnormal states outputted by EHR2CCAS can contribute to detecting disease subtypes. CONCLUSION This work represents the first step toward combining disease knowledge and EHR to extract abnormality related to a disease defined as fine-grained abnormal states and transitions among them. This can aid in disease progression management and deep phenotyping.

[1]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[2]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[3]  Daniel P. Morin,et al.  Evaluating the benefits of home-based management of atrial fibrillation: current perspectives , 2016, Pragmatic and observational research.

[4]  Alan R. Aronson,et al.  An overview of MetaMap: historical perspective and recent advances , 2010, J. Am. Medical Informatics Assoc..

[5]  Kazuhiko Ohe,et al.  Identity Tracking of a Disease as a Causal Chain , 2012, ICBO.

[6]  Riichiro Mizoguchi,et al.  Causality and the ontology of disease , 2015, Appl. Ontology.

[7]  Tina Hernandez-Boussard,et al.  Advances in Electronic Phenotyping: From Rule-Based Definitions to Machine Learning Models. , 2018, Annual review of biomedical data science.

[8]  Hongfang Liu,et al.  Journal of Biomedical Informatics , 2022 .

[9]  J. Górriz,et al.  The Concept and the Epidemiology of Diabetic Nephropathy Have Changed in Recent Years , 2015, Journal of clinical medicine.

[10]  Yoshua Bengio,et al.  Practical Recommendations for Gradient-Based Training of Deep Architectures , 2012, Neural Networks: Tricks of the Trade.

[11]  Kazuhiko Ohe,et al.  A Semi-Automatic Framework to Identify Abnormal States in EHR Narratives , 2017, MedInfo.

[12]  Kazuhiko Ohe,et al.  Disease Compass– a navigation system for disease knowledge based on ontology and linked data techniques , 2017, Journal of Biomedical Semantics.

[13]  Kazuhiko Ohe,et al.  TEXT2TABLE: Medical Text Summarization System Based on Named Entity Recognition and Modality Identification , 2009, BioNLP@HLT-NAACL.

[14]  Peter N. Robinson,et al.  Deep phenotyping for precision medicine , 2012, Human mutation.

[15]  Jimeng Sun,et al.  Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review , 2018, J. Am. Medical Informatics Assoc..

[16]  Yuan Luo,et al.  Identifying patient smoking status from medical discharge records. , 2008, Journal of the American Medical Informatics Association : JAMIA.

[17]  Paul A. Harris,et al.  PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability , 2016, J. Am. Medical Informatics Assoc..

[18]  Kazuhiko Ohe,et al.  An ontological modeling approach for abnormal states and its application in the medical domain , 2014, Journal of Biomedical Semantics.

[19]  Lina Maria Rojas-Barahona,et al.  Deep learning for sentiment analysis , 2016, Lang. Linguistics Compass.

[20]  George Hripcsak,et al.  Deep phenotyping: Embracing complexity and temporality—Towards scalability, portability, and interoperability , 2020, Journal of Biomedical Informatics.

[21]  Klaus-Peter Adlassnig,et al.  Medical Fuzzy Control Systems with Fuzzy Arden Syntax , 2017, EUSFLAT/IWIFSGN.

[22]  Kazuhiko Ohe,et al.  River Flow Model of Diseases , 2011, ICBO.

[23]  W. Kibbe,et al.  Annotating the human genome with Disease Ontology , 2009, BMC Genomics.

[24]  George Hripcsak,et al.  Natural language processing in an operational clinical information system , 1995, Natural Language Engineering.

[25]  I. Masakane 2016 Annual Dialysis Data Report, JSDT Renal Data Registry , 2018 .

[26]  Werner Ceusters,et al.  Toward an Ontological Treatment of Disease and Diagnosis , 2009, Summit on translational bioinformatics.

[27]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[28]  Ma Xiaojun,et al.  Syntactic and semantic parser for clinical text based on medical and linguistic knowledge , 2016 .

[29]  Merlin C. Thomas,et al.  Diabetic kidney disease , 2015, Nature Reviews Disease Primers.