Deep Learning with Heterogeneous Graph Embeddings for Mortality Prediction from Electronic Health Records

Computational prediction of in-hospital mortality in the setting of an intensive care unit can help clinical practitioners to guide care and make early decisions for interventions. As clinical data are complex and varied in their structure and components, continued innovation of modeling strategies is required to identify architectures that can best model outcomes. In this work, we train a Heterogeneous Graph Model (HGM) on Electronic Health Record data and use the resulting embedding vector as additional information added to a Convolutional Neural Network (CNN) model for predicting in-hospital mortality. We show that the additional information provided by including time as a vector in the embedding captures the relationships between medical concepts, lab tests, and diagnoses, which enhances predictive performance. We find that adding HGM to a CNN model increases the mortality prediction accuracy up to 4%. This framework serves as a foundation for future experiments involving different EHR data types on important healthcare prediction tasks.

[1]  Roger G. Mark,et al.  Real-time Mortality Prediction in the Intensive Care Unit , 2018, AMIA.

[2]  Kipp W. Johnson,et al.  Phe2vec: Automated disease phenotyping based on unsupervised embeddings from electronic health records , 2020, medRxiv.

[3]  Parisa Rashidi,et al.  Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis , 2017, IEEE Journal of Biomedical and Health Informatics.

[4]  Spencer S. Jones,et al.  Development and Evaluation of a Machine Learning Model for the Early Identification of Patients at Risk for Sepsis , 2019, Annals of emergency medicine.

[5]  Cesare Furlanello,et al.  Deep representation learning of electronic health records to unlock patient stratification at scale , 2020, npj Digital Medicine.

[6]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[7]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8]  Jeffrey Dean,et al.  Scalable and accurate deep learning with electronic health records , 2018, npj Digital Medicine.

[9]  Ping Zhang,et al.  Risk Prediction with Electronic Health Records: A Deep Learning Approach , 2016, SDM.

[10]  Anupam Shukla,et al.  Mortality Prediction of ICU patients using Machine Leaning: A survey , 2017, ICCDA '17.

[11]  Li Li,et al.  Automated disease cohort selection using word embeddings from Electronic Health Records , 2018, PSB.

[12]  Saehoon Kim,et al.  A deep learning model for real-time mortality prediction in critically ill children , 2019, Critical Care.

[13]  Jiaqi Gong,et al.  HCNN: Heterogeneous Convolutional Neural Networks for Comorbid Risk Prediction with Electronic Health Records , 2017, 2017 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[14]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[15]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[16]  Thomas Hofmann,et al.  Machine learning for real-time prediction of complications in critical care: a retrospective study. , 2018, The Lancet. Respiratory medicine.

[17]  Edward Choi,et al.  Graph Convolutional Transformer: Learning the Graphical Structure of Electronic Health Records , 2019, ArXiv.

[18]  Nitesh V. Chawla,et al.  metapath2vec: Scalable Representation Learning for Heterogeneous Networks , 2017, KDD.

[19]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[20]  Jimeng Sun,et al.  MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare , 2018, NeurIPS.

[21]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[22]  Kipp W. Johnson,et al.  The next generation of precision medicine: observational studies, electronic health records, biobanks and continuous monitoring. , 2018, Human molecular genetics.