Deep Representation Learning of Patient Data from Electronic Health Records (EHR): A Systematic Review

OBJECTIVES Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective. METHODS We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection. RESULTS Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies. DISCUSSION & CONCLUSION The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to leveraging the richness and potential of available EHR data. Reproducibility and transparency of reported results will hopefully improve. Knowledge distillation and advanced learning techniques will be exploited to assist the capability of learning patient representation further.

[1]  Christian Guttmann,et al.  Deep learning architectures for vector representations of patients and exploring predictors of 30-day hospital readmissions in patients with multiple chronic conditions , 2018, AIH@IJCAI.

[2]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[3]  Kenneth Jung,et al.  Effective Representations of Clinical Notes , 2017 .

[4]  Majid Sarrafzadeh,et al.  TAPER: Time-Aware Patient EHR Representation , 2019, IEEE Journal of Biomedical and Health Informatics.

[5]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[6]  Fenglong Ma,et al.  KAME: Knowledge-based Attention Model for Diagnosis Prediction in Healthcare , 2018, CIKM.

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Svetha Venkatesh,et al.  Learning vector representation of medical objects via EMR-driven nonnegative restricted Boltzmann machines (eNRBM) , 2015, J. Biomed. Informatics.

[9]  Fei Wang,et al.  Patient Subtyping via Time-Aware LSTM Networks , 2017, KDD.

[10]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[11]  Mehul Motani,et al.  Optimizing Autoencoders for Learning Deep Representations From Health Data , 2019, IEEE Journal of Biomedical and Health Informatics.

[12]  Markus Ringnér,et al.  What is principal component analysis? , 2008, Nature Biotechnology.

[13]  Jimeng Sun,et al.  MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare , 2018, NeurIPS.

[14]  Tamara G. Kolda,et al.  On Tensors, Sparsity, and Nonnegative Factorizations , 2011, SIAM J. Matrix Anal. Appl..

[15]  Zachariah Zhang,et al.  Deep EHR: Chronic Disease Prediction Using Medical Notes , 2018, MLHC.

[16]  Pietro Liò,et al.  Graph Attention Networks , 2017, ICLR.

[17]  Hui Xiong,et al.  Temporal Phenotyping from Longitudinal Electronic Health Records: A Graph Based Framework , 2015, KDD.

[18]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[19]  Bernhard Pfahringer,et al.  Deep Holistic Representation Learning from EHR , 2018, 2018 12th International Symposium on Medical Information and Communication Technology (ISMICT).

[20]  Peter B. Walker,et al.  Federated Learning for Healthcare Informatics , 2019, Journal of Healthcare Informatics Research.

[21]  Pascal Vincent,et al.  Contractive Auto-Encoders: Explicit Invariance During Feature Extraction , 2011, ICML.

[22]  Majid Sarrafzadeh,et al.  HeteroMed: Heterogeneous Information Network for Medical Diagnosis , 2018, CIKM.

[23]  Philippe Lambin,et al.  Systematic Review of Privacy-Preserving Distributed Machine Learning From Federated Databases in Health Care , 2020, JCO clinical cancer informatics.

[24]  Jianxin Li,et al.  A Time-Sensitive Hybrid Learning Model for Patient Subgrouping , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[25]  Fei Wang,et al.  Predictive Modeling of the Hospital Readmission Risk from Patients’ Claims Data Using Machine Learning: A Case Study on COPD , 2019, Scientific Reports.

[26]  Jaideep Vaidya,et al.  Privacy-preserving SVM using nonlinear kernels on horizontally partitioned data , 2006, SAC.

[27]  Kirk Roberts,et al.  Deep Patient Representation of Clinical Notes via Multi-Task Learning for Mortality Prediction. , 2019, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[28]  Jing Xu,et al.  Bi-Dimensional Representation of Patients for Diagnosis Prediction , 2019, 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC).

[29]  Alistair E. W. Johnson,et al.  The eICU Collaborative Research Database, a freely available multi-center database for critical care research , 2018, Scientific Data.

[30]  M. de Rijke,et al.  Order-free Medicine Combination Prediction with Graph Convolutional Reinforcement Learning , 2019, CIKM.

[31]  Zoran Obradovic,et al.  Modeling Healthcare Quality via Compact Representations of Electronic Health Records , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[33]  Yan Liu,et al.  Deep Computational Phenotyping , 2015, KDD.

[34]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[35]  Slobodan Vucetic,et al.  EHR phenotyping via jointly embedding medical concepts and words into a unified vector space , 2018, BMC Medical Informatics and Decision Making.

[36]  Yujia Li,et al.  Learning the Graphical Structure of Electronic Health Records with Graph Convolutional Transformer , 2020, AAAI.

[37]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[38]  Yu Cheng,et al.  Exploiting Convolutional Neural Network for Risk Prediction with Medical Feature Embedding , 2017, ArXiv.

[39]  Satrajit S. Ghosh,et al.  Distributed Weight Consolidation: A Brain Segmentation Case Study , 2018, NeurIPS.

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Fei Wang,et al.  A Framework for Mining Signatures from Event Sequences and Its Applications in Healthcare Data , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Walter F. Stewart,et al.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks , 2015, MLHC.

[43]  Jimeng Sun,et al.  Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization , 2014, KDD.

[44]  D. Moher,et al.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. , 2010, International journal of surgery.

[45]  Timothy A. Miller,et al.  Learning Patient Representations from Text , 2018, *SEM@NAACL-HLT.

[46]  Kirk Roberts,et al.  Patient Representation Transfer Learning from Clinical Notes based on Hierarchical Attention Network. , 2020, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[47]  Junzhou Huang,et al.  Imaging-genetic data mapping for clinical outcome prediction via supervised conditional Gaussian graphical model , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[48]  Fei Wang,et al.  From micro to macro: data driven phenotyping by densification of longitudinal electronic medical records , 2014, KDD.

[49]  David Sontag,et al.  Learning Low-Dimensional Representations of Medical Concepts , 2016, CRI.

[50]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[51]  Jiaqi Gong,et al.  HCNN: Heterogeneous Convolutional Neural Networks for Comorbid Risk Prediction with Electronic Health Records , 2017, 2017 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[52]  David C. Kale,et al.  Do no harm: a roadmap for responsible machine learning for health care , 2019, Nature Medicine.

[53]  Harini Suresh,et al.  Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU , 2018, KDD.

[54]  Oscar Perez-Concha,et al.  A Deep Representation of Longitudinal EMR Data Used for Predicting Readmission to the ICU and Describing Patients-at-Risk , 2019, ArXiv.

[55]  Amir H. Payberah,et al.  Deep learning for electronic health records: A comparative review of multiple deep neural architectures , 2020, J. Biomed. Informatics.

[56]  Carl Doersch,et al.  Tutorial on Variational Autoencoders , 2016, ArXiv.

[57]  Jian Tang,et al.  Learning the Joint Representation of Heterogeneous Temporal Events for Clinical Endpoint Prediction , 2018, AAAI.

[58]  Nigam H. Shah,et al.  Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data , 2020, ArXiv.

[59]  Fei Wang,et al.  Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis , 2018, JMIR medical informatics.

[60]  Dongha Lee,et al.  Harmonized representation learning on dynamic EHR graphs , 2020, J. Biomed. Informatics.

[61]  Ronald M. Summers,et al.  The future of digital health with federated learning , 2020, npj Digital Medicine.

[62]  Walter Daelemans,et al.  Patient representation learning and interpretable evaluation using clinical notes , 2018, J. Biomed. Informatics.

[63]  Ziqian Xie,et al.  Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction , 2020, npj Digital Medicine.

[64]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[65]  Le Zhang,et al.  An Effective Patient Representation Learning for Time-series Prediction Tasks Based on EHRs , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[66]  Andreas Spanias,et al.  Attend and Diagnose: Clinical Time Series Analysis using Attention Models , 2017, AAAI.

[67]  Bruce R. Rosen,et al.  Distributed deep learning networks among institutions for medical imaging , 2018, J. Am. Medical Informatics Assoc..

[68]  Xiaolei Xie,et al.  Prediction task guided representation learning of medical codes in EHR , 2018, J. Biomed. Informatics.

[69]  Ping Zhang,et al.  Risk Prediction with Electronic Health Records: A Deep Learning Approach , 2016, SDM.

[70]  James H. Harrison,et al.  Patient2Vec: A Personalized Interpretable Deep Representation of the Longitudinal Electronic Health Record , 2018, IEEE Access.

[71]  Fei Wang,et al.  Readmission prediction via deep contextual embedding of clinical concepts , 2018, PloS one.

[72]  C. Jack,et al.  Alzheimer's Disease Neuroimaging Initiative , 2008 .

[73]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[74]  Benjamin C. M. Fung,et al.  Learning Phenotypes and Dynamic Patient Representations via RNN Regularized Collective Non-Negative Tensor Factorization , 2019, AAAI.

[75]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[76]  Weiqing Wang,et al.  MedGraph: Structural and Temporal Representation Learning of Electronic Medical Records , 2019, ECAI.

[77]  Kazem Rahimi,et al.  BEHRT: Transformer for Electronic Health Records , 2019, Scientific Reports.

[78]  Ratna Babu Chinnam,et al.  Representation Learning with Autoencoders for Electronic Health Records: A Comparative Study , 2018, ArXiv.

[79]  Fei Wang,et al.  Health-ATM: A Deep Architecture for Multifaceted Patient Health Record Representation and Risk Prediction , 2018, SDM.

[80]  Huan He,et al.  Distributed Tensor Decomposition for Large Scale Health Analytics , 2019, WWW.

[81]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[82]  Björn Nilsson,et al.  A Framework for Regularized Non-Negative Matrix Factorization, with Application to the Analysis of Gene Expression Data , 2012, PloS one.

[83]  Le Song,et al.  GRAM: Graph-based Attention Model for Healthcare Representation Learning , 2016, KDD.

[84]  Jakub Konecný,et al.  Federated Optimization: Distributed Optimization Beyond the Datacenter , 2015, ArXiv.

[85]  Walter Daelemans,et al.  Unsupervised patient representations from clinical notes with interpretable classification decisions , 2017, ArXiv.

[86]  Xiaoqian Jiang,et al.  Privacy Preserving Federated Big Data Analysis , 2018 .

[87]  Xiaoqian Jiang,et al.  Distributed learning from multiple EHR databases: Contextual embedding models for medical events , 2019, J. Biomed. Informatics.

[88]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[89]  Roger G. Mark,et al.  Analyzing the eICU Collaborative Research Database , 2017, BCB.

[90]  Fei Wang,et al.  Should Health Care Demand Interpretable Artificial Intelligence or Accept “Black Box” Medicine? , 2019, Annals of Internal Medicine.

[91]  Fei Wang,et al.  Towards heterogeneous temporal clinical event pattern discovery: a convolutional approach , 2012, KDD.

[92]  Mathias Niepert,et al.  Learning Convolutional Neural Networks for Graphs , 2016, ICML.

[93]  K. K. Mueen Ahmed,et al.  Zotero: A bibliographic assistant to researcher , 2011, Journal of pharmacology & pharmacotherapeutics.

[94]  Olivier Gevaert,et al.  Deep learning with multimodal representation for pancancer prognosis prediction , 2019, Bioinform..

[95]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[96]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[97]  Nigam H. Shah,et al.  The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data , 2019, PSB.

[98]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[99]  Andrew M. Dai,et al.  Improved Hierarchical Patient Classification with Language Model Pretraining over Clinical Notes , 2019 .

[100]  Ankit Singh Rawat,et al.  Are Transformers universal approximators of sequence-to-sequence functions? , 2020, ICLR.

[101]  Weiqing Wang,et al.  $\mathtt{MedGraph:}$ Structural and Temporal Representation Learning of Electronic Medical Records , 2019 .

[102]  Jimeng Sun,et al.  RAIM: Recurrent Attentive and Intensive Model of Multimodal Patient Monitoring Data , 2018, KDD.

[103]  Fei Wang,et al.  TaGiTeD: Predictive Task Guided Tensor Decomposition for Representation Learning from Electronic Health Records , 2017, AAAI.

[104]  Fei Wang,et al.  Data-Driven Subtyping of Parkinson’s Disease Using Longitudinal Clinical Records: A Cohort Study , 2019, Scientific Reports.

[105]  Daniel Svozil,et al.  Introduction to multi-layer feed-forward neural networks , 1997 .

[106]  Shanshan Zhang,et al.  Interpretable Representation Learning for Healthcare via Capturing Disease Progression through Time , 2018, KDD.

[107]  Egoitz Laparra,et al.  Rethinking domain adaptation for machine learning over clinical language , 2020, JAMIA open.

[108]  Nick C Fox,et al.  The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods , 2008, Journal of magnetic resonance imaging : JMRI.

[109]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[110]  Jeffrey Dean,et al.  Scalable and accurate deep learning with electronic health records , 2018, npj Digital Medicine.

[111]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[112]  Mehul Motani,et al.  Learning Deep Representations from Heterogeneous Patient Data for Predictive Diagnosis , 2017, BCB.

[113]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[114]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[115]  Zhiting Hu,et al.  Learning Hierarchical Representations of Electronic Health Records for Clinical Outcome Prediction , 2019, AMIA.

[116]  Fenglong Ma,et al.  Deep Patient Similarity Learning for Personalized Healthcare , 2018, IEEE Transactions on NanoBioscience.

[117]  Fenglong Ma,et al.  Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks , 2017, KDD.

[118]  Majid Sarrafzadeh,et al.  Unsupervised Representation for EHR Signals and Codes as Patient Status Vector , 2019, ArXiv.

[119]  Jiayu Zhou,et al.  MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records , 2019, KDD.

[120]  Xiaoqian Jiang,et al.  SAFETY: Secure gwAs in Federated Environment through a hYbrid Solution , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[121]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[122]  Eran Segal,et al.  Axes of a revolution: challenges and promises of big data in healthcare , 2020, Nature Medicine.

[123]  Timothy Miller,et al.  Toward a clinical text encoder: pretraining for clinical natural language processing with applications to substance misuse , 2019, J. Am. Medical Informatics Assoc..

[124]  Özlem Uzuner,et al.  Viewpoint Paper: Recognizing Obesity and Comorbidities in Sparse Data , 2009, J. Am. Medical Informatics Assoc..

[125]  Dmitriy Dligach,et al.  Two-stage Federated Phenotyping and Patient Representation Learning , 2019, BioNLP@ACL.

[126]  Jimeng Sun,et al.  RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism , 2016, NIPS.

[127]  Hossam M. Hammady,et al.  Rayyan—a web and mobile app for systematic reviews , 2016, Systematic Reviews.

[128]  Peter Szolovits,et al.  Clinical Intervention Prediction and Understanding with Deep Neural Networks , 2017, MLHC.