Incorporating temporal EHR data in predictive models for risk stratification of renal function deterioration

Predictive models built using temporal data in electronic health records (EHRs) can potentially play a major role in improving management of chronic diseases. However, these data present a multitude of technical challenges, including irregular sampling of data and varying length of available patient history. In this paper, we describe and evaluate three different approaches that use machine learning to build predictive models using temporal EHR data of a patient. The first approach is a commonly used non-temporal approach that aggregates values of the predictors in the patient's medical history. The other two approaches exploit the temporal dynamics of the data. The two temporal approaches vary in how they model temporal information and handle missing data. Using data from the EHR of Mount Sinai Medical Center, we learned and evaluated the models in the context of predicting loss of estimated glomerular filtration rate (eGFR), the most common assessment of kidney function. Our results show that incorporating temporal information in patient's medical history can lead to better prediction of loss of kidney function. They also demonstrate that exactly how this information is incorporated is important. In particular, our results demonstrate that the relative importance of different predictors varies over time, and that using multi-task learning to account for this is an appropriate way to robustly capture the temporal dynamics in EHR data. Using a case study, we also demonstrate how the multi-task learning based model can yield predictive models with better performance for identifying patients at high risk of short-term loss of kidney function.

[1]  R. Garrick,et al.  A Predictive Model for Progression of Chronic Kidney Disease to Kidney Failure , 2012 .

[2]  R. Schrier,et al.  Renal failure in cirrhosis. , 2009, The New England journal of medicine.

[3]  Josef Coresh,et al.  Conceptual model of CKD: applications and implications. , 2009, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[4]  Wei-Hung Chen,et al.  Adults With Late Stage 3 Chronic Kidney Disease Are at High Risk for Prevalent Silent Brain Infarction: A Population-Based Study , 2011, Stroke.

[5]  Cheng Wang,et al.  A New Equation to Estimate Glomerular Filtration Rate in Chinese Elderly Population , 2013, PloS one.

[6]  D. Small,et al.  Methods for Estimating Kidney Disease Stage Transition Probabilities Using Electronic Medical Records , 2013, EGEMS.

[7]  S. Birman,et al.  Supplementary Table S2 , 2012 .

[8]  A. Kengne,et al.  Risk Models to Predict Chronic Kidney Disease and Its Progression: A Systematic Review , 2012, PLoS medicine.

[9]  Akshay S. Desai,et al.  Association between cardiac biomarkers and the development of ESRD in patients with type 2 diabetes mellitus, anemia, and CKD. , 2011, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[10]  Kim F. Nimon,et al.  Interpreting Multiple Linear Regression: A Guidebook of Variable Importance , 2012 .

[11]  Tudor Toma,et al.  Discovery and inclusion of SOFA score episodes in mortality prediction , 2007, J. Biomed. Informatics.

[12]  Melania Pintilie,et al.  CKD stage at nephrology referral and factors influencing the risks of ESRD and death. , 2014, American journal of kidney diseases : the official journal of the National Kidney Foundation.

[13]  Milos Hauskrecht,et al.  Modeling Clinical Time Series Using Gaussian Process Sequences , 2013, SDM.

[14]  C. Schmid,et al.  A new equation to estimate glomerular filtration rate. , 2009, Annals of internal medicine.

[15]  Josef Coresh,et al.  Chronic kidney disease , 2012, The Lancet.

[16]  B. Stengel,et al.  Chronic kidney disease and cancer: a troubling connection. , 2010, Journal of nephrology.

[17]  Tudor Toma,et al.  Learning predictive models that use pattern discovery - A bootstrap evaluative approach applied in organ functioning sequences , 2010, J. Biomed. Informatics.

[18]  G H Neild,et al.  Community nephrology: audit of screening for renal insufficiency in a high risk population. , 1999, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[19]  Stian Lydersen,et al.  Combining GFR and albuminuria to classify CKD improves prediction of ESRD. , 2009, Journal of the American Society of Nephrology : JASN.