Personalized survival predictions via Trees of Predictors: An application to cardiac transplantation

Background: Risk prediction is crucial in many areas of medical practice, such as cardiac transplantation, but existing clinical risk-scoring methods have suboptimal performance. We develop a novel risk prediction algorithm and test its performance on the database of all patients who were registered for cardiac transplantation in the United States during 1985-2015.

Methods and findings: We develop a new, interpretable methodology (ToPs: Trees of Predictors) built on the principle that specific predictive (survival) models should be used for specific clusters within the patient population. ToPs discovers these specific clusters and the specific predictive model that performs best for each cluster. In comparison with existing clinical risk-scoring methods and state-of-the-art machine learning methods, our method provides significant improvements in survival predictions, both post- and pre-cardiac transplantation. For instance, in terms of 3-month survival post-transplantation, our method achieves an AUC of 0.660; the best clinical risk-scoring method (RSS) achieves 0.587. In terms of 3-year survival/mortality predictions post-transplantation (in comparison to RSS), holding specificity at 80.0%, our algorithm correctly predicts survival for 2,442 (14.0%) more patients (of 17,441 who actually survived); holding sensitivity at 80.0%, our algorithm correctly predicts mortality for 694 (13.0%) more patients (of 5,339 who did not survive). ToPs achieves similar improvements for other time horizons and for predictions pre-transplantation. ToPs discovers the most relevant features (covariates), uses available features to best advantage, and can adapt to changes in clinical practice.

Conclusions: We show that, in comparison with existing clinical risk-scoring methods and other machine learning methods, ToPs significantly improves survival predictions both post- and pre-cardiac transplantation. ToPs provides a more accurate, personalized approach to survival prediction that can benefit patients, clinicians, and policymakers in making clinical decisions and setting clinical policy. Because survival prediction is widely used in clinical decision-making across diseases and clinical specialties, the implications of our methods are far-reaching.
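The fixed-specificity comparison reported in the abstract can be illustrated with a small sketch. It assumes that survival is the positive class and that a higher score means a higher predicted chance of survival; the function name, the thresholding rule, and the toy data are illustrative assumptions, not the authors' evaluation code.

```python
import math


def sensitivity_at_specificity(scores, labels, target_specificity=0.80):
    """Return (threshold, sensitivity) at a fixed minimum specificity.

    Assumptions (not taken from the paper): labels use 1 = survived
    (positive) and 0 = died (negative); a patient is predicted to survive
    when score > threshold.
    """
    negatives = sorted(s for s, y in zip(scores, labels) if y == 0)
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not negatives or not positives:
        raise ValueError("need at least one patient in each class")

    # Number of non-survivors that must fall at or below the threshold
    # so that specificity is at least the target.
    k = math.ceil(target_specificity * len(negatives))
    threshold = negatives[k - 1] if k > 0 else float("-inf")

    # Sensitivity: fraction of actual survivors predicted to survive.
    sensitivity = sum(s > threshold for s in positives) / len(positives)
    return threshold, sensitivity


# Toy data, purely illustrative.
toy_scores = [0.1, 0.2, 0.3, 0.4, 0.9, 0.35, 0.5, 0.6, 0.7, 0.95]
toy_labels = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]
toy_threshold, toy_sensitivity = sensitivity_at_specificity(toy_scores, toy_labels)
```

Running the same function on two models' scores for the same patients, then multiplying the difference in sensitivity by the number of actual survivors, gives a count of additional correctly predicted survivors in the style the abstract reports. Swapping the roles of the two classes gives the fixed-sensitivity comparison.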
