Prediction of 30-Day All-Cause Readmissions in Patients Hospitalized for Heart Failure: Comparison of Machine Learning and Other Statistical Approaches

Importance Several attempts have been made at developing models to predict 30-day readmissions in patients with heart failure, but none have sufficient discriminatory capacity for clinical use. Machine-learning (ML) algorithms represent a novel approach and may have potential advantages over traditional statistical modeling. Objective To develop models using a ML approach to predict all-cause readmissions 30 days after discharge from a heart failure hospitalization and to compare ML model performance with models developed using “conventional” statistically based methods. Design, Setting, and Participants Models were developed using ML algorithms, specifically, a tree-augmented naive Bayesian network, a random forest algorithm, and a gradient-boosted model and compared with traditional statistical methods using 2 independently derived logistic regression models (a de novo model and an a priori model developed using electronic health records) and a least absolute shrinkage and selection operator method. The study sample was randomly divided into training (70%) and validation (30%) sets to develop and test model performance. This was a registry-based study, and the study sample was obtained by linking patients from the Get With the Guidelines Heart Failure registry with Medicare data. After applying appropriate inclusion and exclusion criteria, 56 477 patients were included in our analysis. The study was conducted between January 4, 2005, and December 1, 2010, and analysis of the data was conducted between November 25, 2014, and June 30, 2016. Main Outcomes and Measures C statistics were used for comparison of discriminatory capacity across models in the validation sample. Results The overall 30-day rehospitalization rate was 21.2% (11 959 of 56 477 patients). For the tree-augmented naive Bayesian network, random forest, gradient-boosted, logistic regression, and least absolute shrinkage and selection operator models, C statistics for the validation sets were similar: 0.618, 0.607, 0.614, 0.624, and 0.618, respectively. Applying the previously validated electronic health records model to our study sample yielded a C statistic of 0.589 for the validation set. Conclusions and Relevance Use of a number of ML algorithms did not improve prediction of 30-day heart failure readmissions compared with more traditional prediction models. Although there will likely be further applications of ML approaches in prognostic modeling, our study fits within the literature of limited predictive ability for heart failure readmissions.

[1]  Y Wang,et al.  Predictors of readmission among elderly survivors of admission with heart failure. , 2000, American heart journal.

[2]  Li Liang,et al.  Validated, electronic health record deployable prediction models for assessing patient risk of 30-day rehospitalization and mortality in older heart failure patients. , 2013, JACC. Heart failure.

[3]  Leora I. Horwitz,et al.  Diagnoses and timing of 30-day readmissions after hospitalization for heart failure, acute myocardial infarction, or pneumonia. , 2013, JAMA.

[4]  E. Philbin,et al.  Prediction of hospital readmission for heart failure: development of a simple risk score based on administrative data. , 1999, Journal of the American College of Cardiology.

[5]  I. Piña,et al.  National Survey of Hospital Strategies to Reduce Heart Failure Readmissions: Findings From the Get With the Guidelines-Heart Failure Registry , 2012, Circulation. Heart failure.

[6]  Sharon-Lise T. Normand,et al.  An Administrative Claims Measure Suitable for Profiling Hospital Performance on the Basis of 30-Day All-Cause Readmission Rates Among Patients With Heart Failure , 2008, Circulation. Cardiovascular quality and outcomes.

[7]  M. Drazner,et al.  2013 ACCF/AHA guideline for the management of heart failure: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines. , 2013, Journal of the American College of Cardiology.

[8]  G. Fonarow,et al.  The Prevention of Hospital Readmissions in Heart Failure. , 2016, Progress in cardiovascular diseases.

[9]  Harlan M Krumholz,et al.  Do Non-Clinical Factors Improve Prediction of Readmission Risk?: Results From the Tele-HF Study. , 2016, JACC. Heart failure.

[10]  Adrian F Hernandez,et al.  Linking inpatient clinical registry data to Medicare claims data using indirect identifiers. , 2009, American heart journal.

[11]  Lucila Ohno-Machado,et al.  Risk stratification in heart failure using artificial neural networks , 2000, AMIA.

[12]  C. Yancy,et al.  Incremental Value of Clinical Data Beyond Claims Data in Predicting 30-Day Outcomes After Heart Failure Hospitalization , 2011, Circulation. Cardiovascular quality and outcomes.

[13]  Y. Tabak,et al.  An Automated Model to Identify Heart Failure Patients at Risk for 30-Day Readmission or Death Using Electronic Medical Record Data , 2010, Medical care.

[14]  Amanda H. Salanitro,et al.  Risk prediction models for hospital readmission: a systematic review. , 2011, JAMA.