Predicting 30-day Hospital Readmission with Publicly Available Administrative Database

INTRODUCTION This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". BACKGROUND Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. OBJECTIVES Explore the use of conditional logistic regression to increase the prediction accuracy. METHODS We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. RESULTS The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures. CONCLUSIONS It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.

[1]  E Rocha,et al.  Prediction of hospital readmission for heart failure: development of a simple risk score based on administrative data. , 1999, Revista portuguesa de cardiologia : orgao oficial da Sociedade Portuguesa de Cardiologia = Portuguese journal of cardiology : an official journal of the Portuguese Society of Cardiology.

[2]  J. J. Holloway,et al.  Clinical and sociodemographic risk factors for readmission of Medicare beneficiaries , 1988, Health care financing review.

[3]  Harlan M Krumholz,et al.  Development, validation, and results of a measure of 30-day readmission following hospitalization for pneumonia. , 2011, Journal of hospital medicine.

[4]  Mark V. Williams,et al.  Rehospitalizations among patients in the Medicare fee-for-service program. , 2009, The New England journal of medicine.

[5]  P. Hider,et al.  The readmission rate as an indicator of the quality of elective surgical inpatient care for the elderly in New Zealand. , 2009, The New Zealand medical journal.

[6]  Leora I. Horwitz,et al.  Diagnoses and timing of 30-day readmissions after hospitalization for heart failure, acute myocardial infarction, or pneumonia. , 2013, JAMA.

[7]  J. W. Thomas Does risk-adjusted readmission rate provide valid information on hospital quality? , 1996, Inquiry : a journal of medical care organization, provision and financing.

[8]  Robert P Kocher,et al.  Hospital readmissions and the Affordable Care Act: paying for coordinated quality care. , 2011, JAMA.

[9]  Pratik J. Parikh,et al.  Acute kidney injury (AKI) and risk of readmissions in patients with heart failure. , 2012, The American journal of cardiology.

[10]  Harlan M Krumholz,et al.  The performance of US hospitals as reflected in risk-standardized 30-day mortality and readmission rates for medicare beneficiaries with pneumonia. , 2010, Journal of hospital medicine.

[11]  D M Buchner,et al.  Risk factors for early unplanned hospital readmission in the elderly , 1991, Journal of general internal medicine.

[12]  Doina Precup,et al.  Assessing the Predictability of Hospital Readmission Using Machine Learning , 2013, IAAI.

[13]  Michael J Fine,et al.  Causes and risk factors for rehospitalization of patients hospitalized with community-acquired pneumonia. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[14]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[15]  M. Silverstein,et al.  Risk Factors for 30-Day Hospital Readmission in Patients ≥65 Years of Age , 2008, Proceedings.

[16]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[17]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[18]  Eun Whan Lee Selecting the Best Prediction Model for Readmission , 2012, Journal of preventive medicine and public health = Yebang Uihakhoe chi.

[19]  Richard Goldstein,et al.  Regression Methods in Biostatistics: Linear, Logistic, Survival and Repeated Measures Models , 2006, Technometrics.

[20]  P. Austin,et al.  Derivation and validation of an index to predict early death or unplanned readmission after discharge from hospital to the community , 2010, Canadian Medical Association Journal.

[21]  W G Henderson,et al.  Predicting non-elective hospital readmissions: a multi-site study. Department of Veterans Affairs Cooperative Study Group on Primary Care and Readmissions. , 2000, Journal of clinical epidemiology.

[22]  Kurt M. Bretthauer,et al.  Reducing Hospital Readmissions by Integrating Empirical Prediction with Resource Optimization , 2016 .

[23]  Haya R Rubin,et al.  Comprehensive discharge planning with postdischarge support for older patients with congestive heart failure: a meta-analysis. , 2004, JAMA.

[24]  L. Chu,et al.  Risk Factors for Early Emergency Hospital Readmission in Elderly Medical Patients , 1999, Gerontology.

[25]  Amanda H. Salanitro,et al.  Risk prediction models for hospital readmission: a systematic review. , 2011, JAMA.

[26]  M. Desai,et al.  Statistical Models and Patient Predictors of Readmission for Acute Myocardial Infarction: A Systematic Review , 2009, Circulation. Cardiovascular quality and outcomes.

[27]  Sharon-Lise T. Normand,et al.  An Administrative Claims Measure Suitable for Profiling Hospital Performance on the Basis of 30-Day All-Cause Readmission Rates Among Patients With Heart Failure , 2008, Circulation. Cardiovascular quality and outcomes.

[28]  C M Ashton,et al.  The association between the quality of inpatient care and early readmission: a meta-analysis of the evidence. , 1997, Medical care.

[29]  Harlan M. Krumholz,et al.  An Administrative Claims Measure Suitable for Profiling Hospital Performance Based on 30-Day All-Cause Readmission Rates Among Patients With Acute Myocardial Infarction , 2011, Circulation. Cardiovascular quality and outcomes.

[30]  Mark V. Williams,et al.  Rehospitalizations among patients in the Medicare fee-for-service program. , 2009, The New England journal of medicine.

[31]  Javier Llorca,et al.  Prediction of 30-day cardiac-related-emergency-readmissions using simple administrative hospital data. , 2013, International journal of cardiology.

[32]  J. B. Martin,et al.  Identification of factors associated with hospital readmission and development of a predictive model. , 1992, Health services research.

[33]  Stephen A. Martin,et al.  A Reengineered Hospital Discharge Program to Decrease Rehospitalization , 2009, Annals of Internal Medicine.

[34]  T A Brennan,et al.  Factors associated with unplanned hospital readmission among patients 65 years of age and older in a Medicare managed care plan. , 1999, The American journal of medicine.

[35]  Glenn Fung,et al.  Predicting Readmission Risk with Institution Specific Prediction Models , 2013, 2013 IEEE International Conference on Healthcare Informatics.