Hospital Readmission Prediction using Discriminative patterns

Avoidable hospital readmission is problematic as it increases the burden on healthcare systems, leads to a shortage of hospital beds and impacts on the costs of healthcare. Various machine learning algorithms have been applied to predict patient readmissions. However, existing literature has only focused on individual features of health conditions without consideration of associations between features. This paper proposes discriminative pattern-based features as a technique to improve readmission prediction. First, discriminative patterns that occur disproportionately between two classes: readmission and non-readmission, were generated based on hospital electronic health records. Second, the patterns were fed as features into a classification model for readmission prediction. Then, we have evaluated these discriminative pattern-based features in three datasets: diabetes, chronic kidney disease and all diseases. Experiments with each dataset showed that the features of chronic disease cohorts have fewer differences between the readmission and the non-readmission classes than the all-diseases cohort. Our proposed pattern-based model improved the prediction performance in terms of AUC (Area Under the receiver operating characteristic curve) by about 12% compared with the baseline models for the all-disease cohort, however, it showed little improvement for either diabetes or chronic kidney disease datasets.

[1]  C. Steiner,et al.  Comorbidity measures for use with administrative data. , 1998, Medical care.

[2]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[3]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[4]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[5]  Stephen D. Bay,et al.  Detecting Group Differences: Mining Contrast Sets , 2001, Data Mining and Knowledge Discovery.

[6]  Philip S. Yu,et al.  Direct Discriminative Pattern Mining for Effective Classification , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[7]  Y. Tabak,et al.  An Automated Model to Identify Heart Failure Patients at Risk for 30-Day Readmission or Death Using Electronic Medical Record Data , 2010, Medical care.

[8]  P. Austin,et al.  Derivation and validation of an index to predict early death or unplanned readmission after discharge from hospital to the community , 2010, Canadian Medical Association Journal.

[9]  Carl van Walraven,et al.  Predicting the risk of unplanned readmission or death within 30 days of discharge after a heart failure hospitalization. , 2012, American heart journal.

[10]  Manish Gupta,et al.  Mining Low-Support Discriminative Patterns from Dense and High-Dimensional Data , 2012, IEEE Transactions on Knowledge and Data Engineering.

[11]  Beata Strack,et al.  Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records , 2014, BioMed research international.

[12]  Jun Wu,et al.  Mining Conditional Phosphorylation Motifs , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[13]  Sang Won Yoon,et al.  Predictive modeling of hospital readmissions using metaheuristics and data mining , 2015, Expert Syst. Appl..

[14]  Joseph Futoma,et al.  A comparison of models for predicting early hospital readmissions , 2015, J. Biomed. Informatics.

[15]  S. Dhaliwal,et al.  Utility of models to predict 28-day or 30-day unplanned hospital readmissions: an updated systematic review , 2016, BMJ Open.

[16]  Chad Bradford,et al.  Patient and clinical characteristics that heighten risk for heart failure readmission , 2017, Research in social & administrative pharmacy : RSAP.

[17]  I. Abbass,et al.  Predictors of all-cause 30 day readmission among Medicare patients with type 2 diabetes , 2017, Current medical research and opinion.

[18]  W. Hoy,et al.  CKD.QLD: establishment of a chronic kidney disease [CKD] registry in Queensland, Australia , 2017, BMC Nephrology.

[19]  Amir H. Payberah,et al.  Predicting the risk of emergency admission with machine learning: Development and validation using linked electronic health records , 2018, PLoS medicine.

[20]  Yashar Maali,et al.  Predicting 7-day, 30-day and 60-day all-cause unplanned readmission: a case study of a Sydney hospital , 2018, BMC Medical Informatics and Decision Making.

[21]  Yaochu Jin,et al.  An improved support vector machine-based diabetic readmission prediction , 2018, Comput. Methods Programs Biomed..

[22]  Fei Wang,et al.  Predictive Modeling of the Hospital Readmission Risk from Patients’ Claims Data Using Machine Learning: A Case Study on COPD , 2019, Scientific Reports.