Predictive risk modelling for early hospital readmission of patients with diabetes in India

Hospital readmission is an important contributor to total medical expenditure and is an emerging indicator of quality of care. The goal of this study is to analyze key factors using machine learning methods and patients’ medical records of a reputed Indian hospital which impact the all-purpose readmission of a patient with diabetes and compare different classification models that predict readmission and evaluate the best model. This study classified the patients into two different risk groups of readmission (Yes or No) within 30 days of discharge based on patients’ characteristics using 2-year clinical and administrative data. It proposed an architecture of this prediction model and identified various risk factors using text mining techniques. Also, groups of consistently occurring factors that inference readmission rates were revealed by associative rule mining. It then evaluated the classification accuracy using five different data mining classifiers and conducted cost analysis. Out of total 9381 records, 1211 (12.9 %) encounters were found as readmissions. This study found that risk factors like hospital department where readmission happens, history of recent prior hospitalization and length of stay are strong predictors of readmission. Random forest was found to be the optimal classifier for this task using the evaluation metric area under precision-recall curve (0.296). From the cost analysis, it is observed that a cost of INR 15.92 million can be saved for 9381 instances of diabetic patient encounters. This work, the first such study done from Indian Healthcare perspective, built a model to predict the risk of readmission within 30 days of discharge for diabetes. This study concludes that the model could be incorporated in healthcare institutions to witness its effectiveness. Cost analysis shows huge savings which is significant for any healthcare system especially in developing countries like India.

[1]  A Clarke,et al.  Readmission to hospital: a measure of quality or outcome? , 2004, Quality and Safety in Health Care.

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  P. Donnan,et al.  Development and validation of a model for predicting emergency admissions over the next year (PEONY): a UK historical cohort study. , 2008, Archives of internal medicine.

[4]  Sunil Kumar Khatri,et al.  Impact of selected pre-processing techniques on prediction of risk of early readmission for diabetic patients in India , 2016, International Journal of Diabetes in Developing Countries.

[5]  Graham J. Williams,et al.  Data Mining , 2000, Communications in Computer and Information Science.

[6]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[7]  Donald E. Brown,et al.  Random Forests on Ubiquitous Data for Heart Failure 30-Day Readmissions Prediction , 2013, 2013 12th International Conference on Machine Learning and Applications.

[8]  Sunil Kumar Khatri,et al.  Improving patient matching: Single patient view for Clinical Decision Support using Big Data analytics , 2015, 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions).

[9]  John Billings,et al.  Development of a predictive model to identify inpatients at risk of re-admission within 30 days of discharge (PARR-30) , 2012, BMJ Open.

[10]  E. Rackow Rehospitalizations among patients in the Medicare fee-for-service program. , 2009, The New England journal of medicine.

[11]  Naren Meadem Exploring Preprocessing Techniques for Prediction of Risk of Readmission for Congestive Heart Failure Patients , 2013 .

[12]  Malladihalli S. Bhuvan,et al.  Identifying Diabetic Patients with High Risk of Readmission , 2016, ArXiv.

[13]  Amanda H. Salanitro,et al.  Risk prediction models for hospital readmission: a systematic review. , 2011, JAMA.

[14]  K. Dungan,et al.  The Effect of Diabetes on Hospital Readmissions , 2012, Journal of diabetes science and technology.

[15]  D. Lindberg,et al.  The Unified Medical Language System , 1993, Methods of Information in Medicine.

[16]  Maria Yu,et al.  Predictors of 30 day hospital readmission in patients with type 2 diabetes: a retrospective, case–control, database study , 2015, Current medical research and opinion.

[17]  Lei Lei,et al.  A Review of Missing Data Treatment Methods , 2005 .

[18]  Bernard Friedman,et al.  Multiple hospitalizations for patients with diabetes. , 2003, Diabetes care.

[19]  Beata Strack,et al.  Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records , 2014, BioMed research international.

[20]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[21]  Sherita Hill Golden,et al.  Early readmission among patients with diabetes: a qualitative assessment of contributing factors. , 2014, Journal of diabetes and its complications.

[22]  Kevin Donnelly,et al.  SNOMED-CT: The advanced terminology and coding system for eHealth. , 2006, Studies in health technology and informatics.

[23]  Glenn Fung,et al.  Predicting Readmission Risk with Institution Specific Prediction Models , 2013, ICHI.

[24]  Phil Hider,et al.  The validity of readmission rate as a marker of the quality of hospital care, and a recommendation for its definition. , 2009, The New Zealand medical journal.

[25]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[26]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[27]  J. Ross,et al.  Scheduled and unscheduled hospital readmissions among patients with diabetes. , 2010, The American journal of managed care.

[28]  J. Schnipper,et al.  Potentially avoidable 30-day hospital readmissions in medical patients: derivation and validation of a prediction model. , 2013, JAMA internal medicine.