A theoretical framework for research on readmission risk prediction

On the one hand, predictive analytics is an important field of research in Information Systems (IS); however, research on predictive analytics in healthcare is still scarce in IS literature. One area where predictive analytics can be of great benefit is with regard to unplanned readmissions. While a number of studies on readmission prediction already exists in related research areas, there are few guidelines to date on how to conduct such analytics projects. To address this gap the paper presents the general process to develop empirical models by Shmueli and Koppius (2011) and extends this to the specific requirements of readmission risk prediction. Based on a systematic literature review, the resulting process defines important aspects of readmission prediction. It also structures relevant questions and tasks that need to be taken care of in this context. This extension of the guidelines by Shmueli and Koppius (2011) provides a best practice as well as a template that can be used in future studies on readmission risk prediction, thus allowing for more comparable results across various research fields.

[1]  L. Resnik,et al.  Comorbidity Indices Versus Function as Potential Predictors of 30-Day Readmission in Older Patients Following Postacute Rehabilitation , 2017, The journals of gerontology. Series A, Biological sciences and medical sciences.

[2]  Tosha B. Wetterneck,et al.  Hospital Readmission in General Medicine Patients: A Prediction Model , 2009, Journal of General Internal Medicine.

[3]  Yijing Li,et al.  Learning from class-imbalanced data: Review of methods and applications , 2017, Expert Syst. Appl..

[4]  Mehdi Jamei,et al.  Predicting all-cause risk of 30-day hospital readmission using artificial neural networks , 2017, PloS one.

[5]  Quanquan Gu,et al.  Multi-task cox proportional hazard model for predicting risk of unplanned hospital readmission , 2017, Systems and Information Engineering Design Symposium.

[6]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[7]  Dimitris Kanellopoulos,et al.  Handling imbalanced datasets: A review , 2006 .

[8]  Rushi Longadge,et al.  Class Imbalance Problem in Data Mining Review , 2013, ArXiv.

[9]  John Billings,et al.  Choosing a model to predict hospital admission: an observational study of new variants of predictive models for case finding , 2013, BMJ Open.

[10]  H. Bitterman,et al.  Predicting 30-Day Readmissions With Preadmission Electronic Health Record Data , 2015, Medical care.

[11]  M. Bech,et al.  A roadmap for comparing readmission policies with application to Denmark, England, Germany and the United States. , 2015, Health policy.

[12]  C. Subbe,et al.  Patients’ and providers’ perceptions of the preventability of hospital readmission: a prospective, observational study in four European countries , 2017, BMJ Quality & Safety.

[13]  P. Austin,et al.  Derivation and validation of an index to predict early death or unplanned readmission after discharge from hospital to the community , 2010, Canadian Medical Association Journal.

[14]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[15]  Peri Rosenfeld,et al.  Early Discharge Planning and Improved Care Transitions: Pre-Admission Assessment for Readmission Risk in an Elective Orthopedic and Cardiovascular Surgical Population , 2016, International journal of integrated care.

[16]  Blaz Zupan,et al.  Predictive data mining in clinical medicine: Current issues and guidelines , 2008, Int. J. Medical Informatics.

[17]  J. Schnipper,et al.  Potentially avoidable 30-day hospital readmissions in medical patients: derivation and validation of a prediction model. , 2013, JAMA internal medicine.

[18]  S. Dhaliwal,et al.  Utility of models to predict 28-day or 30-day unplanned hospital readmissions: an updated systematic review , 2016, BMJ Open.

[19]  A. G. Asuero,et al.  The Correlation Coefficient: An Overview , 2006 .

[20]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[21]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[22]  Andrew K. C. Wong,et al.  Classification of Imbalanced Data: a Review , 2009, Int. J. Pattern Recognit. Artif. Intell..

[23]  A. Vitonis,et al.  Trends in readmission rate by route of hysterectomy – a single‐center experience , 2018, Acta obstetricia et gynecologica Scandinavica.

[24]  Derivation and validation of a diagnostic score based on case-mix groups to predict 30-day death or urgent readmission , 2012, Open medicine : a peer-reviewed, independent, open-access journal.

[25]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[26]  Erhard Rahm,et al.  Data Cleaning: Problems and Current Approaches , 2000, IEEE Data Eng. Bull..

[27]  Ping Yu,et al.  A Review of Data Quality Assessment Methods for Public Health Information Systems , 2014, International journal of environmental research and public health.

[28]  Examining the Drivers of Readmissions and Reducing Unnecessary Readmissions for Better Patient Care , 2022 .

[29]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[30]  Amanda H. Salanitro,et al.  Risk prediction models for hospital readmission: a systematic review. , 2011, JAMA.

[31]  P. Austin,et al.  Proportion of hospital readmissions deemed avoidable: a systematic review , 2011, Canadian Medical Association Journal.

[32]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[33]  Nitesh V. Chawla,et al.  SMOTEBoost: Improving Prediction of the Minority Class in Boosting , 2003, PKDD.

[34]  Manuel Graña,et al.  Predictive models for hospital readmission risk: A systematic review of methods , 2018, Comput. Methods Programs Biomed..

[35]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[36]  T. Heggestad,et al.  Do hospital length of stay and staffing ratio affect elderly patients' risk of readmission? A nation-wide study of Norwegian hospitals. , 2002, Health services research.

[37]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[38]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[39]  Ian A Scott,et al.  Preventing the rebound: improving care transition in hospital discharge processes. , 2010, Australian health review : a publication of the Australian Hospital Association.

[40]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[41]  Galit Shmueli,et al.  Predictive Analytics in Information Systems Research , 2010, MIS Q..