Data Mining for Generalizable Pre-admission Prediction of Elective Length of Stay

An elective admission involves an hospital admission for which a doctor schedules the day and time of the visit in advance. That is, it is not an emergency or urgent admission. Factors in the scheduling of such admissions and also resource impacts include bed availability and utilization, and as such it is of benefit to be able to predict in advance the length of stay of patients. In this work we report on the use of data mining on a large, cross-provider dataset to develop potentially generalizable machine learning models to predict all- condition, high length of stay for elective admissions, prior to the time of admission based upon patient, admitting condition, hospital and locale attributes. The best of these models achieves an AUC score of 0.900. We also carry out evaluations to consider the generalizability of such models via measuring their predictive performance on a dataset from a different time period, and the models demonstrate that they maintain equivalent predictive levels on data from a different period.

[1]  Robert Steele,et al.  Personal health record architectures: Technology infrastructure implications and dependencies , 2012, J. Assoc. Inf. Sci. Technol..

[2]  Robert Steele,et al.  Predicting Hospital Length of Stay Using Neural Networks on MIMIC III Data , 2017, 2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 3rd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[3]  Ioannis A. Kakadiaris,et al.  A Comparison of Supervised Machine Learning Techniques for Predicting Short-Term In-Hospital Length of Stay among Diabetic Patients , 2014, 2014 13th International Conference on Machine Learning and Applications.

[4]  Pavel Brazdil,et al.  Factors influencing hospital high length of stay outliers , 2012, BMC Health Services Research.

[5]  Peyman Rezaei Hachesu,et al.  Use of Data Mining Techniques to Determine and Predict Length of Stay of Cardiac Patients , 2013, Healthcare informatics research.

[6]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[7]  S. Leinhardt,et al.  The cost and length of a hospital stay. , 1976, Inquiry : a journal of medical care organization, provision and financing.

[8]  Rachid Benlamri,et al.  Application of Data Mining Techniques to Predict the Length of Stay of Hospitalized Patients with Diabetes , 2018, 2018 4th International Conference on Big Data Innovations and Applications (Innovate-Data).

[9]  Abul Hasanat,et al.  Application of data mining techniques to predict length of stay of stroke patients , 2017, 2017 International Conference on Informatics, Health & Technology (ICIHT).

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  Karim O. Elish,et al.  Machine Learning-Based Prediction of Prolonged Length of Stay in Newborns , 2018, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA).

[12]  Robert Steele,et al.  Elderly persons' perception and acceptance of using wireless sensor networks to assist healthcare , 2009, Int. J. Medical Informatics.

[13]  Lola B Chambless,et al.  Predicting Inpatient Length of Stay After Brain Tumor Surgery: Developing Machine Learning Ensembles to Improve Predictive Performance. , 2018, Neurosurgery.

[14]  Robert Steele,et al.  Machine Learning-based Pre-discharge Prediction of Hospital Readmission , 2018, 2018 International Conference on Computer, Information and Telecommunication Systems (CITS).

[15]  Anne Elixhauser,et al.  The healthcare cost and utilization project: an overview. , 2002, Effective clinical practice : ECP.

[16]  Michael L. Johnson,et al.  Predicting in-hospital mortality and hospital length of stay in diabetic patients , 2013 .