Using Machine Learning Models to Predict the Length of Stay in a Hospital Setting

Accurate prediction of hospital Length of Stay (LOS) has become increasingly important in recent years. LOS prediction helps hospitals provide better services, manage their resources and control their costs. In this paper, we implement and compare two Machine Learning (ML) methods, Random Forest (RF) and Gradient Boosting (GB), using an openly available dataset. The data are first preprocessed by combining data transformation, data standardization and data codification. The RF and GB models are then trained, with a hyperparameter tuning phase to select the best-performing settings. Finally, the Mean Absolute Error (MAE), the R-squared (\(R^{2}\)) and the Adjusted R-squared (Adjusted \(R^{2}\)) metrics are used to evaluate the tuned models.
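The three evaluation metrics named above can be illustrated with a minimal sketch in plain Python. It assumes that MAE denotes the mean absolute error and uses the standard textbook definitions of \(R^{2}\) and Adjusted \(R^{2}\). The function names, the toy LOS values and the predictor count `n_features` are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average absolute difference between observed and predicted LOS."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def adjusted_r_squared(
    y_true: Sequence[float], y_pred: Sequence[float], n_features: int
) -> float:
    """R-squared penalized for the number of predictors (n_features)."""
    n = len(y_true)
    r2 = r_squared(y_true, y_pred)
    return 1.0 - (1.0 - r2) * (n - 1) / (n - n_features - 1)


if __name__ == "__main__":
    # Hypothetical observed and predicted lengths of stay, in days.
    observed = [2.0, 4.0, 6.0, 8.0, 10.0]
    predicted = [3.0, 4.0, 5.0, 8.0, 11.0]
    print("MAE:", mean_absolute_error(observed, predicted))
    print("R2:", r_squared(observed, predicted))
    print("Adjusted R2:", adjusted_r_squared(observed, predicted, n_features=2))
```

Adjusted \(R^{2}\) is only defined when the number of samples exceeds `n_features + 1`, and \(R^{2}\) is undefined when all observed values are identical, so a real evaluation would need to guard against both cases.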
