Predicting inpatient flow at a major hospital using interpretable analytics

Problem definition: Turn raw data from Electronic Health Records into accurate predictions on patient flows and inform daily decision-making at a major hospital. Practical Relevance: In a hospital environment under increasing financial and operational stress, forecasts on patient demand patterns could help match capacity and demand and improve hospital operations. Methodology: We use data from 63,432 admissions at a large academic hospital (50.0% female, median age 64 years old, median length-of-stay 3.12 days). We construct an expertise-driven patient representation on top of their EHR data and apply a broad class of machine learning methods to predict several aspects of patient flows. Results: With a unique patient representation, we estimate short-term discharges, identify long-stay patients, predict discharge destination and anticipate flows in and out of intensive care units with accuracy in the 80%+ range. More importantly, we implement this machine learning pipeline into the EHR system of the hospital and construct prediction-informed dashboards to support daily bed placement decisions. Managerial Implications: Our study demonstrates that interpretable machine learning techniques combined with EHR data can be used to provide visibility on patient flows. Our approach provides an alternative to deep learning techniques which is equally accurate, interpretable, frugal in data and computational power, and production-ready.

[1]  S. Tamang,et al.  Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data , 2018, JAMA internal medicine.

[2]  Philip Troy,et al.  Using simulation to determine the need for ICU beds for surgery patients. , 2009, Surgery.

[3]  Ting Zhu,et al.  Time-Series Approaches for Forecasting the Number of Hospital Daily Discharged Inpatients , 2017, IEEE Journal of Biomedical and Health Informatics.

[4]  Ting Zhu,et al.  Time-Series Approaches for Forecasting the Number of Hospital Daily Discharged Inpatients. , 2017 .

[5]  Qiang Li,et al.  Region compatibility based stability assessment for decision trees , 2018, Expert Syst. Appl..

[6]  Jeffrey Dean,et al.  Scalable and accurate deep learning with electronic health records , 2018, npj Digital Medicine.

[7]  A. Forster,et al.  The TEND (Tomorrow’s Expected Number of Discharges) Model Accurately Predicted the Number of Patients Who Were Discharged from the Hospital the Next Day , 2017, Journal of hospital medicine.

[8]  Andreas Holzinger,et al.  Data Mining with Decision Trees: Theory and Applications , 2015, Online Inf. Rev..

[9]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[10]  Xiaowu Sun,et al.  Using electronic health record data to develop inpatient mortality predictive model: Acute Laboratory Risk of Mortality Score (ALaRMS) , 2013, J. Am. Medical Informatics Assoc..

[11]  BertsimasDimitris,et al.  From predictive methods to missing data imputation , 2017 .

[12]  Zahra Mirzamomen,et al.  A framework to induce more stable decision trees for pattern classification , 2017, Pattern Analysis and Applications.

[13]  Mohamed Bader-El-Den,et al.  Patient length of stay and mortality prediction: A survey , 2017, Health services management research.

[14]  Nilmini Wickramasinghe,et al.  Deepr: A Convolutional Net for Medical Records , 2016, ArXiv.

[15]  M. Lechner The Estimation of Causal Effects by Difference-in-Difference Methods , 2011 .

[16]  M. Mazumdar,et al.  Effect of Emergency Department and ICU Occupancy on Admission Decisions and Outcomes for Critically Ill Patients* , 2018, Critical care medicine.

[17]  Dimitris Bertsimas,et al.  From Predictive Methods to Missing Data Imputation: An Optimization Approach , 2017, J. Mach. Learn. Res..

[18]  Elisa F. Long,et al.  The Boarding Patient: Effects of ICU and Hospital Occupancy Surges on Patient Flow , 2018, Production and operations management.

[19]  Li Luo,et al.  Short-term forecasting of hospital discharge volume based on time series analysis , 2017, 2017 IEEE 19th International Conference on e-Health Networking, Applications and Services (Healthcom).

[20]  Carri W. Chan,et al.  Association Among ICU Congestion, ICU Admission Decision, and Patient Outcomes* , 2016, Critical care medicine.

[21]  Edilson F. Arruda,et al.  DEMAND FORECAST AND OPTIMAL PLANNING OF INTENSIVE CARE UNIT (ICU) CAPACITY , 2017 .

[22]  Patrick Riley,et al.  Three pitfalls to avoid in machine learning , 2019, Nature.

[23]  Andrew McCallum,et al.  Energy and Policy Considerations for Deep Learning in NLP , 2019, ACL.

[24]  Sean L. Barnes,et al.  Real-time prediction of inpatient length of stay for discharge prioritization , 2016, J. Am. Medical Informatics Assoc..

[25]  Mark T. Seelen,et al.  Development and Validation of a Machine Learning Model to Aid Discharge Processes for Inpatient Surgical Care , 2019, JAMA network open.

[26]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[27]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[28]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[29]  Kenneth D. Mandl,et al.  SMART on FHIR: a standards-based, interoperable apps platform for electronic health records , 2016, J. Am. Medical Informatics Assoc..

[30]  Jean-Philippe Vert,et al.  Consistency of Random Forests , 2014, 1405.2881.

[31]  G. Escobar,et al.  Length of Stay Predictions: Improvements Through the Use of Automated Laboratory and Comorbidity Variables , 2010, Medical care.

[32]  L. Breiman Heuristics of instability and stabilization in model selection , 1996 .

[33]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[34]  Dimitris Bertsimas,et al.  Optimal classification trees , 2017, Machine Learning.

[35]  Thomas H. McCoy,et al.  Assessment of Time-Series Machine Learning Methods for Forecasting Hospital Discharge Volume , 2018, JAMA network open.