Real-time Machine Learning Alerts to Prevent Escalation of Care: A Pragmatic Clinical Trial

Importance: Automated machine learning algorithms have been shown to outperform older methods in predicting clinical deterioration requiring escalation of care, but rigorous prospective data on their real-world efficacy are limited. Objective: We hypothesized that real-time deterioration prediction alerts sent directly to front-line providers would reduce escalations. Design: Single-center prospective pragmatic clinical trial conducted from July 2019 to March 2020. The trial was terminated early due to the COVID-19 pandemic. Patients were followed for 30 days post-discharge. Setting: Academic tertiary care medical center located in New York City. Participants: All adult patients admitted to any of four medical-surgical units were included. Assignment to intervention or control arms was determined by initial unit admission. Intervention: Real-time alerts stratified according to predicted likelihood of clinical deterioration sent to the nursing/primary team or directly to the rapid response team. Clinical care and interventions were at the discretion of the providers. For the control units, alerts were generated but not sent. Main Outcomes: The primary outcome was the incidence of escalation of care. Secondary outcomes included orders placed for cardiovascular support, in-hospital and 30-day mortality. Ad-hoc outcomes included time to ICU escalation and time to discharge. Results: 2,780 patients were enrolled, 1,506 in the intervention group and the 1,274 in the control group. Average age was 66.2 years and 1,446 (52%) of participants were female. There was no difference in escalation between the groups, relative risk(RR) 1.22(95% Confidence Interval[CI] (0.97,1.54),p=0.10). Patients in the intervention group were more likely to receive cardiovascular support orders RR 1.35(95% CI (1.10,1.66),p=0.022). Median time to escalation with alerts was 50.6 [21.6-103] versus 58.6 [25.4-115] hours (difference -5.70;95% CI (-10.00,-2.00),p<0.001). The hazard ratio for likelihood of ICU escalation within 12 hours of an alert was 3.36 (95% CI (1.38,8.21),p=0.01) and for faster hospital discharge was 1.10 (95% CI (1.01,1.19),p=0.02). Combined in-hospital and 30-day-mortality was lower in the intervention group, RR 0.72 (95% CI (0.54,0.94),p=0.01). Conclusions and Relevance: Preliminary evidence suggests that real-time machine learning alerts do not reduce the incidence of escalation but are effective in reducing time to escalation, hospital length of stay and mortality. Trial Registration: ClinicalTrials.gov, NCT04026555, https://clinicaltrials.gov

[1]  L. Halasyamani,et al.  The Impact of a Machine Learning Early Warning Score on Hospital Mortality: A Multicenter Clinical Intervention Trial , 2022, Critical care medicine.

[2]  M. Akel,et al.  Less is More: Detecting Clinical Deterioration in the Hospital with Machine Learning Using Only Age, Heart Rate, and Respiratory Rate. , 2021, Resuscitation.

[3]  Pallavi V. Kulkarni,et al.  A Simulated Prospective Evaluation of a Deep Learning Model for Real-Time Prediction of Clinical Deterioration Among Ward Patients* , 2021, Critical care medicine.

[4]  James M. Naessens,et al.  Using machine learning to improve the accuracy of patient deterioration predictions: Mayo Clinic Early Warning Score (MC-EWS) , 2021, J. Am. Medical Informatics Assoc..

[5]  G. Escobar,et al.  Automated Identification of Adults at Risk for In-Hospital Clinical Deterioration. , 2020, The New England journal of medicine.

[6]  Prem Timsina,et al.  MEWS++: Enhancing the Prediction of Clinical Deterioration in Admitted Patients through a Machine Learning Model , 2020, Journal of clinical medicine.

[7]  S. Bernstein,et al.  Emergency department monitor alarms rarely change clinical management: An observational study. , 2020, The American journal of emergency medicine.

[8]  K. Carey,et al.  Validation of Early Warning Scores at Two Long-Term Acute Care Hospitals. , 2019, Critical care medicine.

[9]  Rebecca C. Steorts,et al.  Minimal Impact of Implemented Early Warning Score and Best Practice Alert for Patient Deterioration* , 2019, Critical care medicine.

[10]  Philipp Probst,et al.  Random forest versus logistic regression: a large-scale benchmark experiment , 2018, BMC Bioinformatics.

[11]  M. Churpek,et al.  Predictors of In-Hospital Mortality After Rapid Response Team Calls in a 274 Hospital Nationwide Sample* , 2018, Critical care medicine.

[12]  Malcolm Green,et al.  Comparison of the Between the Flags calling criteria to the MEWS, NEWS and the electronic Cardiac Arrest Risk Triage (eCART) score for the identification of deteriorating ward patients. , 2018, Resuscitation.

[13]  Jonathan Rubin,et al.  An Ensemble Boosting Model for Predicting Transfer to the Pediatric Intensive Care Unit , 2017, Int. J. Medical Informatics.

[14]  M. Churpek,et al.  Validating the Electronic Cardiac Arrest Risk Triage (eCART) Score for Risk Stratification of Surgical Inpatients in the Postoperative Setting: Retrospective Cohort Study. , 2018, Annals of surgery.

[15]  Vincent Liu,et al.  Development and validation of an electronic medical record-based alert score for detection of inpatient deterioration outside the ICU , 2016, J. Biomed. Informatics.

[16]  Jane C. Deng,et al.  Prediction of Clinical Deterioration in Hospitalized Adult Patients with Hematologic Malignancies Using a Neural Network Model , 2016, PloS one.

[17]  David O. Meltzer,et al.  Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards , 2016, Critical care medicine.

[18]  Douglas G Altman,et al.  Inverse probability weighting , 2016, British Medical Journal.

[19]  C. Winslow,et al.  Multicenter development and validation of a risk stratification tool for ward patients. , 2014, American journal of respiratory and critical care medicine.

[20]  Chenyang Lu,et al.  A randomized trial of real-time automated clinical deterioration alerts sent to a rapid response team. , 2014, Journal of hospital medicine.

[21]  E. Jansma,et al.  The impact of the use of the Early Warning Score (EWS) on patient outcomes: a systematic review. , 2014, Resuscitation.

[22]  C. Subbe,et al.  Standardized measurement of the Modified Early Warning Score results in enhanced implementation of a Rapid Response System: a quasi-experimental study. , 2014, Resuscitation.

[23]  Michael J Rothman,et al.  Measuring the modified early warning score and the Rothman Index: Advantages of utilizing the electronic medical record in an early warning system , 2013, Journal of hospital medicine.

[24]  C. Subbe,et al.  Into the night: factors affecting response to abnormal Early Warning Scores out-of-hours and implications for service improvement. , 2014, Acute medicine.

[25]  G. McNeill,et al.  Do either early warning systems or emergency response teams improve hospital patient survival? A systematic review. , 2013, Resuscitation.

[26]  Michael J. Rothman,et al.  Development and validation of a continuous measure of patient condition using the Electronic Medical Record , 2013, J. Biomed. Informatics.

[27]  Kevin M. Heard,et al.  A trial of a real-time alert for clinical deterioration in patients hospitalized on general medical wards. , 2013, Journal of hospital medicine.

[28]  Jos Boekhorst,et al.  Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle? , 2012, Briefings Bioinform..

[29]  R. Pearse,et al.  A national early warning score for acutely ill patients , 2012, BMJ : British Medical Journal.

[30]  Susanne M. Smorenburg,et al.  UvA-DARE ( Digital Academic Repository ) Rapid response systems . Recognition and management of the deteriorating patient , 2018 .

[31]  M. Cvach Monitor alarm fatigue: an integrative review. , 2012, Biomedical instrumentation & technology.

[32]  C. Subbe,et al.  Validation of a modified Early Warning Score in medical admissions. , 2001, QJM : monthly journal of the Association of Physicians.

[33]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .