Development and Evaluation of a Machine Learning Model for the Early Identification of Patients at Risk for Sepsis

Study objective: The Third International Consensus Definitions (Sepsis‐3) Task Force recommended the use of the quick Sequential [Sepsis‐related] Organ Failure Assessment (qSOFA) score to screen patients for sepsis outside of the ICU. However, subsequent studies raise concerns about the sensitivity of qSOFA as a screening tool. We aim to use machine learning to develop a new sepsis screening tool, the Risk of Sepsis (RoS) score, and compare it with a slate of benchmark sepsis‐screening tools, including the Systemic Inflammatory Response Syndrome, Sequential Organ Failure Assessment (SOFA), qSOFA, Modified Early Warning Score, and National Early Warning Score. Methods: We used retrospective electronic health record data from adult patients who presented to 49 urban community hospital emergency departments during a 22‐month period (N=2,759,529). We used the Rhee clinical surveillance criteria as our standard definition of sepsis and as the primary target for developing our model. The data were randomly split into training and test cohorts to derive and then evaluate the model. A feature selection process was carried out in 3 stages: first, we reviewed existing models for sepsis screening; second, we consulted with local subject matter experts; and third, we used a supervised machine learning called gradient boosting. Key metrics of performance included alert rate, area under the receiver operating characteristic curve, sensitivity, specificity, and precision. Performance was assessed at 1, 3, 6, 12, and 24 hours after an index time. Results: The RoS score was the most discriminant screening tool at all time thresholds (area under the receiver operating characteristic curve 0.93 to 0.97). Compared with the next most discriminant benchmark (Sequential Organ Failure Assessment), RoS was significantly more sensitive (67.7% versus 49.2% at 1 hour and 84.6% versus 80.4% at 24 hours) and precise (27.6% versus 12.2% at 1 hour and 28.8% versus 11.4% at 24 hours). The sensitivity of qSOFA was relatively low (3.7% at 1 hour and 23.5% at 24 hours). Conclusion: In this retrospective study, RoS was more timely and discriminant than benchmark screening tools, including those recommend by the Sepsis‐3 Task Force. Further study is needed to validate the RoS score at independent sites.

[1]  Hamid Mohamadlou,et al.  High-performance detection and early prediction of septic shock for alcohol-use disorder patients , 2016, Annals of medicine and surgery.

[2]  C. Sprung,et al.  Surviving Sepsis Campaign: International Guidelines for Management of Severe Sepsis and Septic Shock 2012 , 2013, Critical care medicine.

[3]  J. Greenslade,et al.  SIRS, qSOFA and organ dysfunction: insights from a prospective database of emergency department patients with infection. , 2016 .

[4]  Gary B. Smith,et al.  The ability of the National Early Warning Score (NEWS) to discriminate patients at risk of early cardiac arrest, unanticipated intensive care unit admission, and death. , 2013, Resuscitation.

[5]  M. Howell,et al.  Quick Sepsis‐related Organ Failure Assessment, Systemic Inflammatory Response Syndrome, and Early Warning Scores for Detecting Clinical Deterioration in Infected Patients outside the Intensive Care Unit , 2017, American journal of respiratory and critical care medicine.

[6]  W. Knaus,et al.  Definitions for sepsis and organ failure and guidelines for the use of innovative therapies in sepsis. The ACCP/SCCM Consensus Conference Committee. American College of Chest Physicians/Society of Critical Care Medicine. , 1992, Chest.

[7]  Uli K. Chettipally,et al.  Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach , 2016, JMIR medical informatics.

[8]  T. Rea,et al.  Assessment of Clinical Criteria for Sepsis: For the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[9]  Counting Sepsis, an Imprecise but Improving Science. , 2017, JAMA.

[10]  O. Bouamra,et al.  The value of traditional vital signs, shock index, and age-based markers in predicting trauma mortality , 2013, The journal of trauma and acute care surgery.

[11]  P. Claret,et al.  Micro-organismen en infectieziekten bij de mens: algemene principes , 2016, JAMA.

[12]  C. Subbe,et al.  Validation of a modified Early Warning Score in medical admissions. , 2001, QJM : monthly journal of the Association of Physicians.

[13]  Leo Breiman,et al.  Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author) , 2001 .

[14]  S. Simpson SIRS in the Time of Sepsis-3. , 2018, Chest.

[15]  R. Bellomo,et al.  The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[16]  R. Bellomo,et al.  Prognostic Accuracy of the SOFA Score, SIRS Criteria, and qSOFA Score for In-Hospital Mortality Among Adults With Suspected Infection Admitted to the Intensive Care Unit , 2017, JAMA.

[17]  S. Lemeshow,et al.  Surviving Sepsis Campaign: association between performance metrics and outcomes in a 7.5-year study. , 2015, Critical care medicine.

[18]  P. Pronovost,et al.  A targeted real-time early warning score (TREWScore) for septic shock , 2015, Science Translational Medicine.

[19]  Spencer S. Jones,et al.  Development and Evaluation of an Automated Machine Learning Algorithm for In-Hospital Mortality Risk Adjustment Among Critical Care Patients* , 2018, Critical care medicine.

[20]  Steven Horng,et al.  Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning , 2017, PloS one.

[21]  Christopher W. Barton,et al.  A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[22]  B. Efron Bootstrap Methods: Another Look at the Jackknife , 1979 .

[23]  Robert C. Amland,et al.  Quick Sequential [Sepsis-Related] Organ Failure Assessment (qSOFA) and St. John Sepsis Surveillance Agent to Detect Patients at Risk of Sepsis: An Observational Cohort Study , 2017, AMIA.

[24]  I. Kohane,et al.  Big Data and Machine Learning in Health Care. , 2018, JAMA.

[25]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[26]  Susan Gruber,et al.  Incidence and Trends of Sepsis in US Hospitals Using Clinical vs Claims Data, 2009-2014 , 2017, JAMA.

[27]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .