An Ensemble Machine Learning Model For the Early Detection of Sepsis From Clinical Data

Sepsis is a life-threatening disease with high mortality and expensive cost of treatment. In order to improve the outcomes of patients, it is important to detect at-risk patients with sepsis at an early stage. The PhysioNet/Computing in Cardiology Challenge 2019 focused on improving predicting sepsis six hours before the clinical diagnosis by using the latest definition of Sepsis-3. A total of 40,336 ICU patients were provided as public training data, A hidden test dataset was used to evaluate. An ensemble model, which combined boosting and bagging tree models (lightgbm, xgboost and random forest ) were designed to predict sepsis based on the records of the patient’s hourly data. We compared the ensemble model and each single model of evaluation metrics results on selected inner test data Offline, the best performance was achieved AUC of 0.792, ACC of 0.727. Finally, the proposed model was evaluated on the full test sets received an official utility score, defined by the organizers, was 0.087, ranked 75/105 (our team name: cinc_sepsis_pass). While the single model of lightgbm only received a utility score of -0.036. The ensemble model utilized the preprocessing data and achieved better performance than a single tree-based model.

[1]  Uli K. Chettipally,et al.  Prediction of Sepsis in the Intensive Care Unit With Minimal Electronic Health Record Data: A Machine Learning Approach , 2016, JMIR medical informatics.

[2]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[3]  Laura E. Barnes,et al.  Predictive Models of Sepsis in Adult ICU Patients , 2018, 2018 IEEE International Conference on Healthcare Informatics (ICHI).

[4]  Matthew D. Stanley,et al.  Early sepsis detection in critical care patients using multiscale blood pressure and heart rate dynamics. , 2017, Journal of electrocardiology.

[5]  R. Bellomo,et al.  The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[6]  K. Wood,et al.  Duration of hypotension before initiation of effective antimicrobial therapy is the critical determinant of survival in human septic shock* , 2006, Critical care medicine.

[7]  S. Lemeshow,et al.  Time to Treatment and Mortality during Mandated Emergency Care for Sepsis , 2017, The New England journal of medicine.

[8]  Shamim Nemati,et al.  An Interpretable Machine Learning Model for Accurate Prediction of Sepsis in the ICU , 2017, Critical care medicine.

[9]  Christopher W. Barton,et al.  A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[10]  Ashish Sharma,et al.  Early Prediction of Sepsis from Clinical Data: the PhysioNet/Computing in Cardiology Challenge 2019 , 2019, 2019 Computing in Cardiology (CinC).

[11]  Shamim Nemati,et al.  Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019 , 2019, 2019 Computing in Cardiology (CinC).

[12]  Anahita Khojandi,et al.  A minimal set of physiomarkers in continuous high frequency data streams predict adult sepsis onset earlier , 2019, Int. J. Medical Informatics.

[13]  Xia Fan,et al.  Prediction of sepsis in trauma patients , 2014, Burns & Trauma.

[14]  C. Torio,et al.  National Inpatient Hospital Costs: The Most Expensive Conditions by Payer, 2011 , 2013 .