Early Sepsis Prediction Using Ensemble Learning with Features Extracted from LSTM Recurrent Neural Network

Early prediction of sepsis can help to identify potential risks in time and help take necessary measures to prevent more dangerous situations from occurring. In PhysioNet/Computing in Cardiology Challenge 2019, we integrate Long Short Term Memory (LSTM) recurrent neural network and ensemble learning to achieve early sepsis prediction. Specifically, we tackle the problem of class imbalance and data missing firstly, and then we manually extract features according to the prior knowledge from the medical field. In addition, we regard the prediction of sepsis as a time series prediction problem and pre-train LSTM-based models as feature extractors to obtain the "deep" features on time series that might be related to the onset of sepsis. Manual features and "deep" features are then used to train prediction models under the framework of ensemble learning, including Extreme Gradient Boosting (XGBoost) and Gradient Boosting Decision Tree (GBDT) regressor. The final normalized utility score our team (UCAS_DataMiner) have obtained was 0.313 on full hidden test set.

[1]  Lovekesh Vig,et al.  Anomaly detection in ECG time signals via deep long short-term memory networks , 2015, 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[2]  Feng Jiang,et al.  Recurrent Neural Network Based Classification of ECG Signal Features for Obstruction of Sleep Apnea Detection , 2017, 22017 IEEE International Conference on Computational Science and Engineering (CSE) and IEEE International Conference on Embedded and Ubiquitous Computing (EUC).

[3]  Fan Wu,et al.  The Prediction of Severe Sepsis Using SVM Model , 2009, BIOCOMP.

[4]  R. Bellomo,et al.  The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[5]  David J. C. MacKay,et al.  Ensemble Learning for Blind Image Separation and Deconvolution , 2000 .

[6]  P. Heegaard,et al.  The use of sequential organ failure assessment parameters in an awake porcine model of severe Staphylococcus aureus sepsis , 2012, APMIS : acta pathologica, microbiologica, et immunologica Scandinavica.

[7]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[8]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[9]  Shamim Nemati,et al.  Early Prediction of Sepsis From Clinical Data: The PhysioNet/Computing in Cardiology Challenge 2019 , 2019, 2019 Computing in Cardiology (CinC).

[10]  Friedhelm Schwenker,et al.  Ensemble Methods: Foundations and Algorithms [Book Review] , 2013, IEEE Computational Intelligence Magazine.

[11]  Alan E Jones,et al.  The Sequential Organ Failure Assessment score for predicting outcome in patients with severe sepsis and evidence of hypoperfusion at the time of emergency department presentation* , 2009, Critical care medicine.

[12]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[13]  S. Pinto‐Sietsma,et al.  Prognosis of patients with haematological malignancies admitted to the intensive care unit: Sequential Organ Failure Assessment (SOFA) trend is a powerful predictor of mortality. , 2011, European journal of internal medicine.

[14]  Evert de Jonge,et al.  Using hierarchical dynamic Bayesian networks to investigate dynamics of organ failure in patients in the Intensive Care Unit , 2010, J. Biomed. Informatics.

[15]  U. Rajendra Acharya,et al.  Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats , 2018, Comput. Biol. Medicine.

[16]  Ashish Sharma,et al.  Early Prediction of Sepsis from Clinical Data: the PhysioNet/Computing in Cardiology Challenge 2019 , 2019, 2019 Computing in Cardiology (CinC).

[17]  Kin Keung Lai,et al.  Credit risk assessment with a multistage neural network ensemble learning approach , 2008, Expert Syst. Appl..