论文信息 - Early detection of sepsis utilizing deep learning on electronic health record event sequences

Early detection of sepsis utilizing deep learning on electronic health record event sequences

BACKGROUND The timeliness of detection of a sepsis incidence in progress is a crucial factor in the outcome for the patient. Machine learning models built from data in electronic health records can be used as an effective tool for improving this timeliness, but so far, the potential for clinical implementations has been largely limited to studies in intensive care units. This study will employ a richer data set that will expand the applicability of these models beyond intensive care units. Furthermore, we will circumvent several important limitations that have been found in the literature: (1) Model evaluations neglect the clinical consequences of a decision to start, or not start, an intervention for sepsis. (2) Models are evaluated shortly before sepsis onset without considering interventions already initiated. (3) Machine learning models are built on a restricted set of clinical parameters, which are not necessarily measured in all departments. (4) Model performance is limited by current knowledge of sepsis, as feature interactions and time dependencies are hard-coded into the model. METHODS In this study, we present a model to overcome these shortcomings using a deep learning approach on a diverse multicenter data set. We used retrospective data from multiple Danish hospitals over a seven-year period. Our sepsis detection system is constructed as a combination of a convolutional neural network and a long short-term memory network. We assess model quality by standard concepts of accuracy as well as clinical usefulness, and we suggest a retrospective assessment of interventions by looking at intravenous antibiotics and blood cultures preceding the prediction time. RESULTS Results show performance ranging from AUROC 0.856 (3 h before sepsis onset) to AUROC 0.756 (24 h before sepsis onset). Evaluating the clinical utility of the model, we find that a large proportion of septic patients did not receive antibiotic treatment or blood culture at the time of the sepsis prediction, and the model could, therefore, facilitate such interventions at an earlier point in time. CONCLUSION We present a deep learning system for early detection of sepsis that can learn characteristics of the key factors and interactions from the raw event sequence data itself, without relying on a labor-intensive feature extraction work. Our system outperforms baseline models, such as gradient boosting, which rely on specific data elements and therefore suffer from many missing values in our dataset.

[1] J. Habbema,et al. The measurement of performance in probabilistic diagnosis. II. Trustworthiness of the exact values of the diagnostic probabilities. , 1978, Methods of information in medicine.

[2] C. Pedersen,et al. The Danish Civil Registration System , 2011, Scandinavian journal of public health.

[3] Alex Graves,et al. Generating Sequences With Recurrent Neural Networks , 2013, ArXiv.

[4] R. Bellomo,et al. The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). , 2016, JAMA.

[5] Alexander Binder,et al. Explaining nonlinear classification decisions with deep Taylor decomposition , 2015, Pattern Recognit..

[6] Jiaquan Xu,et al. Deaths: final data for 2010. , 2013, National vital statistics reports : from the Centers for Disease Control and Prevention, National Center for Health Statistics, National Vital Statistics System.

[7] Yvonne Vergouwe,et al. Towards better clinical prediction models: seven steps for development and an ABCD for validation. , 2014, European heart journal.

[8] Mitchell M. Levy,et al. 2001 SCCM/ESICM/ACCP/ATS/SIS International Sepsis Definitions Conference , 2003, Intensive Care Medicine.

[9] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10] Parisa Rashidi,et al. Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis , 2017, IEEE Journal of Biomedical and Health Informatics.

[11] Harry J de Koning,et al. Risk prediction models for selection of lung cancer screening candidates: A retrospective validation study , 2017, PLoS medicine.

[12] Steven Horng,et al. Creating an automated trigger for sepsis clinical decision support at emergency department triage using machine learning , 2017, PloS one.

[13] Tara N. Sainath,et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14] Christopher W. Barton,et al. A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[15] P. Bossuyt,et al. An assessment of the relationship between clinical utility and predictive ability measures and the impact of mean risk in the population , 2014, BMC Medical Research Methodology.

[16] Benjamin Recht,et al. Do CIFAR-10 Classifiers Generalize to CIFAR-10? , 2018, ArXiv.

[17] I. Kohane,et al. Biases in electronic health record data due to processes within the healthcare system: retrospective observational study , 2018, British Medical Journal.

[18] Valentin Rousson,et al. Decision curve analysis revisited: overall net benefit, relationships to ROC curve analysis, and application to case-control studies , 2011, BMC Medical Informatics Decis. Mak..