CREST - Risk Prediction for Clostridium Difficile Infection Using Multimodal Data Mining

Clostridium difficile infection (CDI) is a common hospital acquired infection with a $1B annual price tag that resulted in \(\sim \)30,000 deaths in 2011. Studies have shown that early detection of CDI significantly improves the prognosis for the individual patient and reduces the overall mortality rates and associated medical costs. In this paper, we present CREST: CDI Risk Estimation, a data-driven framework for early and continuous detection of CDI in hospitalized patients. CREST uses a three-pronged approach for high accuracy risk prediction. First, CREST builds a rich set of highly predictive features from Electronic Health Records. These features include clinical and non-clinical phenotypes, key biomarkers from the patient’s laboratory tests, synopsis features processed from time series vital signs, and medical history mined from clinical notes. Given the inherent multimodality of clinical data, CREST bins these features into three sets: time-invariant, time-variant, and temporal synopsis features. CREST then learns classifiers for each set of features, evaluating their relative effectiveness. Lastly, CREST employs a second-order meta learning process to ensemble these classifiers for optimized estimation of the risk scores. We evaluate the CREST framework using publicly available critical care data collected for over 12 years from Beth Israel Deaconess Medical Center, Boston. Our results demonstrate that CREST predicts the probability of a patient acquiring CDI with an AUC of 0.76 five days prior to diagnosis. This value increases to 0.80 and even 0.82 for prediction two days and one day prior to diagnosis, respectively.

[1]  Ella S. Franklin,et al.  Learning Data-Driven Patient Risk Stratification Models for Clostridium difficile , 2014, Open forum infectious diseases.

[2]  Hui Zhang,et al.  Feature Extraction for Time Series Classification Using Discriminating Wavelet Coefficients , 2006, ISNN.

[3]  Philip M Polgreen,et al.  Incidence of and risk factors for community-associated Clostridium difficile infection: A nested case-control study , 2011, BMC infectious diseases.

[4]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[5]  W. Art Chaovalitwongse,et al.  Electroencephalogram (EEG) time series classification: Applications in epilepsy , 2006, Ann. Oper. Res..

[6]  Joshua A. Doherty,et al.  Development and Validation of a Clostridium difficile Infection Risk Prediction Model Author ( s ) : , 2018 .

[7]  S. Solomon,et al.  Antibiotic resistance threats in the United States: stepping back from the brink. , 2014, American family physician.

[8]  D. Gerding,et al.  Clinical Practice Guidelines for Clostridium difficile Infection in Adults: 2010 Update by the Society for Healthcare Epidemiology of America (SHEA) and the Infectious Diseases Society of America (IDSA) , 2010, Infection Control & Hospital Epidemiology.

[9]  Nasia Safdar,et al.  Current Trends in the Epidemiology and Outcomes of Clostridium difficile Infection. , 2015, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[10]  J. Wiens,et al.  Learning Evolving Patient Risk Processes for C. Diff Colonization , 2012 .

[11]  D. Warren,et al.  Hospital-Associated Clostridium difficile Infection: Is It Necessary to Track Community-Onset Disease? , 2009, Infection Control & Hospital Epidemiology.

[12]  J. Tanner,et al.  Waterlow score to predict patients at risk of developing Clostridium difficile-associated disease. , 2009, The Journal of hospital infection.

[13]  Lisa G Winston,et al.  Burden of Clostridium difficile Infection in the United States , 2015 .

[14]  Jenna Wiens,et al.  Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task , 2012, NIPS.

[15]  Klaus-Robert Müller,et al.  Spatio-spectral filters for improving the classification of single trial EEG , 2005, IEEE Transactions on Biomedical Engineering.

[16]  Sriram V. Pemmaraju,et al.  Improving Risk Prediction of Clostridium Difficile Infection Using Temporal Event-Pairs , 2015, 2015 International Conference on Healthcare Informatics.

[17]  Karsten Sternickel,et al.  Automatic pattern recognition in ECG time series , 2002, Comput. Methods Programs Biomed..