Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task

A patient's risk for adverse events is affected by temporal processes including the nature and timing of diagnostic and therapeutic activities, and the overall evolution of the patient's pathophysiology over time. Yet many investigators ignore this temporal aspect when modeling patient outcomes, considering only the patient's current or aggregate state. In this paper, we represent patient risk as a time series. In doing so, patient risk stratification becomes a time-series classification task. The task differs from most applications of time-series analysis, like speech processing, since the time series itself must first be extracted. Thus, we begin by defining and extracting approximate risk processes, the evolving approximate daily risk of a patient. Once obtained, we use these signals to explore different approaches to time-series classification with the goal of identifying high-risk patterns. We apply the classification to the specific task of identifying patients at risk of testing positive for hospital acquired Clostridium difficile. We achieve an area under the receiver operating characteristic curve of 0.79 on a held-out set of several hundred patients. Our two-stage approach to risk stratification outperforms classifiers that consider only a patient's current state (p<0.05).

[1]  L. Gentry,et al.  A clinical risk index for Clostridium difficile infection in hospitalised patients receiving broad-spectrum antibiotics. , 2008, The Journal of hospital infection.

[2]  Katharina Morik,et al.  Combining Statistical Learning with a Knowledge-Based Approach - A Case Study in Intensive Care Monitoring , 1999, ICML.

[3]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[4]  Eamonn J. Keogh,et al.  Three Myths about Dynamic Time Warping Data Mining , 2005, SDM.

[5]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[6]  Joshua A. Doherty,et al.  Development and Validation of a Clostridium difficile Infection Risk Prediction Model Author ( s ) : , 2018 .

[7]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[8]  G. Krapohl Preventing Health Care-Associated Infection: Development of a Clinical Prediction Rule for Clostridium difficile Infection. , 2011 .

[9]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[10]  Susan T. Dumais,et al.  The Combination of Text Classifiers Using Reliability Indicators , 2016, Information Retrieval.

[11]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[12]  Claus Bahlmann,et al.  Online handwriting recognition with support vector machines - a kernel approach , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[13]  Yan Yan,et al.  Clostridium difficile--associated disease in a setting of endemicity: identification of novel risk factors. , 2007, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[14]  Yoram Bloch,et al.  Predicting Clostridium difficile Toxin in Hospitalized Patients With Antibiotic-Associated Diarrhea , 2007, Infection Control &#x0026; Hospital Epidemiology.

[15]  J. Wiens,et al.  Learning Evolving Patient Risk Processes for C. Diff Colonization , 2012 .

[16]  Jian Pei,et al.  A brief survey on sequence classification , 2010, SKDD.

[17]  B Littenberg,et al.  Clinical prediction rules to optimize cytotoxin testing for Clostridium difficile in hospitalized patients with diarrhea. , 1996, The American journal of medicine.

[19]  J. Tanner,et al.  Waterlow score to predict patients at risk of developing Clostridium difficile-associated disease. , 2009, The Journal of hospital infection.