Early Deterioration Warning for Hospitalized Patients by Mining Clinical Data

Data mining on medical data has great potential to improve the treatment quality of hospitals and increase the survival rate of patients. Every year, 4-17% of patients undergo cardiopulmonary or respiratory arrest while in hospitals. Clinical study has found early detection and intervention to be essential for preventing clinical deterioration in patients at general hospital units. This paper proposes an early warning system (EWS) designed to identify the signs of clinical deterioration and provide early warning for serious clinical events. The EWS is designed to provide reliable early alarms for patients at the general hospital wards (GHWs). The main task of EWS is a challenging classification problem on high-dimensional stream data with irregular, multi-scale data gaps, measurement errors, outliers, and class imbalance. This paper proposes a novel data mining framework for analyzing such medical data streams. The authors assess the feasibility of the proposed EWS approach through retrospective study that includes data from 41,503 visits at a major hospital. Finally, the system is applied in a clinical trial at a major hospital and obtains promising results. This project is an example of multidisciplinary cyber-physical systems involving researchers in clinical science, data mining, and nursing staff.

[1]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[2]  E Michel,et al.  Artificial neural network for risk assessment in preterm neonates , 1998, Archives of disease in childhood. Fetal and neonatal edition.

[3]  Fernando Pereira,et al.  Shallow Parsing with Conditional Random Fields , 2003, NAACL.

[4]  Jing Li,et al.  Heterogeneous data fusion for alzheimer's disease study , 2008, KDD.

[5]  Yixin Chen,et al.  Toward a two-tier clinical warning system for hospitalized patients. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[6]  Amaia Bilbao,et al.  Prospective comparison of severity scores for predicting clinically relevant outcomes for patients hospitalized with community-acquired pneumonia. , 2009, Chest.

[7]  Zhi-Hua Zhou,et al.  Exploratory Under-Sampling for Class-Imbalance Learning , 2006, ICDM.

[8]  Steve B. Jiang,et al.  Learning methods for lung tumor markerless gating in image-guided radiotherapy , 2008, KDD.

[9]  JeongGil Ko,et al.  MEDiSN: Medical emergency detection in sensor networks , 2010, TECS.

[10]  K. Vijay-Shanker,et al.  Taking into Account the Differences between Actively and Passively Acquired Data: The Case of Active Learning with Support Vector Machines for Imbalanced Datasets , 2009, NAACL.

[11]  Zhi-Hua Zhou,et al.  Exploratory Undersampling for Class-Imbalance Learning , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Katharina Morik,et al.  Combining Statistical Learning with a Knowledge-Based Approach - A Case Study in Intensive Care Monitoring , 1999, ICML.

[13]  M. P. Griffin,et al.  Toward the early diagnosis of neonatal sepsis and sepsis-like illness using novel heart rate analysis. , 2001, Pediatrics.

[14]  TaeHyun Hwang,et al.  Learning on Weighted Hypergraphs to Integrate Protein Interactions and Gene Expressions for Cancer Outcome Prediction , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[15]  E. Draper,et al.  APACHE II: A severity of disease classification system , 1985, Critical care medicine.

[16]  Joshua A. Doherty,et al.  Early prediction of septic shock in hospitalized patients. , 2010, Journal of hospital medicine.

[17]  K. Shojania,et al.  National Patient Safety Goals. , 2006, Bulletin of the American College of Surgeons.

[18]  Richard J. Povinelli,et al.  Time series classification using Gaussian mixture models of reconstructed phase spaces , 2004, IEEE Transactions on Knowledge and Data Engineering.

[19]  Jeffrey A. Kline,et al.  The effect of a quantitative resuscitation strategy on mortality in patients with sepsis: A meta-analysis* , 2008, Critical care medicine.

[20]  Peter J. W. Rayner,et al.  Time series classification using the Volterra connectionist model and Bayes decision theory , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.