Cost-sensitive Deep Learning for Early Readmission Prediction at A Major Hospital

With the increased use of electronic medical records (EMRs), data mining on medical data has great potential to improve the quality of hospital treatment and increase the survival rate of patients. Early readmission prediction enables early intervention, which is essential to preventing serious or life-threatening events, and acts as a substantial contributor to reducing healthcare costs. Existing work on predicting readmission often focuses on certain vital signs and diseases by extracting statistical features. It also fails to consider the skewness of class labels in medical data and the different costs of misclassification errors. In this paper, we use convolutional neural networks (CNNs) to automatically learn features from time series of vital signs, and categorical feature embedding to effectively extend feature vectors with heterogeneous clinical features, such as demographics, hospitalization history, vital signs and laboratory tests. Both the features learnt via the CNN and the statistical features obtained via feature embedding are then fed into a multilayer perceptron (MLP) for prediction. We train the MLP with a cost-sensitive formulation to address the class imbalance and skewness challenge. We validate the proposed approach on two real medical datasets from Barnes-Jewish Hospital. All data are taken from historical EMR databases and reflect the kinds of data that would realistically be available to a clinical prediction system in hospitals. We find that early prediction of readmission is possible and that our methods perform significantly better than the state-of-the-art methods currently used by hospitals. Based on these results, a system with the proposed forecasting algorithms is being deployed in hospital settings to support treatment.
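The abstract does not spell out the cost-sensitive formulation, so the following is only a minimal sketch of one common choice: a binary cross-entropy in which the two error types carry different weights. The function name `cost_sensitive_bce` and the cost values `cost_fn` and `cost_fp` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def cost_sensitive_bce(y_true, p_pred, cost_fn=5.0, cost_fp=1.0, eps=1e-12):
    """Weighted binary cross-entropy (illustrative, not the paper's exact loss).

    y_true  : 0/1 labels, where 1 marks a readmitted patient (the rare class).
    p_pred  : predicted readmission probabilities in [0, 1].
    cost_fn : assumed weight on errors for positive examples (missed readmissions).
    cost_fp : assumed weight on errors for negative examples (false alarms).
    """
    y = np.asarray(y_true, dtype=float)
    # Clip probabilities so that log() never receives exactly 0 or 1.
    p = np.clip(np.asarray(p_pred, dtype=float), eps, 1.0 - eps)
    per_example = -(cost_fn * y * np.log(p)
                    + cost_fp * (1.0 - y) * np.log(1.0 - p))
    return float(per_example.mean())

# Skewed toy batch: one readmission among four patients.
labels = [1, 0, 0, 0]
probs = [0.2, 0.1, 0.3, 0.4]
unweighted = cost_sensitive_bce(labels, probs, cost_fn=1.0, cost_fp=1.0)
weighted = cost_sensitive_bce(labels, probs, cost_fn=5.0, cost_fp=1.0)
```

With equal costs the function reduces to ordinary cross-entropy. Raising `cost_fn` increases the penalty for assigning a low probability to a true readmission, which is one way to keep the minority class from being ignored during training.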
