Recent Context-Aware LSTM for Clinical Event Time-Series Prediction

In this work, we propose a novel clinical event time-series model based on the long short-term memory architecture (LSTM) that can predict future event occurrences for a large number of different clinical events. Our model relies on two sources of information to predict future events. One source is derived from the set of recently observed clinical events. The other one is based on the hidden state space defined by the LSTM that aims to abstract past, more distant, patient information that is predictive of future events. We evaluate our proposed model on electronic health record (EHRs) data derived from MIMIC-III dataset. We show that the combination of the two sources of information implemented in our method leads to improved prediction performance compared to the models based on individual sources.

[1]  S. P. Pederson,et al.  Hidden Markov and Other Models for Discrete-Valued Time Series , 1998 .

[2]  R. Kálmán Mathematical description of linear dynamical systems , 1963 .

[3]  Alex Graves,et al.  DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[4]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[5]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[6]  Gilles Clermont,et al.  Outlier detection for patient monitoring and alerting , 2013, J. Biomed. Informatics.

[7]  Milos Hauskrecht,et al.  Clinical time series prediction: Toward a hierarchical dynamical system framework , 2015, Artif. Intell. Medicine.

[8]  Jimeng Sun,et al.  RETAIN: An Interpretable Predictive Model for Healthcare using Reverse Time Attention Mechanism , 2016, NIPS.

[9]  Milos Hauskrecht,et al.  A Regularized Linear Dynamical System Framework for Multivariate Time Series Analysis , 2015, AAAI.

[10]  Jimeng Sun,et al.  Multi-layer Representation Learning for Medical Concepts , 2016, KDD.

[11]  Milos Hauskrecht,et al.  Learning Linear Dynamical Systems from Multivariate Time Series: A Matrix Factorization Based Framework , 2016, SDM.

[12]  Paul J. Werbos,et al.  Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[13]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[14]  Padhraic Smyth,et al.  Clustering Sequences with Hidden Markov Models , 1996, NIPS.

[15]  Milos Hauskrecht,et al.  Modeling Clinical Time Series Using Gaussian Process Sequences , 2013, SDM.

[16]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[17]  Gilles Clermont,et al.  Outlier-based detection of unusual patient-management actions: An ICU study , 2016, J. Biomed. Informatics.

[18]  Navdeep Jaitly,et al.  Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.

[19]  R. L. Stratonovich CONDITIONAL MARKOV PROCESSES , 1960 .

[20]  Anders Krogh,et al.  Hidden Markov models for sequence analysis: extension and analysis of the basic method , 1996, Comput. Appl. Biosci..

[21]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Zachary Chase Lipton A Critical Review of Recurrent Neural Networks for Sequence Learning , 2015, ArXiv.

[24]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[25]  Li-Chiu Chang,et al.  Reinforced recurrent neural networks for multi-step-ahead flood forecasts , 2013 .

[26]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[27]  Yoshua Bengio,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[28]  Eddie McKenzie,et al.  Discrete variate time series , 2003 .

[29]  Min Han,et al.  Prediction of chaotic time series based on the recurrent predictor neural network , 2004, IEEE Transactions on Signal Processing.

[30]  Lain L. MacDonald,et al.  Hidden Markov and Other Models for Discrete- valued Time Series , 1997 .

[31]  Volker Tresp,et al.  Predicting Sequences of Clinical Events by Using a Personalized Temporal Latent Embedding Model , 2015, 2015 International Conference on Healthcare Informatics.

[32]  Milos Hauskrecht,et al.  Feature importance analysis for patient management decisions , 2010, MedInfo.