Early Prediction of Vital Signs Using Generative Boosting via LSTM Networks

Vital signs including heart rate, respiratory rate, body temperature and blood pressure, are critical in the clinical decision making process. Abnormal vital signs help to alert medical practitioners to potential health problems. Effective and long-range early prediction of vital signs may prevent adverse health outcomes and reduce cost. In this paper, we suggest a new approach called generative boosting, in order to effectively perform long-range early prediction of vital signs. Generative boosting consists of a generative model, to generate synthetic data for the next few time steps, and a predictive model, to directly make long-range predictions based on observed and generated data. We explore generative boosting via long short-term memory (LSTM) for both the predictive and generative models, leading to a scheme called generative LSTM (GLSTM). Our experiments indicate that GLSTM outperforms a diverse range of strong benchmark models, with and without generative boosting. As expected, the results also indicate that more accurately generated data leads to more accurate long-range predictions. In light of this, we use a mutual information based clustering algorithm to select a more representative dataset to train the generative model. This leads to significantly improved accuracy of the long-range prediction of high variation vital signs such as heart rate and systolic blood pressure. Overall, our best results indicate that the proposed method is able to predict heart rate and systolic blood pressure 20 minutes in advance, with a mean absolute percentage error of 7.41% and 6.17%, respectively.

[1]  T. Simon,et al.  Vital Signs: Trends in State Suicide Rates — United States, 1999–2016 and Circumstances Contributing to Suicide — 27 States, 2015 , 2018, MMWR. Morbidity and mortality weekly report.

[2]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[3]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[4]  Jürgen Schmidhuber,et al.  Learning Precise Timing with LSTM Recurrent Networks , 2003, J. Mach. Learn. Res..

[5]  Sibanda Wilbert,et al.  Novel Application of Multi-Layer Perceptrons (MLP) Neural Networks to Model HIV in South Africa using Seroprevalence Data from Antenatal Clinics , 2011 .

[6]  Charles Elkan,et al.  Learning to Diagnose with LSTM Recurrent Neural Networks , 2015, ICLR.

[7]  Katherine A. Heller,et al.  Learning to Detect Sepsis with a Multitask Gaussian Process RNN Classifier , 2017, ICML.

[8]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9]  Diyar Akay,et al.  Comparison of direct and iterative artificial neural network forecast approaches in multi-periodic time series forecasting , 2009, Expert Syst. Appl..

[10]  Gianluca Bontempi,et al.  Machine Learning Strategies for Time Series Forecasting , 2012, eBISS.

[11]  Jürgen Schmidhuber,et al.  Applying LSTM to Time Series Predictable through Time-Window Approaches , 2000, ICANN.

[12]  Haipeng Shen,et al.  Artificial intelligence in healthcare: past, present and future , 2017, Stroke and Vascular Neurology.

[13]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[14]  J. Stock,et al.  A Comparison of Direct and Iterated Multistep Ar Methods for Forecasting Macroeconomic Time Series , 2005 .

[15]  Hossein Deldari,et al.  Job failure prediction in grid environment based on workload characteristics , 2009, 2009 14th International CSI Computer Conference.

[16]  G. Box,et al.  Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models , 1970 .

[17]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[18]  Brian A. King,et al.  Vital Signs: Current Cigarette Smoking Among Adults Aged ≥18 Years with Mental Illness — United States, 2009–2011 , 2013, MMWR. Morbidity and mortality weekly report.

[19]  Ahmed M. Alaa,et al.  Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks , 2017, NIPS.

[20]  Fei Wang,et al.  Deep learning for healthcare: review, opportunities and challenges , 2018, Briefings Bioinform..

[21]  Min Chen,et al.  Disease Prediction by Machine Learning Over Big Data From Healthcare Communities , 2017, IEEE Access.

[22]  Mehul Motani,et al.  Spatio-temporal autoencoder for feature learning in patient data with missing observations , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[23]  Amaury Lendasse,et al.  Methodology for long-term prediction of time series , 2007, Neurocomputing.

[24]  U. Rajendra Acharya,et al.  Deep learning for healthcare applications based on physiological signals: A review , 2018, Comput. Methods Programs Biomed..

[25]  Daniel B. Neill,et al.  Using Artificial Intelligence to Improve Hospital Inpatient Care , 2013, IEEE Intelligent Systems.

[26]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[27]  Jing Gao,et al.  Multistep-Ahead Time Series Prediction , 2006, PAKDD.

[29]  Gang Luo,et al.  MLBCD: a machine learning tool for big clinical data , 2015, Health Inf. Sci. Syst..

[30]  H. Fukayama,et al.  Monitoring of vital signs during dental care. , 2006, International dental journal.

[31]  Eiji Kajii,et al.  Importance of vital signs to the early diagnosis and severity of sepsis: association between vital signs and sequential organ failure assessment score in patients with sepsis. , 2012, Internal medicine.

[32]  Renato Cesar Sato,et al.  Disease management with ARIMA model in time series , 2013, Einstein.

[33]  U Helfenstein,et al.  Box-Jenkins modelling in medical research , 1996, Statistical methods in medical research.

[34]  U. Gather,et al.  Time series analysis in intensive care medicine , 1998 .

[35]  Mehul Motani,et al.  Learning Deep Representations from Heterogeneous Patient Data for Predictive Diagnosis , 2017, BCB.

[36]  Jimeng Sun,et al.  Using recurrent neural network models for early detection of heart failure onset , 2016, J. Am. Medical Informatics Assoc..

[37]  Lovekesh Vig,et al.  Long Short Term Memory Networks for Anomaly Detection in Time Series , 2015, ESANN.

[38]  Ravi Sankar,et al.  Time Series Prediction Using Support Vector Machines: A Survey , 2009, IEEE Computational Intelligence Magazine.

[39]  A. Kraskov,et al.  Estimating mutual information. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[40]  Mehul Motani,et al.  SURI: Feature Selection Based on Unique Relevant Information for Health Data , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[41]  R. Fagan,et al.  Vital Signs: Improving Antibiotic Use Among Hospitalized Patients , 2014, MMWR. Morbidity and mortality weekly report.

[42]  E. A. Mary Anita,et al.  A Survey of Big Data Analytics in Healthcare and Government , 2015 .

[43]  Huiru Zheng,et al.  Machine Learning for Medical Applications , 2015, TheScientificWorldJournal.

[44]  Rob J. Hyndman,et al.  Recursive and direct multi-step forecasting: the best of both worlds , 2012 .

[45]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.