Evaluation of the Impact of Data Uncertainty on the Prediction of Physiological Patient Deterioration

In this paper, we investigate the impact of data uncertainty on the prediction of a vital sign of abnormalities. We analyze the evolution of mean arterial blood pressure (MAP) and heart rate (HR) vital signs time series over a given forecasting window, whose values are classified according to a model of normality that models patient’s physiological deterioration. To this end, we developed a generic prediction framework that enables the deployment of different methods for data preparation, training, prediction, and classification so as to perform a set of experiments. To address our purpose, we selected a general regression neural network model (to predict vital sign trends) and a support vector machine (to recognize abnormalities) as the appropriate methods for a baseline analysis. However, other algorithm choices could be tailored in the defined prediction framework. We extract from the multiparameter intelligent monitoring in intensive care II waveform database a set of 302, 940 vital sign samples (organized as 459 records) to define training and testing data sets. From our experiments, we obtained evidence demonstrating the incidence of null values in the MAP/HR input time series on forecasting tasks. This was performed under different conditions in terms of null values in the observation window, and varying other factors like labeling methods and forecast window sizes. In our experiments, we concentrate on analyzing the extent of prediction performance under such conditions and identify that a tolerable level of null values is around 25% where it is possible to still derive forecast windows with an accuracy target $\geq 0.9$ for both MAP and HR vital signs. The presented evaluation contributes to acknowledge the importance of uncertainty conditions on the prediction performance of vital signs and paves the way for future extensions involving other prediction approaches.

[1]  David A. Clifton,et al.  Modelling physiological deterioration in post-operative patient vital-sign data , 2013, Medical & Biological Engineering & Computing.

[2]  Divya Tomar,et al.  A survey on Data Mining approaches for Healthcare , 2013, BSBT 2013.

[3]  James L Rudolph,et al.  Vital signs in older patients: age-related changes. , 2011, Journal of the American Medical Directors Association.

[4]  Paulo Carvalho,et al.  Prediction of acute hypotensive episodes by means of neural network multi-models , 2011, Comput. Biol. Medicine.

[5]  Shamim Nemati,et al.  Machine Learning and Decision Support in Critical Care , 2016, Proceedings of the IEEE.

[6]  David A. Clifton,et al.  A Two-Class Approach to the Detection of Physiological Deterioration in Patient Vital Signs, With Clinical Label Refinement , 2012, IEEE Transactions on Information Technology in Biomedicine.

[7]  Shourya Roy,et al.  Predicting Complications in Critical Care Using Heterogeneous Clinical Data , 2016, IEEE Access.

[8]  Yang Zhang,et al.  Probabilistic Novelty Detection With Support Vector Machines , 2014, IEEE Transactions on Reliability.

[9]  David A. Clifton,et al.  Identification of patient deterioration in vital-sign data using one-class support vector machines , 2011, 2011 Federated Conference on Computer Science and Information Systems (FedCSIS).

[10]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[11]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[12]  Mohammed Saeed,et al.  Open-access MIMIC-II database for intensive care research , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[13]  K. Hillman,et al.  A comparison of Antecedents to Cardiac Arrests, Deaths and EMergency Intensive care Admissions in Australia and New Zealand, and the United Kingdom—the ACADEMIA study , 2004 .

[14]  David A. Clifton,et al.  A review of novelty detection , 2014, Signal Process..

[15]  F. Baret,et al.  A comparison of methods for smoothing and gap filling time series of remote sensing observations - application to MODIS LAI products , 2012 .

[16]  Anthony K. H. Tung,et al.  Spatial clustering methods in data mining : A survey , 2001 .

[17]  F. Tay,et al.  Application of support vector machines in financial time series forecasting , 2001 .

[18]  L. M. Anderson Statistics with Confidence. Confidence Intervals and Statistical Guidelines , 1989 .

[19]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Lionel Tarassenko,et al.  BIOSIGN™ : multi-parameter monitoring for early warning of patient deterioration , 2005 .

[22]  Christopher W. Barton,et al.  A computational approach to early sepsis detection , 2016, Comput. Biol. Medicine.

[23]  Wei-Chen Chen,et al.  EM Algorithm for Model-Based Clustering of Finite MixtureGaussian Distribution , 2015 .

[24]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[25]  Xuan Zeng,et al.  A Machine Learning-Empowered System for Long-Term Motion-Tolerant Wearable Monitoring of Blood Pressure and Heart Rate With Ear-ECG/PPG , 2017, IEEE Access.

[26]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[27]  G.B. Moody,et al.  Robust parameter extraction for decision support using multimodal intensive care data , 2008, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[28]  Ibrahim Khalil,et al.  ViSiBiD: A learning model for early discovery and real-time prediction of severe clinical events using vital signs as big data , 2017, Comput. Networks.