Classification of Patients Using Novel Multivariate Time Series Representations of Physiological Data

In this paper we present two novel multivariate time series representations to classify physiological data of different lengths. The representations may be applied to any group of multivariate time series data that examine the state or health of an entity. Multivariate Bag-of-Patterns and Stacked Bags of-Patterns improve on their univariate counterpart, inspired by the bag-of-words model, by using multiple time series and analyzing the data in a multivariate fashion. We also borrow techniques from the natural language processing domain such as term frequency and inverse document frequency to improve classification accuracy. We introduce a technique named inverse frequency and present experimental results on classifying patients who have experienced acute episodes of hypotension.

[1]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[2]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[3]  Eamonn J. Keogh,et al.  Detecting time series motifs under uniform scaling , 2007, KDD '07.

[4]  JH Henriques,et al.  Prediction of acute hypotensive episodes using neural network multi-models , 2009, 2009 36th Annual Computers in Cardiology Conference (CinC).

[5]  Ben Shneiderman,et al.  PatternFinder in Microsoft Amalga : Temporal Query Formulation and Result Visualization in Action , 2008 .

[6]  MA Mneimneh,et al.  A rule-based approach for the prediction of acute hypotensive episodes , 2009, 2009 36th Annual Computers in Cardiology Conference (CinC).

[7]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[8]  Hilltop Circle Baltimore,et al.  Multivariate Time Series Analysis of Clinical and Physiological Data , 2010 .

[9]  M. Lemay,et al.  Computers in Cardiology / Physionet Challenge 2009: Predicting acute hypotensive episodes , 2009, 2009 36th Annual Computers in Cardiology Conference (CinC).

[10]  Eamonn J. Keogh,et al.  UCR Time Series Data Mining Archive , 1983 .

[11]  Heikki Mannila,et al.  Time series segmentation for context recognition in mobile devices , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[12]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[13]  A Murray,et al.  Predicting acute hypotensive episodes from mean arterial pressure , 2009, 2009 36th Annual Computers in Cardiology Conference (CinC).

[14]  Mohammed Saeed,et al.  A Novel Method for the Efficient Retrieval of Similar Multiparameter Physiologic Time Series Using Wavelet-Based Symbolic Representations , 2006, AMIA.

[15]  Irfan A. Essa,et al.  Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[16]  J. Bain,et al.  PSYCHOLOGICAL SCIENCE Research Article How Many Variables Can Humans Process? , 2022 .

[17]  Eamonn J. Keogh,et al.  iSAX: indexing and mining terabyte sized time series , 2008, KDD.

[18]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[19]  Yuan Li,et al.  Finding Structural Similarity in Time Series Data Using Bag-of-Patterns Representation , 2009, SSDBM.

[20]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[21]  Eamonn J. Keogh A decade of progress in indexing and mining large time series databases , 2006, VLDB.

[22]  R. Mark,et al.  Integrating Data, Models, and Reasoning in Critical Care , 2006, 2006 International Conference of the IEEE Engineering in Medicine and Biology Society.

[23]  Toshiyuki Amagasa,et al.  The L - index: An indexing structure for ecient subsequence matching in time sequence databases , 2001 .

[24]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[25]  Ben Shneiderman,et al.  Dynamic Query Tools for Time Series Data Sets: Timebox Widgets for Interactive Exploration , 2004, Inf. Vis..

[26]  Tom Armstrong,et al.  Discovering Patterns in Real-Valued Time Series , 2006, PKDD.

[27]  Ben Shneiderman,et al.  Aligning temporal data by sentinel events: discovering patterns in electronic health records , 2008, CHI.

[28]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[29]  Padhraic Smyth,et al.  Deformable Markov model templates for time-series pattern matching , 2000, KDD '00.