Transition Icons for Time-Series Visualization and Exploratory Analysis

The modern healthcare landscape has seen the rapid emergence of techniques and devices that temporally monitor and record physiological signals. The prevalence of time-series data within the healthcare field necessitates the development of methods that can analyze the data in order to draw meaningful conclusions. Time-series behavior is notoriously difficult to intuitively understand due to its intrinsic high-dimensionality, which is compounded in the case of analyzing groups of time series collected from different patients. Our framework, which we call transition icons, renders common patterns in a visual format useful for understanding the shared behavior within groups of time series. Transition icons are adept at detecting and displaying subtle differences and similarities, e.g., between measurements taken from patients receiving different treatment strategies or stratified by demographics. We introduce various methods that collectively allow for exploratory analysis of groups of time series, while being free of distribution assumptions and including simple heuristics for parameter determination. Our technique extracts discrete transition patterns from symbolic aggregate approXimation representations, and compiles transition frequencies into a bag of patterns constructed for each group. These transition frequencies are normalized and aligned in icon form to intuitively display the underlying patterns. We demonstrate the transition icon technique for two time-series datasets—postoperative pain scores, and hip-worn accelerometer activity counts. We believe transition icons can be an important tool for researchers approaching time-series data, as they give rich and intuitive information about collective time-series behaviors.

[1]  Kuniaki Uehara,et al.  Discovery of Time-Series Motif from Multi-Dimensional Data Based on MDL Principle , 2005, Machine Learning.

[2]  Purnamrita Sarkar,et al.  The Big Data Bootstrap , 2012, ICML.

[3]  R. Fillingim,et al.  Characterizations of Temporal Postoperative Pain Signatures With Symbolic Aggregate Approximations , 2017, The Clinical journal of pain.

[4]  Stuart Barber,et al.  All of Statistics: a Concise Course in Statistical Inference , 2005 .

[5]  Tom Armstrong,et al.  Using Modified Multivariate Bag-of-Words Models to Classify Physiological Data , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[6]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[7]  David A. Clifton,et al.  Multitask Gaussian Processes for Multivariate Physiological Time-Series Analysis , 2015, IEEE Transactions on Biomedical Engineering.

[8]  Eamonn J. Keogh,et al.  iSAX: indexing and mining terabyte sized time series , 2008, KDD.

[9]  Li Wei,et al.  Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems , 2006, Sixth International Conference on Data Mining (ICDM'06).

[10]  Eamonn J. Keogh A decade of progress in indexing and mining large time series databases , 2006, VLDB.

[11]  P. Rheingans,et al.  Temporal visualization of planning polygons for efficient partitioning of geo-spatial data , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[12]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[13]  Spencer S. Jones,et al.  Health Information Technology: An Updated Systematic Review With a Focus on Meaningful Use , 2014, Annals of Internal Medicine.

[14]  Jignesh M. Patel,et al.  Estimating the selectivity of tf-idf based cosine similarity predicates , 2007, SGMD.

[15]  Visa Koivunen,et al.  Robust, Scalable, and Fast Bootstrap Method for Analyzing Large Scale Data , 2016, IEEE Transactions on Signal Processing.

[16]  L. Mâsse,et al.  Physical activity in the United States measured by accelerometer. , 2008, Medicine and science in sports and exercise.

[17]  Nitin Kumar,et al.  Time-series Bitmaps: a Practical Visualization Tool for Working with Large Time Series Databases , 2005, SDM.

[18]  S. Pocock,et al.  Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies , 2007, BMJ : British Medical Journal.

[19]  Daniel A. Keim,et al.  Matrix-based visual correlation analysis on large timeseries data , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[20]  Parisa Rashidi,et al.  Using symbolic aggregate approximation (SAX) to visualize activity transitions among older adults. , 2016, Physiological measurement.

[21]  Heidrun Schumann,et al.  Visualization of Time-Oriented Data , 2011, Human-Computer Interaction Series.

[22]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[23]  Andrew Vande Moere,et al.  Time-Varying Data Visualization Using Information Flocking Boids , 2004, IEEE Symposium on Information Visualization.

[24]  David A. Clifton,et al.  Multi-Task Gaussian Processes for Multivariate Physiological Time-Series Analysis , 2014 .

[25]  K. Pillai Some New Test Criteria in Multivariate Analysis , 1955 .

[26]  Carl de Boor,et al.  A Practical Guide to Splines , 1978, Applied Mathematical Sciences.

[27]  H. Hotelling The Generalization of Student’s Ratio , 1931 .

[28]  Peter Fu-Ming Hu,et al.  Predicting Patient Outcomes from a Few Hours of High Resolution Vital Signs Data , 2012, 2012 11th International Conference on Machine Learning and Applications.

[29]  Hung-Hsuan Huang,et al.  Time Series Classification Method Based on Longest Common Subsequence and Textual Approximation , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[30]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..