NetScatter: Visual analytics of multivariate time series with a hybrid of dynamic and static variable relationships

The ability to capture common characteristics among complex multi-variate time series variables can profoundly impact big data analytics in uncovering valuable insights into the relationships among them and enabling a dimensionality reduction technique. Two widely used data displays include time series and scatter plots. While the former focuses on the dynamics over time, the latter deals with static relationships among variables. Motivated by these distinctive perspectives, our research aims to maximally utilize the information captured by both at the same time. This paper presents NetScatter, a visual analytic approach to characterizing changes of pairwise relationships in a high-dimensional time series. Unlike most traditional techniques that employ a single perspective of the visual display, our approach combines static perspectives of two variables in multi-variate time series into a single representation by comparing all data instances over two different time steps. The paper also introduces a list of visual features of the representation to capture how overall data evolve. We have implemented a web-based prototype that supports a full range of operations, such as ranking, filtering, and details on demand. The paper illustrates the proposed approach on data of various sizes in different domains to demonstrate its benefits.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Leland Wilkinson,et al.  TimeSeer: Scagnostics for High-Dimensional Time Series , 2013, IEEE Transactions on Visualization and Computer Graphics.

[3]  Florian Heimerl,et al.  Visual Designs for Binned Aggregation of Multi-Class Scatterplots , 2018, ArXiv.

[4]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[5]  Leland Wilkinson,et al.  Visualizing Big Data Outliers Through Distributed Aggregation , 2018, IEEE Transactions on Visualization and Computer Graphics.

[6]  M. B. Heng,et al.  Cambodian health in transition , 1995, BMJ.

[7]  Daniel A. Keim,et al.  Visual Analytics: Scope and Challenges , 2008, Visual Data Mining.

[8]  Helwig Löffelmann,et al.  Visualizing the behaviour of higher dimensional dynamical systems , 1997, Proceedings. Visualization '97 (Cat. No. 97CB36155).

[9]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[10]  Natwar M. Gandhi Government of the District of Columbia , 2010 .

[11]  Martin Kleinsteuber,et al.  Glyphboard: Visual Exploration of High-Dimensional Data Combining Glyphs with Dimensionality Reduction , 2020, IEEE Transactions on Visualization and Computer Graphics.

[12]  Manish Marwah,et al.  Visual exploration of frequent patterns in multivariate time series , 2012, Inf. Vis..

[13]  Tamara Munzner,et al.  MizBee: A Multiscale Synteny Browser , 2009, IEEE Transactions on Visualization and Computer Graphics.

[14]  Doris Dransch,et al.  A Visual Analytics Approach to Multiscale Exploration of Environmental Time Series , 2012, IEEE Transactions on Visualization and Computer Graphics.

[15]  Lei Jin A data-driven test to compare two or multiple time series , 2011, Comput. Stat. Data Anal..

[16]  Ruey S. Tsay,et al.  Multivariate Time Series Analysis: With R and Financial Applications , 2013 .

[17]  Heidrun Schumann,et al.  Axes-based visualizations with radial layouts , 2004, SAC '04.

[18]  Werner Purgathofer,et al.  Visual Analytics for Complex Engineering Systems: Hybrid Visual Steering of Simulation Ensembles , 2014, IEEE Transactions on Visualization and Computer Graphics.

[19]  Daniel A. Keim,et al.  Temporal MDS Plots for Analysis of Multivariate Data , 2016, IEEE Transactions on Visualization and Computer Graphics.

[20]  Marta Mattoso,et al.  Adaptive Normalization: A novel data normalization approach for non-stationary time series , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[21]  Leland Wilkinson,et al.  ScagExplorer: Exploring Scatterplots by Their Scagnostics , 2014, 2014 IEEE Pacific Visualization Symposium.

[22]  Didier Sornette,et al.  Critical Market Crashes , 2003, cond-mat/0301543.

[23]  D. Richardson,et al.  Quantitative metrics for comparison of in-cylinder velocity fields using particle image velocimetry , 2020, Experiments in Fluids.

[24]  David C. Hoaglin,et al.  Volume 16: How to Detect and Handle Outliers , 2013 .

[25]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[26]  Rattikorn Hewett,et al.  Congnostics: Visual Features for Doubly Time Series Plots , 2020, EuroVA@Eurographics/EuroVis.

[27]  Lai Guan Ng,et al.  Dimensionality reduction for visualizing single-cell data using UMAP , 2018, Nature Biotechnology.

[28]  Patrick Riehmann,et al.  Time‐Series Plots Integrated in Parallel‐Coordinates Displays , 2016, Comput. Graph. Forum.

[29]  Tommy Dang,et al.  Visual Features for Multivariate Time Series , 2020, IAIT.

[30]  R. Grossman,et al.  Graph-theoretic scagnostics , 2005, IEEE Symposium on Information Visualization, 2005. INFOVIS 2005..

[31]  Kwan-Liu Ma,et al.  An Incremental Dimensionality Reduction Method for Visualizing Streaming Multidimensional Data , 2019, IEEE Transactions on Visualization and Computer Graphics.

[32]  Hanspeter Pfister,et al.  What Makes a Visualization Memorable? , 2013, IEEE Transactions on Visualization and Computer Graphics.

[33]  J. Tukey,et al.  Variations of Box Plots , 1978 .

[34]  Leland Wilkinson,et al.  Timeseer: detecting interesting distributions in multiple time series data , 2012, VINCI.

[35]  Silvia Miksch,et al.  A Visual Analytics Approach to Segmenting and Labeling Multivariate Time Series Data , 2014, EuroVA@EuroVis.

[36]  Chad A. Steed,et al.  Falcon: Visual analysis of large, irregularly sampled, and multivariate time series data in additive manufacturing , 2017, Comput. Graph..

[37]  Rainald Löhner,et al.  PedPIV: Pedestrian Velocity Extraction From Particle Image Velocimetry , 2020, IEEE Transactions on Intelligent Transportation Systems.

[38]  Luke J. Gosink,et al.  Meta parallel coordinates for visualizing features in large, high-dimensional, time-varying data , 2012, IEEE Symposium on Large Data Analysis and Visualization (LDAV).

[39]  Xianghua Xie,et al.  Clustering and Classification for Time Series Data in Visual Analytics: A Survey , 2019, IEEE Access.

[40]  Liz J. Stuart,et al.  Animator: a tool for the animation of parallel coordinates , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[41]  Yong Chen,et al.  RadarViewer : Visualizing the dynamics of multivariate data , 2020, PEARC.

[42]  D. W. Scott A Note on Choices of Bivariate Histogram Bin Shape , 1985 .

[43]  Steven Franconeri,et al.  Ranking Visualizations of Correlation Using Weber's Law , 2014, IEEE Transactions on Visualization and Computer Graphics.

[44]  B. Pompe,et al.  Permutation entropy: a natural complexity measure for time series. , 2002, Physical review letters.

[45]  Vung Pham,et al.  MTSAD: Multivariate Time Series Abnormality Detection and Visualization , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[46]  Robert L. Grossman,et al.  High-Dimensional Visual Analytics: Interactive Exploration Guided by Pairwise Views of Point Distributions , 2006, IEEE Transactions on Visualization and Computer Graphics.

[47]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[48]  Michael Burch,et al.  On the Benefits and Drawbacks of Radial Diagrams , 2014, Handbook of Human Centric Visualization.