Sequen-C: A Multilevel Overview of Temporal Event Sequences

Building a visual overview of temporal event sequences with an optimal level-of-detail (i.e. simplified but informative) is an ongoing challenge - expecting the user to zoom into every important aspect of the overview can lead to missing insights. We propose a technique to build a multilevel overview of event sequences, whose granularity can be transformed across sequence clusters (vertical level-of-detail) or longitudinally (horizontal level-of-detail), using hierarchical aggregation and a novel cluster data representation Align-Score-Simplify. By default, the overview shows an optimal number of sequence clusters obtained through the average silhouette width metric - then users are able to explore alternative optimal sequence clusterings. The vertical level-of-detail of the overview changes along with the number of clusters, whilst the horizontal level-of-detail refers to the level of summarization applied to each cluster representation. The proposed technique has been implemented into a visualization system called Sequence Cluster Explorer (Sequen-C) that allows multilevel and detail-on-demand exploration through three coordinated views, and the inspection of data attributes at cluster, unique sequence, and individual sequence level. We present two case studies using real-world datasets in the healthcare domain: CUREd and MIMIC-III; which demonstrate how the technique can aid users to obtain a summary of common and deviating pathways, and explore data attributes for selected patterns.

[1]  Ben Shneiderman,et al.  LifeFlow: visualizing an overview of event sequences , 2011, CHI.

[2]  Cody Dunne,et al.  IDMVis: Temporal Event Sequence Visualization for Type 1 Diabetes Treatment Decision Support , 2019, IEEE Transactions on Visualization and Computer Graphics.

[3]  Jarke J. van Wijk,et al.  Exploring Multivariate Event Sequences Using Rules, Aggregations, and Selections , 2018, IEEE Transactions on Visualization and Computer Graphics.

[4]  Leo Goodstadt,et al.  CHROMA: consensus-based colouring of multiple alignments for publication , 2001, Bioinform..

[5]  Yang Wang,et al.  Patterns and Sequences: Interactive Exploration of Clickstreams to Understand Common Visitor Paths , 2017, IEEE Transactions on Visualization and Computer Graphics.

[6]  Ben Shneiderman,et al.  Temporal Event Sequence Simplification , 2013, IEEE Transactions on Visualization and Computer Graphics.

[7]  P. Alam ‘W’ , 2021, Composites Engineering.

[8]  Gang Wang,et al.  Unsupervised Clickstream Clustering for User Behavior Analysis , 2016, CHI.

[9]  Ben Shneiderman,et al.  A Visual Interface for Multivariate Temporal Data: Finding Patterns of Events across Multiple Histories , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[10]  David Borland,et al.  Visual Analysis of High-Dimensional Event Sequence Data via Dynamic Hierarchical Aggregation , 2020, IEEE Transactions on Visualization and Computer Graphics.

[11]  Cody Dunne,et al.  Sequence Braiding: Visual Overviews of Temporal Event Sequences and Attributes. , 2020, IEEE transactions on visualization and computer graphics.

[12]  Jimeng Sun,et al.  Visual cluster analysis in support of clinical decision intelligence. , 2011, AMIA ... Annual Symposium proceedings. AMIA Symposium.

[13]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[14]  David Gotz,et al.  DecisionFlow: Visual Analytics for High-Dimensional Temporal Event Sequence Data , 2014, IEEE Transactions on Visualization and Computer Graphics.

[15]  Kwan-Liu Ma,et al.  Visual cluster exploration of web clickstream data , 2012, 2012 IEEE Conference on Visual Analytics Science and Technology (VAST).

[16]  Joydeep Ghosh,et al.  Data Clustering Algorithms And Applications , 2013 .

[17]  Wei Liu,et al.  SSRDVis: Interactive visualization for event sequences summarization and rare detection , 2020, J. Vis..

[18]  Lusheng Wang,et al.  SEGID: Identifying Interesting Segments in (Multiple) Sequence Alignments , 2003, Bioinform..

[19]  Mira Dontcheva,et al.  CoreFlow: Extracting and Visualizing Branching Patterns from Event Sequences , 2017, Comput. Graph. Forum.

[20]  Ke Xu,et al.  EventThread: Visual Summarization and Stage Analysis of Event Sequence Data , 2018, IEEE Transactions on Visualization and Computer Graphics.

[21]  Hongyuan Zha,et al.  Visual Progression Analysis of Event Sequence Data , 2019, IEEE Transactions on Visualization and Computer Graphics.

[22]  Ben Shneiderman,et al.  LifeLines: using visualization to enhance navigation and analysis of patient records , 1998, AMIA.

[23]  Jimmy J. Lin,et al.  Using visualizations to monitor changes and harvest insights from a global-scale logging infrastructure at Twitter , 2014, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).

[24]  Esko Ukkonen,et al.  Approximate String Matching with q-grams and Maximal Matches , 1992, Theor. Comput. Sci..

[25]  Christopher J. Lee Generating Consensus Sequences from Partial Order Multiple Sequence Alignment Graphs , 2003, Bioinform..

[26]  Matthew D. Cooper,et al.  ActiviTree: Interactive Visual Exploration of Sequences in Event-Based Data Using Graph Similarity , 2009, IEEE Transactions on Visualization and Computer Graphics.

[27]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[28]  Zhaonan Sun,et al.  DPVis: Visual Analytics With Hidden Markov Models for Disease Progression Pathways , 2019, IEEE Transactions on Visualization and Computer Graphics.

[29]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[30]  David Gotz,et al.  Exploring Flow, Factors, and Outcomes of Temporal Event Sequences with the Outflow Visualization , 2012, IEEE Transactions on Visualization and Computer Graphics.

[31]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[32]  Yuanzhe Chen,et al.  Sequence Synopsis: Optimize Visual Summary of Temporal Event Data , 2018, IEEE Transactions on Visualization and Computer Graphics.

[33]  Ben Shneiderman,et al.  Coping with Volume and Variety in Temporal Event Sequences: Strategies for Sharpening Analytic Focus , 2017, IEEE Transactions on Visualization and Computer Graphics.

[34]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[35]  R. Doolittle,et al.  Progressive sequence alignment as a prerequisitetto correct phylogenetic trees , 2007, Journal of Molecular Evolution.

[36]  Evangelos E. Milios,et al.  LogView: Visualizing Event Log Clusters , 2008, 2008 Sixth Annual Conference on Privacy, Security and Trust.

[37]  R. Zimdahl in and Other , 2020, Agricultural Ethics - An Invitation.

[38]  Bum Chul Kwon,et al.  Peekquence : Visual Analytics for Event Sequence Data , 2016 .

[39]  Cody Dunne,et al.  Evaluating Alignment Approaches in Superimposed Time-Series and Temporal Event-Sequence Visualizations , 2019, ArXiv.

[40]  Ben Shneiderman,et al.  The challenges of specifying intervals and absences in temporal queries: a graphical language approach , 2013, CHI.

[41]  Ivan Marsic,et al.  Evaluation of Trace Alignment Quality and its Application in Medical Process Mining , 2017, 2017 IEEE International Conference on Healthcare Informatics (ICHI).

[42]  Mira Dontcheva,et al.  MatrixWave: Visual Comparison of Event Sequence Data , 2015, CHI.

[43]  Wil M. P. van der Aalst,et al.  Trace Alignment in Process Mining: Opportunities for Process Diagnostics , 2010, BPM.

[44]  Femke Ongenae,et al.  Data Mining in the Development of Mobile Health Apps: Assessing In-App Navigation Through Markov Chain Analysis , 2018, Journal of medical Internet research.

[45]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[46]  Fei Wang,et al.  Frequence: interactive mining and visualization of temporal frequent event sequences , 2014, IUI.

[47]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[48]  Jean-Daniel Fekete,et al.  Hierarchical Aggregation for Information Visualization: Overview, Techniques, and Design Guidelines , 2010, IEEE Transactions on Visualization and Computer Graphics.

[49]  Mark P. J. van der Loo,et al.  The stringdist Package for Approximate String Matching , 2014, R J..

[50]  Jenny Dankelman,et al.  Workflow mining and outlier detection from clinical activity logs , 2012, J. Biomed. Informatics.

[51]  Olivier Thonnard,et al.  VASABI: Hierarchical User Profiles for Interactive Visual User Behaviour Analytics , 2020, IEEE Transactions on Visualization and Computer Graphics.

[52]  Padhraic Smyth,et al.  Model-Based Clustering and Visualization of Navigation Patterns on a Web Site , 2003, Data Mining and Knowledge Discovery.

[53]  Emanuel Zgraggen,et al.  (s|qu)eries: Visual Regular Expressions for Querying and Exploring Event Sequences , 2015, CHI.