Interactive Visual Transformation for Symbolic Representation of Time-Oriented Data

Data Mining on time-oriented data has many real-world applications, like optimizing shift plans for shops or hospitals, or analyzing traffic or climate. As those data are often very large and multi-variate, several methods for symbolic representation of time-series have been proposed. Some of them are statistically robust, have a lower-bound distance measure, and are easy to configure, but do not consider temporal structures and domain knowledge of users. Other approaches, proposed as basis for Apriori pattern finding and similar algorithms, are strongly configurable, but the parametrization is hard to perform, resulting in ad-hoc decisions. Our contribution combines the strengths of both approaches: an interactive visual interface that helps defining event classes by applying statistical computations and domain knowledge at the same time. We are not focused on a particular application domain, but intend to make our approach useful for any kind of time-oriented data.

[1]  Yuval Shahar,et al.  Intelligent selection and retrieval of multiple time-oriented records , 2010, Journal of Intelligent Information Systems.

[2]  M S Magnusson,et al.  Discovering hidden time patterns in behavior: T-patterns and their detection , 2000, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[3]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[4]  P. S. Sastry,et al.  A survey of temporal data mining , 2006 .

[5]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[6]  Yuval Shahar,et al.  Intelligent visualization and exploration of time-oriented data of multiple patients , 2010, Artif. Intell. Medicine.

[7]  H. Funkhouser,et al.  A Note on a Tenth Century Graph , 1936, Osiris.

[8]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[9]  Yen-Liang Chen,et al.  On mining multi-time-interval sequential patterns , 2009, Data Knowl. Eng..

[10]  Heidrun Schumann,et al.  Visualization of Time-Oriented Data , 2011, Human-Computer Interaction Series.

[11]  Silvia Miksch,et al.  To Score or Not to Score? Tripling Insights for Participatory Design , 2009, IEEE Computer Graphics and Applications.

[12]  A. Akhmetova Discovery of Frequent Episodes in Event Sequences , 2006 .

[13]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[14]  K. Pearson Contributions to the Mathematical Theory of Evolution , 1894 .

[15]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[16]  Alfred Inselberg,et al.  Parallel coordinates for visualizing multi-dimensional geometry , 1987 .

[17]  Guoyin Wang,et al.  Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing , 2013, Lecture Notes in Computer Science.

[18]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[19]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[20]  Georges Gardarin,et al.  Advances in Database Technology — EDBT '96 , 1996, Lecture Notes in Computer Science.

[21]  Stefano Lonardi,et al.  Global detectors of unusual words: design, implementation, and applications to pattern discovery in biosequences , 2001 .

[22]  Otto-von-Guericke Connecting Time-Oriented Data and Information to a Coherent Interactive Visualization , 2004 .

[23]  Andreas Holzinger,et al.  HCI and Usability for Education and Work, 4th Symposium of the WG HCI&UE of the OCG Austrian Computer Society, USAB 2008, LNCS 5298 , 2008 .

[24]  Silvia Miksch,et al.  MuTIny : A MULTI-TIME INTERVAL PATTERN DISCOVERY APPROACH TO PRESERVE THE TEMPORAL INFORMATION IN BETWEEN , 2010 .

[25]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[26]  K. Pearson Contributions to the Mathematical Theory of Evolution. II. Skew Variation in Homogeneous Material , 1895 .

[27]  Ben Shneiderman,et al.  Temporal Summaries: Supporting Temporal Categorical Searching, Aggregation and Comparison , 2009, IEEE Transactions on Visualization and Computer Graphics.

[28]  Alfred Inselberg,et al.  Parallel coordinates: a tool for visualizing multi-dimensional geometry , 1990, Proceedings of the First IEEE Conference on Visualization: Visualization `90.

[29]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[30]  Silvia Miksch,et al.  Visualizations at First Sight: Do Insights Require Training? , 2008, USAB.

[31]  Silvia Miksch,et al.  Does Jason Bourne need Visual Analytics to catch the Jackal? , 2010, EuroVAST@EuroVis.

[32]  Srinath Srinivasa,et al.  Intelligence and Security Informatics, Pacific Asia Workshop, PAISI 2010, Hyderabad, India, June 21, 2010. Proceedings , 2010, PAISI.

[33]  Wolfgang Aigner,et al.  Comparative Evaluation of an Interactive Time‐Series Visualization that Combines Quantitative Data with Qualitative Abstractions , 2012, Comput. Graph. Forum.

[34]  Silvia Miksch,et al.  Hierarchical Temporal Patterns and Interactive Aggregated Views for Pixel-Based Visualizations , 2009, 2009 13th International Conference Information Visualisation.

[35]  Tuan Trung Nguyen,et al.  Rough Set Approach to Domain Knowledge Approximation , 2003, Fundam. Informaticae.

[36]  Matthew D. Cooper,et al.  ActiviTree: Interactive Visual Exploration of Sequences in Event-Based Data Using Graph Similarity , 2009, IEEE Transactions on Visualization and Computer Graphics.

[37]  Stefano Lonardi,et al.  Monotony of surprise and large-scale quest for unusual words. , 2003 .

[38]  B. Marx The Visual Display of Quantitative Information , 1985 .

[39]  Clu-istos Foutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[40]  Christian Tominski,et al.  Event-Based Concepts for User-Driven Visualization , 2011, Inf. Vis..

[41]  R. Larsen,et al.  An introduction to mathematical statistics and its applications (2nd edition) , by R. J. Larsen and M. L. Marx. Pp 630. £17·95. 1987. ISBN 13-487166-9 (Prentice-Hall) , 1987, The Mathematical Gazette.

[42]  ShanGuo Lv,et al.  Study on Ontology Model Based on Rough Set , 2010, 2010 Third International Symposium on Intelligent Information Technology and Security Informatics.

[43]  Eamonn J. Keogh,et al.  Locally adaptive dimensionality reduction for indexing large time series databases , 2001, SIGMOD '01.

[44]  Hans-Peter Kriegel,et al.  Recursive pattern: a technique for visualizing very large amounts of data , 1995, Proceedings Visualization '95.

[45]  Gennady L. Andrienko,et al.  Exploratory analysis of spatial and temporal data - a systematic approach , 2005 .

[46]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[47]  G. U. Yule,et al.  The Foundations of Econometric Analysis: On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer's Sunspot Numbers ( Philosophical Transactions of the Royal Society of London , A, vol. 226, 1927, pp. 267–73) , 1995 .

[48]  Ming-Tat Ko,et al.  Discovering time-interval sequential patterns in sequence databases , 2003, Expert Syst. Appl..