Dynamic query tools for time series data sets: timebox widgets for interactive exploration

Timeboxes are rectangular widgets that can be used in direct-manipulation graphical user interfaces (GUIs) to specify query constraints on time series data sets. Timeboxes are used to specify simultaneously two sets of constraints: given a set of N time series profiles, a timebox covering time periods x1...x2 (x1<x2) and values y1...y2(y1≤y2) will retrieve only those n ∈N that have values y1≤y≤y2 during all times x1≤x≤x2. TimeSearcher is an information visualization tool that combines timebox queries with overview displays, query-by-example facilities, and support for queries over multiple time-varying attributes. Query manipulation tools including pattern inversion and 'leaders & laggards' graphical bookmarks provide additional support for interactive exploration of data sets. Extensions to the basic timebox model that provide additional expressivity include variable time timeboxes, which can be used to express queries with variability in the time interval, and angular queries, which search for ranges of differentials, rather than absolute values. Analysis of the algorithmic requirements for providing dynamic query performance for timebox queries showed that a sequential search outperformed searches based on geometric indices. Design studies helped identify the strengths and weaknesses of the query tools. Extended case studies involving the analysis of two different types of data from molecular biology experiments provided valuable feedback and validated the utility of both the timebox model and the TimeSearcher tool. Timesearcher is available at http://www.cs.umd.edu/hcil/timesearcher

[1]  Chris North,et al.  Snap-Together Visualization: Evaluating Coordination Usage and Construction , 1999 .

[2]  Martin Wattenberg,et al.  Sketching a graph to query a time-series database , 2001, CHI Extended Abstracts.

[3]  Ben Shneiderman,et al.  An Augmented Visual Query Mechanism for Finding Patterns in Time Series Data , 2002, FQAS.

[4]  Phillip A Sharp,et al.  Predictive Identification of Exonic Splicing Enhancers in Human Genes , 2002, Science.

[5]  John V. Carlis,et al.  Interactive visualization of serial periodic data , 1998, UIST '98.

[6]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[7]  Chris North,et al.  Snap-together visualization: a user interface for coordinating visualizations via relational schemata , 2000, AVI '00.

[8]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[9]  C. Thummel,et al.  E93 directs steroid-triggered programmed cell death in Drosophila. , 2000, Molecular cell.

[10]  Christos Faloutsos,et al.  Fast subsequence matching in time-series databases , 1994, SIGMOD '94.

[11]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[12]  Ben Shneiderman,et al.  Dynamic queries for visual information seeking , 1994, IEEE Software.

[13]  Matthew O. Ward,et al.  High Dimensional Brushing for Interactive Exploration of Multivariate Data , 1995, Proceedings Visualization '95.

[14]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[15]  Jock D. Mackinlay,et al.  Visualizing the evolution of Web ecologies , 1998, CHI.

[16]  Hisham K. Hamadeh,et al.  Gene Chips and Functional Genomics , 2000, American Scientist.

[17]  Stephen M. Mount,et al.  Splicing signals in Drosophila: intron size, information content, and consensus sequences. , 1992, Nucleic acids research.

[18]  Daniel A. Keim,et al.  Pixel-Oriented Visualization Techniques for Exploring Very Large Data Bases , 1996 .

[19]  Eamonn J. Keogh,et al.  Relevance feedback retrieval of time series data , 1999, SIGIR '99.

[20]  John W. S. Brown,et al.  Mutational analysis of a plant branchpoint and polypyrimidine tract required for constitutive splicing of a mini-exon. , 2002, RNA.

[21]  Ben Shneiderman,et al.  Tree visualization with tree-maps: 2-d space-filling approach , 1992, TOGS.

[22]  E. Baehrecke,et al.  Steroid regulation of programmed cell death during Drosophila development , 2000, Cell Death and Differentiation.

[23]  Ben Shneiderman,et al.  Visualization and analysis of microarray and gene ontology data with treemaps , 2004, BMC Bioinformatics.

[24]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Jeffrey P. Morrill Distributed recognition of patterns in time series data , 1998, CACM.

[26]  Ben Shneiderman,et al.  Interactive Exploration of Time Series Data , 2001, Discovery Science.

[27]  Giuseppe Psaila,et al.  Querying Shapes of Histories , 1995, VLDB.

[28]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[29]  Helwig Hauser,et al.  Angular brushing of extended parallel coordinates , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[30]  Tanya M. Teslovich,et al.  Genome-Wide Analyses of Steroid- and Radiation-Triggered Programmed Cell Death in Drosophila , 2003, Current Biology.

[31]  Mark de Berg,et al.  Computational geometry: algorithms and applications , 1997 .

[32]  Ben Shneiderman,et al.  Interactively Exploring Hierarchical Clustering Results , 2002, Computer.

[33]  Alfred Inselberg,et al.  Classification and visualization for high-dimensional data , 2000, KDD '00.

[34]  Eric H. Baehrecke,et al.  How death shapes life during development , 2002, Nature Reviews Molecular Cell Biology.

[35]  Benjamin B. Bederson,et al.  Toolkit design for interactive structured graphics , 2004, IEEE Transactions on Software Engineering.

[36]  Ben Shneiderman,et al.  Understanding Hierarchical Clustering Results by Interactive Exploration of Dendrograms: A Case Study with Genomic Microarray Data , 2003 .

[37]  Chris North,et al.  Snap-together visualization: can users construct and operate coordinated visualizations? , 2000, Int. J. Hum. Comput. Stud..

[38]  Ben Shneiderman,et al.  Interactive graphical querying of time series and linear sequence data sets , 2003 .

[39]  Raghu Ramakrishnan,et al.  MIMSY: A System for Analyzing Time Series Data in the Stock Market Domain , 1993, Workshop on Programming with Logic Databases , ILPS.