A Visual Digital Library Approach for Time-Oriented Scientific Primary Data

Digital Library support for textual and certain types of nontextual documents has significantly advanced over the last years. While Digital Library support implies many aspects along the whole library workflow model, interactive and visual retrieval allowing effective query formulation and result presentation are important functions. Recently, new kinds of non-textual documents which merit Digital Library support, but yet cannot be accommodated by existing Digital Library technology, have come into focus. Scientific primary data, as produced for example, by scientific experimentation, earth observation, or simulation, is such a data type. We report on a concept and first implementation of Digital Library functionality, supporting visual retrieval and exploration in a specific important class of scientific primary data, namely, time-oriented data. The approach is developed in an interdisciplinary effort by experts from the library, natural sciences, and visual analytics communities. In addition to presenting the concept and discussing relevant challenges, we present results from a first implementation of our approach as applied on a real-world scientific primary data set.

[1]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[2]  Ramakrishnan Srikant,et al.  An Implementation of P3P Using Database Technology , 2004, EDBT.

[3]  Jon W. Dunn,et al.  VARIATIONS: a digital music library system at Indiana University , 1999, DL '99.

[4]  Jarke J. van Wijk,et al.  Cluster and Calendar Based Visualization of Time Series Data , 1999, INFOVIS.

[5]  Tobias Schreck,et al.  Visual Cluster Analysis of Trajectory Data with Interactive Kohonen Maps , 2009 .

[6]  Kyuseok Shim,et al.  Fast Similarity Search in the Presence of Noise, Scaling, and Translation in Time-Series Databases , 1995, VLDB.

[7]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[8]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[9]  William Ribarsky,et al.  WireVis: Visualization of Categorical, Time-Varying Data From Financial Transactions , 2007, 2007 IEEE Symposium on Visual Analytics Science and Technology.

[10]  Reinhard Klein,et al.  The PROBADO Project - Approach and Lessons Learned in Building a Digital Library System for Heterogeneous Non-textual Documents , 2010, ECDL.

[11]  Ben Shneiderman,et al.  Dynamic query tools for time series data sets: timebox widgets for interactive exploration , 2004 .

[12]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[13]  Kresimir Simunic Visualization of Stock Market Charts , 2003, WSCG.

[14]  Raoul Wessel,et al.  Demonstration of User Interfaces for Querying in 3D Architectural Content in PROBADO3D , 2009, ECDL.

[15]  Martin Wattenberg,et al.  Sketching a graph to query a time-series database , 2001, CHI Extended Abstracts.

[16]  Daniel A. Keim,et al.  Visual market sector analysis for financial time series data , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[17]  Ben Shneiderman,et al.  Visual information seeking: tight coupling of dynamic query filters with starfield displays , 1994, CHI '94.

[18]  Jan Brase Using Digital Library Techniques - Registration of Scientific Primary Data , 2004, ECDL.

[19]  Hannes Grobe PANGAEA - Publishing Network for Geoscientific & Environmental Data , 2009 .

[20]  Sandra Payette,et al.  Fedora: an architecture for complex objects and their relationships , 2005, International Journal on Digital Libraries.

[21]  Pasquale Pagano,et al.  OpenDLib: A Digital Library Service System , 2002, ECDL.

[22]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[23]  Eamonn J. Keogh,et al.  Dimensionality Reduction for Fast Similarity Search in Large Time Series Databases , 2001, Knowledge and Information Systems.

[24]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[25]  Heiko Schuldt,et al.  DelosDLMS - The Integrated DELOS Digital Library Management System , 2007, DELOS.

[26]  Ian H. Witten,et al.  Greenstone: a comprehensive open-source digital library software system , 2000, DL '00.

[27]  Heidrun Schumann,et al.  Visualizing time-oriented data - A systematic view , 2007, Comput. Graph..