A visual digital library approach for time-oriented scientific primary data

Digital Library support for textual and certain types of non-textual documents has advanced significantly in recent years. While Digital Library support involves many aspects along the whole library workflow model, interactive and visual retrieval that allows effective query formulation and result presentation is an important function. Recently, new kinds of non-textual documents that merit Digital Library support, but cannot yet be fully accommodated by existing Digital Library technology, have come into focus. Scientific data, as produced, for example, by scientific experimentation, simulation, or observation, is one such document type. In this article, we report on a concept and first implementation of Digital Library functionality for supporting visual retrieval and exploration in a specific, important class of scientific primary data, namely time-oriented research data. The approach has been developed in an interdisciplinary effort by experts from the library, natural sciences, and visual analytics communities. In addition to presenting the concept and discussing relevant challenges, we present results from a first implementation of our approach, applied to a real-world scientific primary data set. We also report on initial user feedback obtained during discussions with domain experts from the earth observation sciences, which indicates the usefulness of our approach.
