Complex Data Transformations in Digital Libraries with Spatio-Temporal Information

The DIGMAP project researched automated methods for enriching metadata records with structured geo-temporal information. This paper presents our findings regarding the use of XML technology for expressing transformations between the different XML schemas used in DIGMAP metadata records and service interfaces. Both XSLT and XQuery are functional, declarative languages that effectively support XML data integration. They are also extensible, in the sense that new functions can be specified in Java and then combined with general XPath expressions. We extended an XSLT/Xquery engine with additional functions for processing spatio-temporal information and for dealing with incompleteness and inconsistencies in the data. The paper discusses the application over different XML formats and metadata standards.

[1]  Xia Lin,et al.  International Conference on Dublin Core and Metadata Applications , 2008 .

[2]  Lassi Lehto,et al.  Real-time content transformations in a Web service based delivery architecture for geographic information , 2007 .

[3]  Hae-Young Bae,et al.  An extension of XQuery for moving objects over GML , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[4]  Guan Jihong,et al.  GQL: Extending Xquery to query GML documents , 2006 .

[5]  Stephan Kepser,et al.  A Simple Proof for the Turing-Completeness of XSLT and XQuery , 2004, Extreme Markup Languages®.

[6]  Dimitre Novatchev Higher-Order Functional Programming with XSLT 2.0 and FXSL , 2006 .

[7]  José Luis Borbinha,et al.  Extracting and Exploring the Geo-Temporal Semantics of Textual Resources , 2008, 2008 IEEE International Conference on Semantic Computing.

[8]  James Frew,et al.  Geographic Names: The Implementation of a Gazetteer in a Georeferenced Digital Library , 1999, D Lib Mag..

[9]  Stuart Macdonald,et al.  User Engagement in Research Data Curation , 2009, ECDL.

[10]  Paolo Manghi,et al.  Hybrid applications over XML: integrating the procedural and declarative approaches , 2002, WIDM '02.

[11]  Diane Hillmann,et al.  Improving Metadata Quality: Augmentation and Recombination , 2004, Dublin Core Conference.

[12]  José Luis Borbinha,et al.  A geo-temporal Web gazetteer integrating data from multiple sources , 2008, 2008 Third International Conference on Digital Information Management.

[13]  Corey Keith Using XSLT to manipulate MARC metadata , 2004 .

[14]  James Frew,et al.  A Hybrid Declarative/Procedural Metadata Mapping Language Based on Python , 2005, ECDL.

[15]  Brian Matthews,et al.  Inter-Thesaurus Mapping , 2004 .

[16]  James F. Allen Time and time again: The many ways to represent time , 1991, Int. J. Intell. Syst..

[17]  Martin Doerr,et al.  Semantic Problems of Thesaurus Mapping , 2006, J. Digit. Inf..

[18]  José Luis Borbinha,et al.  Consolidation of References to Persons in Bibliographic Databases , 2008, ICADL.

[19]  Eric Childress,et al.  A Repository of Metadata Crosswalks , 2004, D Lib Mag..