FunUL: a method to incorporate functions into uplift mapping languages

Typically tools that map non-RDF data into RDF format rely on the technology native to the source of the data when manipulation of data during the mapping is required. Depending on the data format, data manipulation can be performed using underlying technology, such as RDBMS for relational databases or XPath for XML. For CSV/Tabular data there is no such underlying technology, and instead transforming the source data into another format or pre/post-processing techniques are used. As part of this paper, we present a comparison framework for the state-of-the-art in converting CSV/Tabular data into RDF, where a key feature evaluated is transformation functions. We argue that existing approaches for transformation functions in such tools are complex - in number of steps and tools involved - and therefore not as traceable and transparent as one would like. We tackle these problems by defining a more generic, usable and amenable method to incorporate functions into uplift mapping languages, called FunUL. As proof of concept, we show an implementation of our method. Moreover, by using a real world Digital Humanities case study, we compare our approach with other approaches that we have identified to include transformation functions as part of the mapping for CSV/Tabular data.

[1]  Rik Van de Walle,et al.  RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data , 2014, LDOW.

[2]  Christophe Debruyne,et al.  R2RML-F: Towards Sharing and Executing Domain Logic in R2RML Mappings , 2016, LDOW@WWW.

[3]  Rik Van de Walle,et al.  Automated Metadata Generation for Linked Data Generation and Publishing Workflows , 2016, LDOW@WWW.

[4]  Gerald Reif,et al.  A comparison of RDB-to-RDF mapping languages , 2011, I-Semantics '11.

[5]  Kevin Feeney,et al.  Seshat: The Global History Databank , 2015 .

[6]  Rik Van de Walle,et al.  RMLEditor: A Graph-Based Mapping Editor for Linked Data Mappings , 2016, ESWC.

[7]  Thomas E. Currie,et al.  Building the Seshat Ontology for a Global History Databank , 2016, ESWC.

[8]  Andriy Nikolov,et al.  DataOps: Seamless End-to-End Anything-to-RDF Data Integration , 2015, ESWC.

[9]  Sebastian Rudolph,et al.  Foundations of Semantic Web Technologies , 2009 .

[10]  Peter Fox,et al.  Effective Tooling for Linked Data Publishing in Scientific Research , 2016, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC).

[11]  Raphaël Troncy,et al.  Enabling Linked Data Publication with the Datalift Platform , 2012, Semantic Cities @ AAAI.

[12]  Jens Lehmann,et al.  Simplified RDB2RDF Mapping , 2015, LDOW@WWW.

[13]  Craig A. Knoblock,et al.  KR2RML: An Alternative Interpretation of R2RML for Heterogenous Sources , 2015, COLD.

[14]  Kevin Feeney,et al.  Publishing Social Sciences Datasets as Linked Data: a Political Violence Case Study , 2013 .

[15]  Johan Montagnat,et al.  Translation of Relational and Non-relational Databases into RDF with xR2RML , 2015, WEBIST.

[16]  Christophe Debruyne,et al.  Incorporating Functions in Mappings to Facilitate the Uplift of CSV Files into RDF , 2016, ESWC.

[17]  G. Kellogg,et al.  Model for Tabular Data and Metadata on the Web , 2015 .