Translation of Relational and Non-relational Databases into RDF with xR2RML

With the growing amount of data being continuously produced, it is crucial to come up with solutions to expose data from ever more heterogeneous databases (e.g. NoSQL systems) as linked data. In this paper we present xR2RML, a language designed to describe the mapping of various types of databases to RDF. xR2RML flexibly adapts to heterogeneous query languages and data models while remaining free from any specific language or syntax. It extends R2RML, the W3C recommendation for the mapping of relational databases to RDF, and relies on RML for the handling of various data representation formats. We analyse data models of several modern databases as well as the format in which query results are returned, and we show that xR2RML can translate any data element within such results into RDF, relying on existing languages such as XPath and JSONPath if needed. We illustrate some features of xR2RML such as the generation of RDF collections and containers, and the ability to deal with mixed content.

[1]  Freddy Priyatna,et al.  MIRROR: Automatic R2RML Mapping Generation from Relational Databases , 2015, ICWE.

[2]  Rik Van de Walle,et al.  RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data , 2014, LDOW.

[3]  Johan Montagnat,et al.  A survey of RDB to RDF translation approaches and tools , 2014 .

[4]  Yannis Papakonstantinou,et al.  The SQL++ Unifying Semi-structured Query Language, and an Expressiveness Benchmark of SQL-on-Hadoop, NoSQL and NewSQL Databases , 2014 .

[5]  Freddy Priyatna,et al.  Formalisation and experiences of R2RML-based SPARQL to SQL query translation using morph , 2014, WWW.

[6]  Laura M. Haas,et al.  Clio: Schema Mapping Creation and Data Exchange , 2009, Conceptual Modeling: Foundations and Applications.

[7]  Philip Fennell,et al.  Schematron - More useful than you'd thought , 2014 .

[8]  Vanja Josifovski,et al.  SQL/MED: a status report , 2002, SGMD.

[9]  Stefan Jablonski,et al.  NoSQL evaluation: A use case oriented survey , 2011, 2011 International Conference on Cloud and Service Computing.

[10]  Kristina Lerman,et al.  Semi-automatically Mapping Structured Sources into the Semantic Web , 2012, ESWC.

[11]  Nikolas Mitrou,et al.  Bringing relational databases into the Semantic Web: A survey , 2012, Semantic Web.

[12]  Wolfram Wöß,et al.  XLWrap - Querying and Integrating Arbitrary Spreadsheets with SPARQL , 2009, SEMWEB.

[13]  Alban Gaignard,et al.  Distributed knowledge sharing and production through collaborative e-Science platforms. (Partage et production de connaissances distribuées dans des plateformes scientifiques collaboratives) , 2013 .

[14]  Johan Montagnat,et al.  xR2RML: Relational and Non-Relational Databases to RDF Mapping Language , 2017 .

[15]  Stefan Decker,et al.  Mapping between RDF and XML with XSPARQL , 2012, Journal on Data Semantics.

[16]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.

[17]  Katja Hose,et al.  FedX: Optimization Techniques for Federated Query Processing on Linked Data , 2011, SEMWEB.

[18]  Daniel P. Miranker,et al.  Ultrawrap: SPARQL execution on relational data , 2013, J. Web Semant..

[19]  Daan Broeder,et al.  Realising the full potential of research data: common challenges in data management, sharing and integration across scientific disciplines , 2013 .

[20]  Daniel P. Miranker,et al.  Survey of directly mapping SQL databases to the Semantic Web , 2011, The Knowledge Engineering Review.

[21]  Umar Farooq Minhas,et al.  SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures , 2014, Proc. VLDB Endow..

[22]  Nektarios Gioldasis,et al.  The SPARQL2XQuery interoperability framework , 2013, World Wide Web.

[23]  Rik Van de Walle,et al.  Mapping Hierarchical Sources into RDF Using the RML Mapping Language , 2014, 2014 IEEE International Conference on Semantic Computing.

[24]  Mitesh Patel,et al.  Accessing the deep web , 2007, CACM.

[25]  F. Breitling A standard transformation from XML to RDF via XSLT , 2009, ArXiv.

[26]  Maribel Acosta,et al.  ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints , 2011, SEMWEB.

[27]  Raphaël Troncy,et al.  Enabling Linked Data Publication with the Datalift Platform , 2012, Semantic Cities @ AAAI.

[28]  Patrick Valduriez,et al.  CloudMdsQL: querying heterogeneous cloud data stores with a common language , 2016, Distributed and Parallel Databases.