Mapping RDF Databases to Property Graph Databases

RDF triplestores and property graph databases are two approaches for data management which are based on modeling, storing and querying graph-like data. In spite of such common principle, they present special features that complicate the task of database interoperability. While there exist some methods to transform RDF graphs into property graphs, and vice versa, they lack compatibility and a solid formal foundation. This paper presents three direct mappings (schema-dependent and schema-independent) for transforming an RDF database into a property graph database, including data and schema. We show that two of the proposed mappings satisfy the properties of semantics preservation and information preservation. The existence of both mappings allows us to conclude that the property graph data model subsumes the information capacity of the RDF data model.

[1]  Jaroslav Pokorný,et al.  Integrity constraints in graph databases , 2017, ANT/SEIT.

[2]  Heiko Paulheim,et al.  Adoption of the Linked Data Best Practices in Different Topical Domains , 2014, SEMWEB.

[3]  Jens Lehmann,et al.  A Stitch in Time Saves Nine - SPARQL querying of Property Graphs using Gremlin Traversals , 2018, ArXiv.

[4]  Aidan Hogan,et al.  BTC-2019: The 2019 Billion Triple Challenge Dataset , 2019, SEMWEB.

[5]  Amit P. Sheth,et al.  A Formal Graph Model for RDF and Its Implementation , 2016, ArXiv.

[6]  Harsh Thakkar,et al.  RDF and Property Graphs Interoperability: Status and Issues , 2019, AMW.

[7]  Daniel P. Miranker,et al.  On directly mapping relational databases to RDF and OWL , 2012, WWW.

[8]  Dominik Tomaszuk,et al.  DRPD: Architecture for Intelligent Interaction with RDF Prefixes , 2018, DeSemWeb@ISWC.

[9]  Carlos Rojas,et al.  Querying Wikidata: Comparing SPARQL, Relational and Graph Databases , 2016, SEMWEB.

[10]  Peter A. Boncz,et al.  Exploiting Emergent Schemas to Make RDF Systems More Efficient , 2016, SEMWEB.

[11]  Stefan Decker,et al.  Mapping between RDF and XML with XSPARQL , 2012, Journal on Data Semantics.

[12]  Stefano Spaccapietra,et al.  Database Integration: the Key to Data Interoperability , 2022 .

[13]  Young-Koo Lee,et al.  DTD2OWL: automatic transforming XML documents into OWL ontology , 2009, ICIS '09.

[14]  Ran Zhang,et al.  A Unified Relational Storage Scheme for RDF and Property Graphs , 2019, WISA.

[15]  Jayanta Banerjee,et al.  A Tale of Two Graphs: Property Graphs as RDF in Oracle , 2014, EDBT.

[16]  Catherine Faron-Zucker,et al.  A Transformation Language for RDF Based on SPARQL , 2015, WEBIST.

[17]  Olivier Bodenreider,et al.  Singleton Property Graph: Adding A Semantic Web Abstraction Layer to Graph Databases , 2019, BlockSW/CKG@ISWC.

[18]  Marco Brandizi,et al.  Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMiner use case , 2018, SWAT4LS.

[19]  Roberto De Virgilio,et al.  Smart RDF Data Storage in Graph Databases , 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[20]  Letizia Tanca,et al.  Supporting interoperability between new database languages , 1991, [1991] Proceedings, Advanced Computer Technology, Reliable Systems and Applications.

[21]  Axel Polleres,et al.  Everything you always wanted to know about blank nodes , 2014, J. Web Semant..

[22]  David Hyland-Wood,et al.  RDF 1.1: Knowledge Representation and Data Integration Language for the Web , 2020, Symmetry.

[23]  Ruth C Carlos,et al.  Value. , 2019, Journal of the American College of Radiology : JACR.

[24]  Renzo Angles,et al.  Serialization for Property Graphs , 2019, BDAS.

[25]  Georg Lausen,et al.  S2X: Graph-Parallel Querying of RDF with GraphX , 2015, Big-O/DMAH@VLDB.

[26]  Denny Vrandecic,et al.  Wikidata: a new platform for collaborative data collection , 2012, WWW.

[27]  Claudio Gutierrez,et al.  An introduction to Graph Data Management , 2017, Graph Data Management.

[28]  Jens Lehmann,et al.  Two for one: querying property graph databases using SPARQL via gremlinator , 2018, GRADES/NDA@SIGMOD/PODS.

[29]  Richard Hull,et al.  Relative information capacity of simple relational database schemata , 1984, SIAM J. Comput..

[30]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[31]  Josep-Lluís Larriba-Pey,et al.  The linked data benchmark council: a graph and RDF industry benchmarking effort , 2014, SGMD.

[32]  Catherine Faron-Zucker,et al.  A Generic RDF Transformation Software and Its Application to an Online Translation Service for Common Languages of Linked Data , 2015, International Semantic Web Conference.

[33]  Jens Lehmann,et al.  Let's build Bridges, not Walls: SPARQL Querying of TinkerPop Graph Databases with Sparql-Gremlin , 2020, 2020 IEEE 14th International Conference on Semantic Computing (ICSC).

[34]  Harsh Thakkar Towards an Open Extensible Framework for Empirical Benchmarking of Data Management Solutions: LITMUS , 2017, ESWC.

[35]  Dominik Tomaszuk,et al.  RDF Data in Property Graph Model , 2016, MTSR.

[36]  Antoine Isaac,et al.  data.europeana.eu: The Europeana Linked Open Data Pilot , 2011, Dublin Core Conference.

[37]  Rik Van de Walle,et al.  RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data , 2014, LDOW.

[38]  Oliver Horlacher,et al.  Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search , 2015, PloS one.

[39]  Olaf Hartig,et al.  Foundations of an Alternative Approach to Reification in RDF , 2014, ArXiv.

[40]  Jens Lehmann,et al.  Trying Not to Die Benchmarking: Orchestrating RDF and Graph Data Management Solution Benchmarks Using LITMUS , 2017, SEMANTiCS.

[41]  Jens Lehmann,et al.  The Query Translation Landscape: a Survey , 2019, ArXiv.

[42]  Christopher J. Rawlings,et al.  Representing and querying disease networks using graph databases , 2016, BioData Mining.

[43]  Claudio Gutiérrez,et al.  RDF Compression , 2019, Encyclopedia of Big Data Technologies.