Towards Versioning of Arbitrary RDF Data

Coherent and consistent tracking of provenance data, and in particular of update history, is a crucial building block for any serious information system architecture. Version Control Systems can be part of such an architecture, enabling users to query and manipulate versioning information as well as content revisions. In this paper, we introduce an RDF versioning approach as a foundation for a full-featured RDF Version Control System. We argue that such a system needs to support all concepts of the RDF specification, including RDF datasets and blank nodes. Furthermore, we place special emphasis on protecting against undetected manipulation of the history by hashing the resulting patches. In addition to the conceptual analysis and an RDF vocabulary for representing versioning information, we present a mature implementation that captures versioning information for changes to arbitrary RDF datasets.
