A Version Management Framework for RDF Triple Stores

RDF is widely used as an ontology language for representing metadata in the Semantic Web, knowledge management systems, and e-commerce. Since ontologies model the knowledge in a particular domain, they may change over time. Furthermore, ontologies are usually developed and controlled in a distributed and collaborative way. Thus, it is very important to be able to manage multiple versions of RDF data. Earlier studies on RDF versioning have focused on providing access to different versions (i.e., snapshots) and computing the differences between two versions. However, the existing approaches suffer from space overhead for large-scale data, since all snapshots must be kept redundantly in a repository. Moreover, computing the delta between two specific versions, a very common operation in RDF applications, is very time consuming. In this paper, we propose a framework for RDF version management in relational databases. It stores the original version and the deltas between consecutive versions, thereby reducing the space requirement considerably. Another benefit of our approach is that it is well suited to change queries. On the flip side, in order to answer a query on a specific logical version, that version must be constructed on the fly by applying the deltas between the original version and the logical version, which can slow down query performance. To overcome this, we propose a compression technique for deltas, called Aggregated Delta, which creates a logical version directly rather than executing the sequence of deltas. An experimental study with real-life RDF data sets shows that our framework maintains multiple versions efficiently.
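The trade-off described above can be illustrated with a minimal in-memory sketch. It assumes a delta is simply a pair of triple sets (added and deleted) and that an aggregated delta is the net effect of a run of consecutive deltas. The names `Delta`, `apply_delta`, `build_version`, and `aggregate` are hypothetical and chosen for illustration; the abstract does not specify the framework's actual relational schema or its exact definition of Aggregated Delta.

```python
from typing import FrozenSet, List, NamedTuple, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


class Delta(NamedTuple):
    """Assumed delta model: triples added and deleted between two versions."""
    added: FrozenSet[Triple]
    deleted: FrozenSet[Triple]


def apply_delta(version: Set[Triple], delta: Delta) -> Set[Triple]:
    """Return the next version: remove deleted triples, then add new ones."""
    return (version - delta.deleted) | delta.added


def build_version(original: Set[Triple], deltas: List[Delta], n: int) -> Set[Triple]:
    """Construct logical version n by replaying the first n deltas in order."""
    version = set(original)
    for delta in deltas[:n]:
        version = apply_delta(version, delta)
    return version


def aggregate(deltas: List[Delta]) -> Delta:
    """Fold a sequence of deltas into one delta with the same net effect.

    For each triple, only the last operation that touched it survives,
    so an insertion followed by a deletion (or vice versa) cancels out.
    """
    added: Set[Triple] = set()
    deleted: Set[Triple] = set()
    for delta in deltas:
        added = (added - delta.deleted) | delta.added
        deleted = (deleted - delta.added) | delta.deleted
    return Delta(frozenset(added), frozenset(deleted))


# Illustrative data only; these triples are not from the paper's data sets.
a: Triple = ("ex:a", "rdf:type", "ex:Gene")
b: Triple = ("ex:b", "rdf:type", "ex:Gene")
c: Triple = ("ex:c", "rdf:type", "ex:Gene")
d: Triple = ("ex:d", "rdf:type", "ex:Gene")

original = {a, b}
deltas = [
    Delta(frozenset({c}), frozenset({a})),  # version 0 -> 1
    Delta(frozenset({a}), frozenset({c})),  # version 1 -> 2
    Delta(frozenset({d}), frozenset({b})),  # version 2 -> 3
]

replayed = build_version(original, deltas, 3)
direct = apply_delta(original, aggregate(deltas))
```

In this sketch, `replayed` applies three deltas one after another, while `direct` applies a single aggregated delta to the original version; both yield the same set of triples. The aggregated delta is also smaller than the three deltas combined, because the insertion and deletion of `c` cancel each other.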
