Backward inference and pruning for RDF change detection using RDBMS

Recent studies on change detection for RDF data have focused on minimizing the delta size and, as a way to exploit the semantics of RDF models in reducing the delta size, the forward-chaining inferences have been widely employed. However, since the forward-chaining inferences should pre-compute the entire closure of the RDF model, the existing approaches are not scalable to large RDF data sets. In this paper, we propose a scalable change detection scheme for RDF data, which is based on backward-chaining inference and pruning. Our scheme, instead of pre-computing the full closure, computes only the necessary closure on the fly, thus achieving fast and scalable change detection. In addition, for any two RDF data input files to be compared, the delta obtained from our scheme is always equivalent to the one from the existing forward-chaining inferences. In addition, in order to handle RDF data sets too large to fit in the available RAM, we present an SQL-based implementation of our scheme. Our experimental results show that our scheme, in comparison to the existing schemes, can reduce the number of inference triples for RDF change detection by 10–60%.

[1]  Sourav S. Bhowmick,et al.  DTD-Diff: A Change Detection Algorithm for DTDs , 2006, DASFAA.

[2]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.

[3]  Tudor Groza,et al.  SemVersion: RDF-based ontology versioning system , 2006 .

[4]  Jeremy J. Carroll,et al.  Signing RDF Graphs , 2003, SEMWEB.

[5]  Jim Melton,et al.  SQL: 1999, formerly known as SQL3 , 1999, SGMD.

[6]  Vassilis Christophides,et al.  On the Foundations of Computing Deltas Between RDF Models , 2007, ISWC/ASWC.

[7]  Vassilis Christophides,et al.  On Computing Deltas of RDF/S Knowledge Bases , 2011, TWEB.

[8]  Atanas Kiryakov,et al.  Tracking Changes in RDF(S) Repositories , 2002, EKAW.

[9]  Johann Eder,et al.  Change Detection in Ontologies Using DAG Comparison , 2006, OTM Workshops.

[10]  Dave Reynolds,et al.  Efficient RDF Storage and Retrieval in Jena2 , 2003, SWDB.

[11]  David J. DeWitt,et al.  X-Diff: an effective change detection algorithm for XML documents , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[12]  Tim Berners-Lee,et al.  Delta: an ontology for the distribution of differences between RDF graphs , 2004 .

[13]  Michel C. A. Klein,et al.  Ontology Versioning and Change Detection on the Web , 2002, EKAW.

[14]  Sang-Won Lee,et al.  A Version Management Framework for RDF Triple Stores , 2012, Int. J. Softw. Eng. Knowl. Eng..

[15]  Heiner Stuckenschmidt,et al.  Time - Space Trade-Offs in Scaling up RDF Schema Reasoning , 2005, WISE Workshops.

[16]  Grigoris Antoniou,et al.  Ontology change: classification and survey , 2008, The Knowledge Engineering Review.

[17]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[18]  Eugene Inseok Chong,et al.  An Efficient SQL-based RDF Querying Scheme , 2005, VLDB.

[19]  Yuzhong Qu,et al.  An RDF Storage and Query Framework with Flexible Inference Strategy , 2006, APWeb.

[20]  Michel C. A. Klein,et al.  Ontology Evolution: Not the Same as Schema Evolution , 2004, Knowledge and Information Systems.

[21]  Arjohn Kampman,et al.  Inferencing and Truth Maintenance in RDF Schema , 2003, PSSS.

[22]  Vassilis Christophides,et al.  On labeling schemes for the semantic web , 2003, WWW '03.

[23]  Sourav S. Bhowmick,et al.  Xandy: Detecting Changes on Large Unordered XML Documents Using Relational Databases , 2005, DASFAA.

[24]  Yannis Tzitzikas,et al.  On Storage Policies for Semantic Web Repositories That Support Versioning , 2008, ESWC.

[25]  Daniel J. Abadi,et al.  Scalable Semantic Web Data Management Using Vertical Partitioning , 2007, VLDB.

[26]  Olga De Troyer,et al.  Understanding ontology evolution: A change detection approach , 2007, J. Web Semant..

[27]  Vassilis Christophides,et al.  On Detecting High-Level Changes in RDF/S KBs , 2009, SEMWEB.

[28]  Mark A. Musen,et al.  Ontology versioning in an ontology management framework , 2004, IEEE Intelligent Systems.

[29]  Giovanni Tummarello,et al.  RDFSync: Efficient Remote Synchronization of RDF Models , 2007, ISWC/ASWC.

[30]  Jeremy J. Carroll,et al.  Resource description framework (rdf) concepts and abstract syntax , 2003 .