RDF packages: a scheme for efficient reasoning and querying over large-scale RDF data

When querying RDF and RDFS data, it is common to improve performance by deriving all triples entailed by the RDFS entailment rules before query processing. A drawback of this approach is that RDFS reasoning generates a large number of triples, so materializing the RDFS closure requires a considerable amount of storage space. In this paper, we propose RDF packages, a time- and space-efficient format for RDF data. In an RDF package, a set of triples of the same class, or a set of triples having the same predicate, is grouped into a dedicated node named Package. Using Packages, we can represent any metadata that can be expressed in RDF. An important feature of RDF packages is that the same RDFS rules can be applied to them without any modification, which allows us to use existing RDFS reasoners. We discuss the model of RDF packages and its rules, followed by the transformation between RDF and RDF packages. We also discuss the implementation of RDF packages using an existing RDF framework. Finally, we demonstrate the performance of the proposed scheme in terms of triple size, reasoning speed, and querying speed.
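
The grouping idea described above can be illustrated with a minimal sketch. It is not the paper's actual Package model or serialization: the function name `group_into_packages`, the dictionary-based layout, and the `ex:` example data are all assumptions made for illustration. It only shows how triples sharing a class (via `rdf:type`) or sharing a predicate could be collected under one grouping key.

```python
from collections import defaultdict

RDF_TYPE = "rdf:type"  # prefixed name used here as a plain string


def group_into_packages(triples):
    """Group triples by class (for rdf:type triples) or by predicate.

    Hypothetical helper: returns two plain dicts standing in for
    class-based and predicate-based Packages.
    """
    class_packages = defaultdict(set)      # class -> set of instance subjects
    predicate_packages = defaultdict(set)  # predicate -> set of (subject, object)
    for s, p, o in triples:
        if p == RDF_TYPE:
            class_packages[o].add(s)
        else:
            predicate_packages[p].add((s, o))
    return dict(class_packages), dict(predicate_packages)


# Illustrative data only (not taken from the paper).
triples = [
    ("ex:alice", "rdf:type", "ex:Student"),
    ("ex:bob", "rdf:type", "ex:Student"),
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:bob", "ex:knows", "ex:carol"),
]

class_packages, predicate_packages = group_into_packages(triples)
```

In this sketch, the two `rdf:type` triples collapse into a single entry keyed by `ex:Student`, and the two `ex:knows` triples into a single entry keyed by `ex:knows`. How a Package is actually represented as an RDF node, and how the RDFS rules apply to it, is defined in the paper itself.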
