Logical Inferences with Contexts of RDF Triples

Logical inference, an integral feature of the Semantic Web, is the process of deriving new triples by applying entailment rules on knowledge bases. The entailment rules are determined by the model-theoretic semantics. Incorporating context of an RDF triple (e.g., provenance, time, and location) into the inferencing process requires the formal semantics to be capable of describing the context of RDF triples also in the form of triples, or in other words, RDF contextual triples about triples. The formal semantics should also provide the rules that could entail new contextual triples about triples. In this paper, we propose the first inferencing mechanism that allows context of RDF triples, represented in the form of RDF triples about triples, to be the first-class citizens in the model-theoretic semantics and in the logical rules. Our inference mechanism is well-formalized with all new concepts being captured in the model-theoretic semantics. This formal semantics also allows us to derive a new set of entailment rules that could entail new contextual triples about triples. To demonstrate the feasibility and the scalability of the proposed mechanism, we implement a new tool in which we transform the existing knowledge bases to our representation of RDF triples about triples and provide the option for this tool to compute the inferred triples for the proposed rules. We evaluate the computation of the proposed rules on a large scale using various real-world knowledge bases such as Bio2RDF NCBI Genes and DBpedia. The results show that the computation of the inferred triples can be highly scalable. On average, one billion inferred triples adds 5-6 minutes to the overall transformation process. NCBI Genes, with 20 billion triples in total, took only 232 minutes for the transformation of 12 billion triples and added 42 minutes for inferring 8 billion triples to the overall process.

[1]  Zhe Wu,et al.  Implementing an Inference Engine for RDFS/OWL Constructs and User-Defined Rules in Oracle , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[2]  Steffen Staab,et al.  Querying for meta knowledge , 2008, WWW.

[3]  Amit P. Sheth,et al.  Don't like RDF reification?: making statements about statements using singleton property , 2014, WWW.

[4]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[5]  Jens Lehmann,et al.  DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia , 2015, Semantic Web.

[6]  Sebastian Rudolph,et al.  Foundations of Semantic Web Technologies , 2009 .

[7]  Deborah L. McGuinness,et al.  PROV-O: The PROV Ontology , 2013 .

[8]  Evan E. Bolton,et al.  Exposing Provenance Metadata Using Different RDF Models , 2015, SWAT4LS.

[9]  Jeremy J. Carroll,et al.  OWL 2 Web Ontology Language RDF-Based Semantics , 2009 .

[10]  Gerhard Weikum,et al.  YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract , 2013, IJCAI.

[11]  J. Carroll,et al.  Jena: implementing the semantic web recommendations , 2004, WWW Alt. '04.

[12]  Zhe Wu,et al.  Optimizing Enterprise-Scale OWL 2 RL Reasoning in a Relational Database System , 2010, SEMWEB.

[13]  Wolf Siberski,et al.  SLUBM: An Extended LUBM Benchmark for Stream Reasoning , 2013, OrdRing@ISWC.

[14]  Claudio Gutiérrez,et al.  Temporal RDF , 2005, ESWC.

[15]  Jeremy J. Carroll,et al.  Named graphs , 2005, J. Web Semant..

[16]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.