Certain Answers for SPARQL with Blank Nodes

Blank nodes in RDF graphs can be used to represent values known to exist but whose identity remains unknown. A prominent example of such usage can be found in the Wikidata dataset where, e.g., the author of Beowulf is given as a blank node. However, while SPARQL considers blank nodes in a query as existentials, it treats blank nodes in RDF data more like constants. Running SPARQL queries over datasets with unknown values may thus lead to counter-intuitive results, which may make the standard SPARQL semantics unsuitable for datasets with existential blank nodes. We thus explore the feasibility of an alternative SPARQL semantics based on certain answers. In order to estimate the performance costs that would be associated with such a change in semantics for current implementations, we adapt and evaluate approximation techniques proposed in a relational database setting for a core fragment of SPARQL. To further understand the impact that such a change in semantics may have on query solutions, we analyse how this new semantics would affect the results of user queries over Wikidata.

[1]  Marcelo Arenas,et al.  Semantics and Complexity of SPARQL , 2006, International Semantic Web Conference.

[2]  E. F. Codd Understanding relations , 1974, SGMD.

[3]  Witold Lipski On relational algebra with marked nulls preliminary version , 1984, PODS '84.

[4]  M. Arenas,et al.  SQL ' s Three-Valued Logic and Certain Answers , 2015 .

[5]  Axel Polleres,et al.  Everything you always wanted to know about blank nodes , 2014, J. Web Semant..

[6]  Leonid Libkin,et al.  Correctness of SQL Queries on Databases with Nulls , 2017, SGMD.

[7]  Markus Krötzsch,et al.  Wikidata , 2014, Commun. ACM.

[8]  Hans-Joachim Klein On the Use of Marked Nulls for the Evaluation of Queries Against Incomplete Relational Databases , 1998, FMLDO.

[9]  Leonid Libkin Certain answers as objects and knowledge , 2016, Artif. Intell..

[10]  Leonid Libkin,et al.  Making SQL Queries Correct on Incomplete Databases: A Feasibility Study , 2016, PODS.

[11]  Leonid Libkin,et al.  On the complexity of query answering over incomplete XML documents , 2012, ICDT '12.

[12]  Claire David,et al.  Certain answers for XML queries , 2010, PODS.

[13]  Reinhard Pichler,et al.  Towards Reconciling SPARQL and Certain Answers , 2015, WWW.

[14]  Marcelo Arenas,et al.  Querying semantic web data with SPARQL , 2011, PODS.

[15]  Serge Abiteboul,et al.  On the representation and querying of sets of possible worlds , 1987, SIGMOD '87.

[16]  Claudio Gutiérrez,et al.  The Multiset Semantics of SPARQL Patterns , 2016, SEMWEB.

[17]  Leonid Libkin,et al.  Approximations and Refinements of Certain Answers via Many-Valued Logics , 2016, KR.