Implementing Flexible Operators for Regular Path Queries

Given the heterogeneity of complex graph data on the web, such as RDF linked data, a user wishing to query such data may lack full knowledge of its structure and irregularities. Hence, providing users with flexible querying capabilities can be beneficial. The query language we adopt comprises conjunctions of regular path queries, and thus includes the extensions proposed for SPARQL 1.1 that allow paths to be queried using regular expressions. To this language we add two operators: APPROX, which supports standard notions of approximation based on edit distance, and RELAX, which performs query relaxation based on RDFS inference rules. We describe our techniques for implementing the extended language and present a performance study undertaken on two real-world data sets. Our baseline implementation performs competitively with other automaton-based approaches, and we demonstrate empirically how various optimisations can decrease execution times of queries containing APPROX and RELAX, sometimes by orders of magnitude.
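
To make the idea behind APPROX concrete, the following is a minimal sketch of edit-distance-based approximate evaluation of a single regular path query, not the implementation described in the paper. It assumes unit costs for inserting, deleting, or substituting an edge label, and it runs a Dijkstra-style search over pairs of (graph node, automaton state). The function name `approx_rpq`, the dictionary encoding of the automaton, the `max_cost` cut-off, and the toy data are all hypothetical choices made for illustration. RELAX and conjunctions of queries are not covered.

```python
import heapq
from collections import defaultdict


def approx_rpq(edges, nfa, start_state, final_states, source, max_cost=1):
    """Return {node: minimum edit cost} for nodes reachable from `source`.

    edges: iterable of (subject, label, object) triples.
    nfa:   dict mapping a state to a list of (label, next_state) transitions.
    Assumed cost model: exact label match costs 0; substitution,
    insertion and deletion of a label each cost 1.
    """
    out = defaultdict(list)
    for s, label, t in edges:
        out[s].append((label, t))

    answers = {}
    done = set()
    heap = [(0, source, start_state)]
    while heap:
        cost, node, state = heapq.heappop(heap)
        if (node, state) in done:
            continue
        done.add((node, state))
        # Pairs are popped in non-decreasing cost order, so the first
        # time a node is seen in a final state its cost is minimal.
        if state in final_states and node not in answers:
            answers[node] = cost

        successors = []
        for label, nxt in out[node]:
            # Insertion: traverse a graph edge the query does not mention.
            successors.append((cost + 1, nxt, state))
            for q_label, q_next in nfa.get(state, ()):
                # Exact match (cost 0) or substitution (cost 1).
                step = 0 if q_label == label else 1
                successors.append((cost + step, nxt, q_next))
        for _q_label, q_next in nfa.get(state, ()):
            # Deletion: skip a query label without moving in the graph.
            successors.append((cost + 1, node, q_next))

        for item in successors:
            if item[0] <= max_cost and (item[1], item[2]) not in done:
                heapq.heappush(heap, item)
    return answers


# Toy data (hypothetical): the query is the path worksAt . locatedIn.
EDGES = [
    ("alice", "worksAt", "acme"),
    ("acme", "locatedIn", "london"),
    ("alice", "studiedAt", "ucl"),
    ("ucl", "locatedIn", "london"),
    ("alice", "knows", "bob"),
]
QUERY_NFA = {0: [("worksAt", 1)], 1: [("locatedIn", 2)]}

if __name__ == "__main__":
    result = approx_rpq(EDGES, QUERY_NFA, 0, {2}, "alice", max_cost=1)
    for node, cost in sorted(result.items(), key=lambda kv: kv[1]):
        print(node, cost)
```

Because answers are discovered in order of increasing cost, exact answers are found before approximate ones, which is one natural way to return ranked results incrementally. Raising `max_cost` widens the search and can enlarge the explored product space considerably, which is the kind of cost the optimisations mentioned above would aim to reduce.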
