Multi-query Optimization via Common Sub Query Elimination for SPARQL

As the main query language for RDF data, SPARQL constantly draws attention from the Semantic Web community. In this paper we present a mechanism that identifies common sub queries across multiple RDF queries and eliminates the redundant ones, significantly improving the efficiency of executing multiple RDF queries, both simultaneously and sequentially. Unlike previous work, which concentrated on optimizing a single RDF query, our method focuses on improving the performance of multiple similar RDF queries, which commonly occur in real-world applications over RDF data. Extensive experiments over benchmark RDF repositories containing millions of triples confirm that our method outperforms state-of-the-art methods when common sub queries exist in multiple SPARQL queries.
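To make the idea of common sub query elimination concrete, the following is a minimal sketch and not the paper's actual algorithm. It assumes each query is a plain list of triple patterns, that shared patterns use identical variable names (a real system would have to match patterns up to variable renaming), and that the data fits in an in-memory list. All names here (`evaluate`, `run_shared`, the sample data) are hypothetical and chosen for illustration only.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Triple = Tuple[str, str, str]   # (subject, predicate, object)
Binding = Dict[str, str]        # variable name -> bound value


def is_var(term: str) -> bool:
    """Terms starting with '?' are treated as variables (an assumption of this sketch)."""
    return term.startswith("?")


def match(pattern: Triple, triple: Triple, binding: Binding) -> Optional[Binding]:
    """Extend `binding` so that `pattern` matches `triple`, or return None."""
    new = dict(binding)
    for p, t in zip(pattern, triple):
        if is_var(p):
            if new.setdefault(p, t) != t:   # conflicts with an earlier binding
                return None
        elif p != t:                        # constant term does not match
            return None
    return new


def evaluate(patterns: Sequence[Triple], data: Sequence[Triple],
             seeds: Optional[List[Binding]] = None) -> List[Binding]:
    """Naive nested-loop evaluation of a conjunction of triple patterns."""
    bindings = [dict()] if seeds is None else list(seeds)
    for pattern in patterns:
        bindings = [b2 for b in bindings for t in data
                    if (b2 := match(pattern, t, b)) is not None]
    return bindings


def run_shared(queries: Sequence[Sequence[Triple]], data: Sequence[Triple]):
    """Evaluate the patterns shared by all queries once, then finish each query."""
    common = sorted(set.intersection(*(set(q) for q in queries)))
    shared = evaluate(common, data)         # computed a single time
    results = []
    for q in queries:
        rest = [p for p in q if p not in common]
        results.append(evaluate(rest, data, shared))
    return common, results


# Hypothetical sample data and queries.
data = [
    ("alice", "worksFor", "uni"), ("alice", "teaches", "db"),
    ("alice", "advises", "bob"), ("carol", "worksFor", "uni"),
    ("carol", "teaches", "ai"),
]
q1 = [("?x", "worksFor", "uni"), ("?x", "teaches", "?c")]
q2 = [("?x", "worksFor", "uni"), ("?x", "advises", "?s")]

common, results = run_shared([q1, q2], data)
print(common)
print(results)
```

In this sketch the pattern `("?x", "worksFor", "uni")` is evaluated once and its bindings are reused by both queries. The saving grows with the cost of the shared part and the number of queries that contain it.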
