Optimizing Aggregate SPARQL Queries Using Materialized RDF Views

During recent years, more and more data has been published as native RDF datasets. In this setup, both the size of the datasets and the need to process aggregate queries represent challenges for standard SPARQL query processing techniques. To overcome these limitations, materialized views can be created and used as a source of precomputed partial results during query processing. However, materialized view techniques as proposed for relational databases do not support RDF specifics, such as incompleteness and the need to support implicit (derived) information. To overcome these challenges, this paper proposes MARVEL (MAterialized Rdf Views with Entailment and incompLetness). The approach consists of a view selection algorithm based on an associated RDF-specific cost model, a view definition syntax, and an algorithm for rewriting SPARQL queries using materialized RDF views. The experimental evaluation shows that MARVEL can improve query response time by more than an order of magnitude while effectively handling RDF specifics.

[1]  Rafael Berlanga Llavori,et al.  Building data warehouses with semantic data , 2010, EDBT '10.

[2]  Werner Nutt,et al.  Completeness Statements about RDF Data Sources and Their Use for Query Answering , 2013, SEMWEB.

[3]  Ulf Leser,et al.  RDFMatView : Indexing RDF Data using Materialized SPARQL queries , 2010 .

[4]  Jürgen Umbrich,et al.  Resource Planning for SPARQL Query Execution on Data Sharing Platforms , 2014, COLD.

[5]  Lorena Etcheverry,et al.  Modeling and Querying Data Warehouses on the Semantic Web Using QB4OLAP , 2014, DaWaK.

[6]  Jeff Heflin,et al.  LUBM: A benchmark for OWL knowledge base systems , 2005, J. Web Semant..

[7]  Ashish Gupta,et al.  Aggregate-Query Processing in Data Warehousing Environments , 1995, VLDB.

[8]  Fabian M. Suchanek,et al.  Fast rule mining in ontological knowledge bases with AMIE+\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$+$$\end{docu , 2015, The VLDB Journal.

[9]  Divesh Srivastava,et al.  Answering Queries with Aggregation Using Views , 1996, VLDB.

[10]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[11]  Torben Bach Pedersen,et al.  Using Semantic Web Technologies for Exploratory OLAP: A Survey , 2015, IEEE Transactions on Knowledge and Data Engineering.

[12]  François Goasdoué,et al.  RDF analytics: lenses over semantic graphs , 2014, WWW.

[13]  Rafael Berlanga Llavori,et al.  Building data warehouses with semantic web data , 2012, Decis. Support Syst..

[14]  François Goasdoué,et al.  View Selection in Semantic Web Databases , 2011, Proc. VLDB Endow..

[15]  Timos K. Sellis,et al.  View selection for designing the global data warehouse , 2001, Data Knowl. Eng..

[16]  Andreas Harth,et al.  No Size Fits All - Running the Star Schema Benchmark with SPARQL and RDF Aggregate Views , 2013, ESWC.

[17]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[18]  Feifei Li,et al.  Rewriting queries on SPARQL views , 2011, WWW.

[19]  Torben Bach Pedersen,et al.  Towards Exploratory OLAP Over Linked Open Data - A Case Study , 2014, BIRTE.

[20]  Torben Bach Pedersen,et al.  Processing Aggregate Queries in a Federation of SPARQL Endpoints , 2015, ESWC.

[21]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[22]  Esteban Zimányi,et al.  Data Warehouse Systems , 2014, Data-Centric Systems and Applications.

[23]  Marcelo Arenas,et al.  Semantics and Complexity of SPARQL , 2006, International Semantic Web Conference.

[24]  François Goasdoué,et al.  Efficient OLAP operations for RDF analytics , 2015, 2015 31st IEEE International Conference on Data Engineering Workshops.