An Overview on Execution Strategies for Linked Data Queries

The publication of Linked Open Data on the Web has gained tremendous momentum over the last five years. This development makes possible (and interesting) the execution of queries using up-to-date data from multiple, automatically discovered data sources. As a result, we currently witness the emergence of a new research area that focuses on an online execution of Linked Data queries; i.e. queries that range over data that is made available using the Linked Data publishing principles.This article provides a general overview on this new area. In particular, we introduce the specific challenges that need to be addressed and then focus on possible strategies for executing Linked Data queries. Furthermore, we classify approaches proposed in the literature w.r.t. these strategies.

[1]  Olaf Hartig,et al.  Zero-Knowledge Query Planning for an Iterator Implementation of Link Traversal Based Query Execution , 2011, ESWC.

[2]  Andreas Harth,et al.  Top-k Linked Data Query Processing , 2012, ESWC.

[3]  Deborah L. McGuinness,et al.  SameAs Networks and Beyond: Analyzing Deployment Status and Implications of owl: sameAs in Linked Data , 2010, International Semantic Web Conference.

[4]  Beat Signer,et al.  Efficient Querying of Distributed RDF Sources in Mobile Settings based on a Source Index Model , 2011, ANT/MobiWIS.

[5]  Olaf Hartig,et al.  How Caching Improves Efficiency and Result Completeness for Querying Linked Data , 2011, LDOW.

[6]  Christian Bizer,et al.  Evolving the Web into a Global Data Space , 2011, BNCOD.

[7]  Michael Stonebraker A Database Perspective , 1982, On Conceptual Modelling.

[8]  Jürgen Umbrich,et al.  Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine , 2011, J. Web Semant..

[9]  Peter Dolog,et al.  Relaxing RDF queries based on user and domain preferences , 2009, Journal of Intelligent Information Systems.

[10]  Luciano Serafini,et al.  Querying the Web of Data: A Formal Approach , 2009, ASWC.

[11]  Gavin Powell,et al.  Towards Fuzzy Query-Relaxation for RDF , 2012, ESWC.

[12]  Peter Mika,et al.  Metadata Statistics for a Large Web Corpus , 2012, LDOW.

[13]  Jürgen Umbrich,et al.  Data summaries for on-demand queries over linked data , 2010, WWW '10.

[14]  Jürgen Umbrich,et al.  Enhancing Source Selection for Live Queries over Linked Data via Query Log Mining , 2011, JIST.

[15]  Olaf Hartig,et al.  SPARQL for a Web of Linked Data: Semantics and Computability (Extended Version) , 2012, ESWC.

[16]  Olaf Hartig,et al.  Foundations of traversal based query execution over linked data , 2011, HT '12.

[17]  Christian Bizer,et al.  Web Data Commons - Extracting Structured Data from Two Large Web Corpora , 2012, LDOW.

[18]  Florian Schmedding,et al.  Incremental SPARQL Evaluation for Query Answering on Linked Data , 2011, COLD.

[19]  Jürgen Umbrich,et al.  Comparing data summaries for processing live queries over Linked Data , 2011, World Wide Web.

[20]  Euripides G. M. Petrakis,et al.  Improving the performance of focused web crawlers , 2009, Data Knowl. Eng..

[21]  Jeremy J. Carroll,et al.  OWL 2 Web Ontology Language RDF-Based Semantics , 2009 .

[22]  Sebastian Speiser,et al.  On Completeness Classes for Query Evaluation on Linked Data , 2012, AAAI.

[23]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[24]  Ivan Herman,et al.  RDFa Core 1.1Syntax and processing rules for embedding RDF through attributes , 2012 .

[25]  Jürgen Umbrich,et al.  Improving the Recall of Live Linked Data Querying through Reasoning , 2012, RR.

[26]  Günter Ladwig,et al.  Linked Data Query Processing Strategies , 2010, SEMWEB.

[27]  Steffen Staab,et al.  Federated Data Management and Query Optimization for Linked Open Data , 2011, New Directions in Web Data Management 1.

[28]  E. Prud hommeaux,et al.  SPARQL query language for RDF , 2011 .

[29]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[30]  Olaf Hartig,et al.  A Database Perspective on Consuming Linked Data on the Web , 2010, Datenbank-Spektrum.

[31]  Amit P. Sheth,et al.  Alignment-Based Querying of Linked Open Data , 2012, OTM Conferences.

[32]  Christian Bizer,et al.  Executing SPARQL Queries over the Web of Linked Data , 2009, SEMWEB.

[33]  Abraham Bernstein,et al.  Hexastore: sextuple indexing for semantic web data management , 2008, Proc. VLDB Endow..

[34]  Chengfei Liu,et al.  Approximating query answering on RDF databases , 2011, World Wide Web.

[35]  Marco Gori,et al.  Focused Crawling Using Context Graphs , 2000, VLDB.

[36]  Nektarios Gioldasis,et al.  Ontology Mapping and SPARQL Rewriting for Querying Federated RDF Data Sources - (Short Paper) , 2010, OTM Conferences.

[37]  Günter Ladwig,et al.  SIHJoin: Querying Remote and Local Linked Data , 2011, ESWC.

[38]  Lakhmi C. Jain,et al.  New Directions in Web Data Management 1 , 2011, New Directions in Web Data Management 1.

[39]  Martin van den Berg,et al.  Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery , 1999, Comput. Networks.

[40]  Andreas Harth,et al.  Optimized index structures for querying RDF from the Web , 2005, Third Latin American Web Congress (LA-WEB'2005).

[41]  Gerhard Weikum,et al.  RDF-3X: a RISC-style engine for RDF , 2008, Proc. VLDB Endow..