Freshening up while Staying Fast: Towards Hybrid SPARQL Queries

Querying over cached indexes of Linked Data often suffers from stale or missing results due to infrequent updates and partial coverage of sources. Conversely, live decentralised approaches offer fresh results directly from the Web, but exhibit slow response times due to accessing numerous remote sources at runtime. We thus propose a hybrid query approach that improves upon both paradigms, offering fresher results from a broader range of sources than Linked Data caches while offering faster results than live querying. Our hybrid query engine takes a cached and live query engine as black boxes, where a hybrid query planner splits an input query and delegates the appropriate sub-queries to each interface. In this paper, we discuss query planning alternatives and their main strengths and weaknesses. We also present coherence measures to quantify the coverage and freshness for cached indexes of Linked Data, and show how these measures can be used for hybrid query planning to optimise the trade-off between fresh results and fast runtimes.

[1]  László Böszörményi,et al.  A survey of Web cache replacement strategies , 2003, CSUR.

[2]  Lei Zhang,et al.  Summary Models for Routing Keywords to Linked Data Sources , 2010, International Semantic Web Conference.

[3]  Michael Martin,et al.  Improving the Performance of Semantic Web Applications with SPARQL Query Caching , 2010, ESWC.

[4]  Lora Aroyo,et al.  The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, October 23-27, 2011, Proceedings, Part I , 2011, SEMWEB.

[5]  Jürgen Umbrich,et al.  Linked Data and Live Querying for Enabling Support Platforms for Web Dataspaces , 2012, 2012 IEEE 28th International Conference on Data Engineering Workshops.

[6]  Christian Bizer,et al.  Executing SPARQL Queries over the Web of Linked Data , 2009, SEMWEB.

[7]  Óscar Corcho,et al.  Semantics and Optimization of the SPARQL 1.1 Federation Extension , 2011, ESWC.

[8]  Barry Bishop,et al.  FactForge: A fast track to the Web of data , 2011, Semantic Web.

[9]  Jeff Z. Pan,et al.  The Semanic Web: Research and Applications - 8th Extended Semantic Web Conference, ESWC 2011, Heraklion, Crete, Greece, May 29 - June 2, 2011, Proceedings, Part II , 2011, ESWC.

[10]  Orri Erling,et al.  RDF Support in the Virtuoso DBMS , 2007, CSSW.

[11]  Jürgen Umbrich,et al.  Towards a Dynamic Linked Data Observatory , 2012 .

[12]  Jürgen Umbrich,et al.  Towards Understanding the Changing Web: Mining the Dynamics of Linked-Data Sources and Entities , 2010, LWA.

[13]  Katja Hose,et al.  FedX: A Federation Layer for Distributed Query Processing on Linked Open Data , 2011, ESWC.

[14]  Jürgen Umbrich,et al.  Towards Dataset Dynamics: Change Frequency of Linked Open Data Sources , 2010, LDOW.

[15]  Olaf Hartig,et al.  A Database Perspective on Consuming Linked Data on the Web , 2010, Datenbank-Spektrum.

[16]  Ulf Leser,et al.  Querying Distributed RDF Data Sources with SPARQL , 2008, ESWC.

[17]  Jürgen Umbrich,et al.  Comparing data summaries for processing live queries over Linked Data , 2011, World Wide Web.

[18]  Abraham Bernstein,et al.  The Semantic Web - ISWC 2009, 8th International Semantic Web Conference, ISWC 2009, Chantilly, VA, USA, October 25-29, 2009. Proceedings , 2009, SEMWEB.

[19]  Günter Ladwig,et al.  Linked Data Query Processing Strategies , 2010, SEMWEB.

[20]  Ian Horrocks,et al.  The Semantic Web – ISWC 2010: 9th International Semantic Web Conference, ISWC 2010, Shanghai, China, November 7-11, 2010, Revised Selected Papers, Part I , 2010, SEMWEB.

[21]  Katja Hose,et al.  FedX: Optimization Techniques for Federated Query Processing on Linked Data , 2011, SEMWEB.

[22]  Lora Aroyo,et al.  The Semantic Web: Research and Applications , 2009, Lecture Notes in Computer Science.

[23]  Jesse Weaver,et al.  Enabling Fine-Grained HTTP Caching of SPARQL Query Results , 2011, SEMWEB.

[24]  Marcel Karnstedt,et al.  Semantic Caching in Ontology-based Mediator Systems , 2003, Berliner XML Tage.

[25]  Eyal Oren,et al.  Sindice.com: a document-oriented lookup index for open linked data , 2008, Int. J. Metadata Semant. Ontologies.