Approximate Continuous Query Answering over Streams and Dynamic Linked Data Sets

To perform complex tasks, RDF Stream Processing Web applications evaluate continuous queries over streams and quasi-static background data. While the former are pushed in the application, the latter are continuously retrieved from the sources. As soon as the background data increase the volume and become distributed over the Web, the cost to retrieve them increases and applications become unresponsive. In this paper, we address the problem of optimizing the evaluation of these queries by leveraging local views on background data. Local views enhance performance, but require maintenance processes, because changes in the background data sources are not automatically reflected in the application. We propose a two-step query-driven maintenance process to maintain the local view: it exploits information from the query e.g.,i?źthe sliding window definition and the current window content to maintain the local view based on user-defined Quality of Service constraints. Experimental evaluation show the effectiveness of the approach.

[1]  Michael Schmidt,et al.  Foundations of SPARQL query optimization , 2008, ICDT '10.

[2]  RoussopoulosNick,et al.  Exploring the tradeoff between performance and data freshness in database-driven Web servers , 2004, VLDB 2004.

[3]  Jignesh M. Patel,et al.  A comparison of join algorithms for log processing in MaPreduce , 2010, SIGMOD Conference.

[4]  Xiaoquan Zhang,et al.  Impact of Wikipedia on Market Information Environment: Evidence on Management Disclosure and Investor Reaction , 2013, MIS Q..

[5]  Óscar Corcho,et al.  Federating queries in SPARQL 1.1: Syntax, semantics and evaluation , 2013, J. Web Semant..

[6]  Daniele Braga,et al.  Querying RDF streams with C-SPARQL , 2010, SGMD.

[7]  Jennifer Widom,et al.  Adaptive caching for continuous queries , 2005, 21st International Conference on Data Engineering (ICDE'05).

[8]  Varghese S. Jacob,et al.  Assessing Information Quality for the Composite Relational Operation Join , 2002, ICIQ.

[9]  Jürgen Umbrich,et al.  Freshening up while Staying Fast: Towards Hybrid SPARQL Queries , 2012, EKAW.

[10]  Alexandros Labrinidis,et al.  Exploring the tradeoff between performance and data freshness in database-driven Web servers , 2004, The VLDB Journal.

[11]  Jeffrey F. Naughton,et al.  Maximizing the Output Rate of Multi-Way Join Queries over Streaming Information Sources , 2003, VLDB.

[12]  François Goasdoué,et al.  View Selection in Semantic Web Databases , 2011, Proc. VLDB Endow..

[13]  Jonathan Goldstein,et al.  Relaxed currency and consistency: how to say "good enough" in SQL , 2004, SIGMOD '04.

[14]  Karl Aberer,et al.  Enabling Query Technologies for the Semantic Sensor Web , 2012, Int. J. Semantic Web Inf. Syst..

[15]  Danh Le Phuoc,et al.  A Native and Adaptive Approach for Unified Processing of Linked Streams and Linked Data , 2011, SEMWEB.

[16]  John R. Smith,et al.  Supporting Incremental Join Queries on Ranked Inputs , 2001, VLDB.

[17]  Freddy Lécué,et al.  Smart traffic analytics in the semantic web with STAR-CITY: Scenarios, system and lessons learned in Dublin City , 2014, J. Web Semant..

[18]  Jürgen Umbrich,et al.  Strategies for Executing Federated Queries in SPARQL1.1 , 2014, SEMWEB.

[19]  Raghu Ramakrishnan,et al.  Caching with 'Good Enough' Currency, Consistency, and Completeness , 2005, VLDB.

[20]  Emanuele Della Valle,et al.  Towards BOTTARI: Using Stream Reasoning to Make Sense of Location-Based Micro-posts , 2011, ESWC Workshops.