Processing of RDF Stream Data

We are witnessing a paradigm shift, where real-time, time-dependent data is becoming ubiquitous. As Linked Data facilitates the data integration process among heterogenous data sources, RDF Stream Data has the same goal with respect to data streams. It bridges the gap between stream and more static data sources. To support the processing on RDF stream data, there is a need on investigating how to extend RDF to model and represent stream data. Then, from the RDF-based data representation, the query model processing models need to be defined to build the stream processing engine that is tailored for streaming data. This chapter provides an overview on how such requirements are addressed in the current state-of-the-art of RDF Stream Data processing.

[1]  Jennifer Widom,et al.  Flexible time management in data stream systems , 2004, PODS.

[2]  Charu C. Aggarwal,et al.  gSketch: On Query Estimation in Graph Streams , 2011, Proc. VLDB Endow..

[3]  Boris Motik,et al.  Representing and querying validity time in RDF and OWL: A logic-based approach , 2010, J. Web Semant..

[4]  Jeffrey F. Naughton,et al.  Rate-based query optimization for streaming information sources , 2002, SIGMOD '02.

[5]  Olaf Hartig,et al.  The SPARQL Query Graph Model for Query Optimization , 2007, ESWC.

[6]  Abhinav Gupta,et al.  Optimizing Refresh of a Set of Materialized Views , 2005, VLDB.

[7]  Samuel Madden,et al.  Continuously adaptive continuous queries over streams , 2002, SIGMOD '02.

[8]  Sebastian Rudolph,et al.  EP-SPARQL: a unified language for event processing and stream reasoning , 2011, WWW.

[9]  Ziv Bar-Yossef,et al.  Reductions in streaming algorithms, with an application to counting triangles in graphs , 2002, SODA '02.

[10]  Rajeev Motwani,et al.  Operator scheduling in data stream systems , 2004, VLDB 2004.

[11]  Joseph M. Hellerstein,et al.  Eddies: continuously adaptive query processing , 2000, SIGMOD 2000.

[12]  Michael J. Franklin,et al.  Remembrance of Streams Past: Overload-Sensitive Management of Archived Streams , 2004, VLDB.

[13]  Michael Stonebraker,et al.  Optimization of parallel query execution plans in XPRS , 2005, Distributed and Parallel Databases.

[14]  Nick Koudas,et al.  The design of a query monitoring system , 2009, TODS.

[15]  Danh Le Phuoc,et al.  A Native and Adaptive Approach for Linked Stream Data Processing , 2013 .

[16]  Samuel Madden,et al.  ZStream: a cost-based query processor for adaptively detecting composite events , 2009, SIGMOD Conference.

[17]  Andre Bolles,et al.  Streaming SPARQL - Extending SPARQL to Process Data Streams , 2008, ESWC.

[18]  Hoan Quoc Nguyen-Mau,et al.  The Graph of Things: A step towards the Live Knowledge Graph of connected things , 2016, J. Web Semant..

[19]  Sebastian Rudolph,et al.  A Rule-Based Language for Complex Event Processing and Reasoning , 2010, RR.

[20]  Alasdair J. G. Gray,et al.  Enabling Ontology-Based Access to Streaming Data Sources , 2010, SEMWEB.

[21]  Michael Stonebraker,et al.  Operator Scheduling in a Data Stream Manager , 2003, VLDB.

[22]  Daniele Braga,et al.  An execution environment for C-SPARQL queries , 2010, EDBT '10.

[23]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[24]  Neil Immerman,et al.  Efficient pattern matching over event streams , 2008, SIGMOD Conference.

[25]  Kirk Pruhs,et al.  Freshness-Aware Scheduling of Continuous Queries in the Dynamic Web , 2005, WebDB.

[26]  A. N. Wilschut,et al.  Dataflow query execution in a parallel main-memory environment , 1991, Distributed and Parallel Databases.

[27]  Umberto Straccia,et al.  AnQL: SPARQLing Up Annotated RDFS , 2010, SEMWEB.

[28]  JÜRGEN KRÄMER,et al.  Semantics and implementation of continuous sliding window queries over data streams , 2009, TODS.

[29]  Christian Y. A. Brenninkmeijer,et al.  An Architecture for Query Optimization in Sensor Networks , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[30]  Craig Chambers,et al.  The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing , 2015, Proc. VLDB Endow..

[31]  Geoff Holmes,et al.  Mining frequent closed graphs on evolving data streams , 2011, KDD.

[32]  Gregor Schiele,et al.  Querying Heterogeneous Personal Information on the Go , 2014, International Semantic Web Conference.

[33]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[34]  Sharma Chakravarthy,et al.  Queueing analysis of relational operators for continuous data streams , 2003, CIKM '03.

[35]  Jeffrey F. Naughton,et al.  Maximizing the Output Rate of Multi-Way Join Queries over Streaming Information Sources , 2003, VLDB.

[36]  Andrew Heybey,et al.  Tribeca: A System for Managing Large Databases of Network Traffic , 1998, USENIX Annual Technical Conference.

[37]  Jian Yin,et al.  Position: short object lifetimes require a delete-optimized storage system , 2004, EW 11.

[38]  Philippe Bonnet,et al.  Towards Sensor Database Systems , 2001, Mobile Data Management.

[39]  Danh Le-Phuoc Operator-aware approach for boosting performance in RDF stream processing , 2017 .

[40]  Danh Le Phuoc,et al.  Operator-aware approach for boosting performance in RDF stream processing , 2017, J. Web Semant..

[41]  Sreenivas Gollapudi,et al.  Estimating PageRank on graph streams , 2008, PODS.

[42]  Karl Aberer,et al.  TripleWave: Spreading RDF Streams on the Web , 2016, SEMWEB.

[43]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD 2000.

[44]  Emanuele Della Valle,et al.  A Query Model to Capture Event Pattern Matching in RDF Stream Processing Query Languages , 2016, EKAW.

[45]  Michael Eckert,et al.  A CEP Babelfish: Languages for Complex Event Processing and Querying Surveyed , 2011 .

[46]  Ge Yu,et al.  Tick Scheduling: A Deadline Based Optimal Task Scheduling Approach for Real-Time Data Stream Systems , 2005, WAIM.

[47]  Theodore Johnson,et al.  Gigascope: a stream database for network applications , 2003, SIGMOD '03.

[48]  Donald Kossmann,et al.  The state of the art in distributed query processing , 2000, CSUR.

[49]  Walid G. Aref,et al.  Supporting views in data stream management systems , 2010, TODS.

[50]  Danh Le Phuoc,et al.  A Native and Adaptive Approach for Unified Processing of Linked Streams and Linked Data , 2011, SEMWEB.

[51]  Samuel Madden,et al.  Fjording the stream: an architecture for queries over streaming sensor data , 2002, Proceedings 18th International Conference on Data Engineering.

[52]  David Maier,et al.  Semantics and evaluation techniques for window aggregates in data streams , 2005, SIGMOD '05.

[53]  Claudio Gutiérrez,et al.  Introducing Time into RDF , 2007, IEEE Transactions on Knowledge and Data Engineering.

[54]  Michael Eckert,et al.  Two Semantics for CEP, no Double Talk: Complex Event Relational Algebra (CERA) and Its Application to XChange EQ , 2011 .

[55]  Michael Eckert,et al.  Rules for Making Sense of Events: Design Issues for High-Level Event Query and Reasoning Languages (Position Paper) , 2008, AAAI Spring Symposium: AI Meets Business Rules and Process Management.

[56]  Hoan Quoc Nguyen-Mau,et al.  Elastic and Scalable Processing of Linked Stream Data in the Cloud , 2013, SEMWEB.

[57]  Jennifer Widom,et al.  The CQL continuous query language: semantic foundations and query execution , 2006, The VLDB Journal.

[58]  John Miles Smith,et al.  Optimizing the performance of a relational algebra database interface , 1975, CACM.

[59]  Hector Garcia-Molina,et al.  Wave-indices: indexing evolving databases , 1997, SIGMOD '97.

[60]  Marcelo Arenas,et al.  Semantics and complexity of SPARQL , 2006, TODS.

[61]  Michael Stonebraker,et al.  Aurora: a new model and architecture for data stream management , 2003, The VLDB Journal.

[62]  Theodore Johnson,et al.  Out-of-order processing: a new architecture for high-performance stream systems , 2008, Proc. VLDB Endow..

[63]  David Maier,et al.  Exploiting Punctuation Semantics in Continuous Data Streams , 2003, IEEE Trans. Knowl. Data Eng..

[64]  Kirk Pruhs,et al.  Algorithms and metrics for processing multiple heterogeneous continuous queries , 2008, TODS.