Window Update Patterns in Stream Operators

Continuous queries applied over nonterminating data streams usually specify windows in order to obtain an evolving ---yet restricted--- set of tuples and thus provide timely results. Among other typical variants, sliding windows are mostly employed in stream processing engines and several advanced techniques have been suggested for their incremental evaluation. In this paper, we set out to study the existence of monotonic-related semantics in windowing constructs towards a more efficient maintenance of their changing contents. We investigate update patterns observed in common window variants as well as their impact on windowed adaptations of typical operators (like selection, join or aggregation), offering more insight towards design and implementation of stream processing mechanisms. Finally, to demonstrate its significance, this framework is validated for several windowed operations against streaming datasets with simulations at diverse arrival rates and window sizes.

[1]  Jennifer Widom,et al.  The CQL continuous query language: semantic foundations and query execution , 2006, The VLDB Journal.

[2]  Lukasz Golab,et al.  Update-pattern-aware modeling and processing of continuous queries , 2005, SIGMOD '05.

[3]  Jef Wijsen,et al.  Current Trends in Database Technology - EDBT 2006, EDBT 2006 Workshops PhD, DataX, IIDB, IIHA, ICSNW, QLQP, PIM, PaRMA, and Reactivity on the Web, Munich, Germany, March 26-31, 2006, Revised Selected Papers , 2006, EDBT Workshops.

[4]  Jennifer Widom,et al.  Towards a streaming SQL standard , 2008, Proc. VLDB Endow..

[5]  Walid G. Aref,et al.  Incremental Evaluation of Sliding-Window Queries over Data Streams , 2007 .

[6]  David Maier,et al.  Exploiting Punctuation Semantics in Continuous Data Streams , 2003, IEEE Trans. Knowl. Data Eng..

[7]  Michael Stonebraker,et al.  The 8 requirements of real-time stream processing , 2005, SGMD.

[8]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[9]  David Maier,et al.  Using Punctuation Schemes to Characterize Strategies for Querying over Data Streams , 2007, IEEE Transactions on Knowledge and Data Engineering.

[10]  Theodore Johnson,et al.  A Heartbeat Mechanism and Its Application in Gigascope , 2005, VLDB.

[11]  Timos K. Sellis,et al.  Window Specification over Data Streams , 2006, EDBT Workshops.

[12]  Qiang Chen,et al.  Aurora : a new model and architecture for data stream management ) , 2006 .

[13]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[14]  Ying Xing,et al.  The Design of the Borealis Stream Processing Engine , 2005, CIDR.

[15]  David Maier,et al.  Semantics and evaluation techniques for window aggregates in data streams , 2005, SIGMOD '05.

[16]  Michael Stonebraker,et al.  Linear Road: A Stream Data Management Benchmark , 2004, VLDB.

[17]  Jennifer Widom,et al.  Resource Sharing in Continuous Sliding-Window Aggregates , 2004, VLDB.

[18]  Bernhard Seeger,et al.  A Temporal Foundation for Continuous Queries over Data Streams , 2005, COMAD.