Efficient Pattern Detection Over a Distributed Framework

In recent past, work has been done to parallelize pattern detection queries over event stream, by partitioning the event stream on certain keys or attributes. In such partitioning schemes the degree of parallelization totally relies on the available partition keys. A limited number of partitioning keys, or unavailability of such partitioning attributes noticeably affect the distribution of data among multiple nodes, and is a reason of potential data skew and improper resource utilization. Moreover, majority of the past implementations of complex event detection are based on a single machine, hence, they are immune to potential data skew that could be seen in a real distributed environment. In this study, we propose an event stream partitioning scheme that without considering any key attributes partitions the stream over time-windows. This scheme efficiently distributes the event stream partitions across network, and detects pattern sequences in distributed fashion. Our scheme also provides an effective means to minimize potential data skew and handles a substantial number of pattern queries across network.

[1]  Rajeev Motwani,et al.  Operator scheduling in data stream systems , 2004, VLDB 2004.

[2]  Chunhui Zhao,et al.  Elastic Non-contiguous Sequence Pattern Detection for Data Stream Monitoring , 2007, IDEAL.

[3]  Martin Hirzel,et al.  Partition and compose: parallel complex event processing , 2012, DEBS.

[4]  Sharma Chakravarthy,et al.  Scheduling Strategies for Processing Continuous Queries over Streams , 2004, BNCOD.

[5]  Xiaoming Zhang,et al.  Complex Event Processing over distributed probabilistic event streams , 2012, FSKD.

[6]  Nesime Tatbul,et al.  RIP: run-based intra-query parallelism for scalable complex event processing , 2013, DEBS.

[7]  Rajeev Motwani,et al.  Chain: operator scheduling for memory minimization in data stream systems , 2003, SIGMOD '03.

[8]  Miron Livny,et al.  Sequence query processing , 1994, SIGMOD '94.

[9]  Jun Wei,et al.  Sequential event pattern based context-aware adaptation , 2010, Internetware.

[10]  Samuel Madden,et al.  ZStream: a cost-based query processor for adaptively detecting composite events , 2009, SIGMOD Conference.

[11]  Michael Stonebraker,et al.  The 8 requirements of real-time stream processing , 2005, SGMD.

[12]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[13]  N. Immerman,et al.  SASE + : An Agile Language for Kleene Closure over Event Streams , 2007 .

[14]  Elke A. Rundensteiner,et al.  Sequence Pattern Query Processing over Out-of-Order Event Streams , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[15]  A. Rasin,et al.  Streaming for Dummies , 2004 .

[16]  Mohamed A. Sharaf,et al.  Scheduling continuous queries in data stream management systems , 2008, Proc. VLDB Endow..

[17]  Neil Immerman,et al.  On Supporting Kleene Closure over Event Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[18]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[19]  Murali Mani Efficient Event Stream Processing: Handling Ambiguous Events and Patterns with Negation , 2011, DASFAA Workshops.

[20]  Xin Li,et al.  Complex Event Processing over Uncertain Data Streams , 2010, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[21]  Lukasz Golab,et al.  Issues in data stream management , 2003, SGMD.

[22]  Hans-Arno Jacobsen,et al.  Towards highly parallel event processing through reconfigurable hardware , 2011, DaMoN '11.

[23]  Johannes Gehrke,et al.  Distributed event stream processing with non-deterministic finite automata , 2009, DEBS '09.

[24]  Chetan Gupta,et al.  Processing nested complex sequence pattern queries over event streams , 2010, DMSN '10.

[25]  Ji Wu,et al.  QoS-Oriented Multi-query Scheduling over Data Streams , 2009, DASFAA.

[26]  Neil Immerman,et al.  Efficient pattern matching over event streams , 2008, SIGMOD Conference.

[27]  Carlo Zaniolo,et al.  Query Languages and Data Models for Database Sequences and Data Streams , 2004, VLDB.

[28]  Peter R. Pietzuch,et al.  Distributed complex event processing with query rewriting , 2009, DEBS '09.

[29]  Wei Pan,et al.  Event Detection over Live and Archived Streams , 2011, WAIM.

[30]  Kun-Lung Wu,et al.  IBM Research Report SPL Stream Processing Language Specification , 2009 .

[31]  Johannes Gehrke,et al.  Cayuga: A General Purpose Event Monitoring System , 2007, CIDR.

[32]  Sharma Chakravarthy,et al.  Composite Events for Active Databases: Semantics, Contexts and Detection , 1994, VLDB.