New parallel processing strategies in complex event processing systems with data streams

Sensor network–based application has gained increasing attention where data streams gathered from distributed sensors need to be processed and analyzed with timely responses. Distributed complex event processing is an effective technology to handle these data streams by matching of incoming events to persistent pattern queries. Therefore, a well-managed parallel processing scheme is required to improve both system performance and the quality-of-service guarantees of the system. However, the specific properties of pattern operators increase the difficulties of implementing parallel processing. To address this issue, a new parallelization model and three parallel processing strategies are proposed for distributed complex event processing systems. The effects of temporal constraints, for example, sliding windows, are included in the new parallelization model to enable the processing load for the overlap between windows of a batch induced by each input event to be shared by the downstream machines to avoid events that may result in wrong decisions. The proposed parallel strategies can keep the complex event processing system working stably and continuously during the elapsed time. Finally, the application of our work is demonstrated using experiments on the StreamBase system regardless of the increased input rate of the stream or the increased time window size of the operator.

[1]  Kurt Rothermel,et al.  Predictable Low-Latency Event Detection With Parallel Complex Event Processing , 2015, IEEE Internet of Things Journal.

[2]  Fuyuan Xiao,et al.  Weighted Evidence Combination Based on Distance of Evidence and Entropy Function , 2016, Int. J. Distributed Sens. Networks.

[3]  Nesime Tatbul,et al.  RIP: run-based intra-query parallelism for scalable complex event processing , 2013, DEBS.

[4]  Antonio F. Gómez-Skarmeta,et al.  A Cooperative Approach to Traffic Congestion Detection With Complex Event Processing and VANET , 2012, IEEE Transactions on Intelligent Transportation Systems.

[5]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[6]  Eric A. Brewer,et al.  Highly available, fault-tolerant, parallel dataflows , 2004, SIGMOD '04.

[7]  Yong Deng,et al.  A New Aggregating Operator for Linguistic Information Based on D Numbers , 2016, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[8]  R BHARGAVI,et al.  Semantic intrusion detection with multisensor data fusion using complex event processing , 2013 .

[9]  Alessandro Margara,et al.  Processing flows of information: From data stream to complex event processing , 2012, CSUR.

[10]  Kun-Lung Wu,et al.  Auto-parallelizing stateful distributed streaming applications , 2012, 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT).

[11]  Qiang Chen,et al.  Aurora : a new model and architecture for data stream management ) , 2006 .

[12]  Michael Stonebraker,et al.  Monitoring Streams - A New Class of Data Management Applications , 2002, VLDB.

[13]  Chetan Gupta,et al.  High-performance nested CEP query processing over event streams , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[14]  Michael Stonebraker,et al.  Fault-tolerance in the borealis distributed stream processing system , 2008, ACM Trans. Database Syst..

[15]  Neil Immerman,et al.  Efficient pattern matching over event streams , 2008, SIGMOD Conference.

[16]  Xinyang Deng,et al.  Dependence assessment in human reliability analysis based on D numbers and AHP , 2017 .

[17]  Jingren Zhou,et al.  SCOPE: easy and efficient parallel processing of massive data sets , 2008, Proc. VLDB Endow..

[18]  Ge Yu,et al.  Deadline-aware complex event processing models over distributed monitoring streams , 2012, Math. Comput. Model..

[19]  Alessandro Margara,et al.  Low latency complex event processing on parallel hardware , 2012, J. Parallel Distributed Comput..

[20]  Samuel Madden,et al.  ZStream: a cost-based query processor for adaptively detecting composite events , 2009, SIGMOD Conference.

[21]  Elke A. Rundensteiner,et al.  Revisiting Pipelined Parallelism in Multi-Join Query Processing , 2005, VLDB.

[22]  Theodore Johnson,et al.  Query-Aware Partitioning for Monitoring Massive Network Data Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[23]  Kurt Rothermel,et al.  MCEP: A Mobility-Aware Complex Event Processing System , 2014, ACM Trans. Internet Techn..

[24]  Johannes Gehrke,et al.  Distributed event stream processing with non-deterministic finite automata , 2009, DEBS '09.

[25]  Minos N. Garofalakis,et al.  Issues in complex event processing: Status and prospects in the Big Data era , 2017, J. Syst. Softw..

[26]  Martin Hirzel,et al.  Partition and compose: parallel complex event processing , 2012, DEBS.

[27]  Volker Markl,et al.  Parallelizing query optimization , 2008, Proc. VLDB Endow..

[28]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[29]  Ying Xing,et al.  The Design of the Borealis Stream Processing Engine , 2005, CIDR.

[30]  Xinyang Deng,et al.  D-DEMATEL: a new method to identify critical success factors in emergency management , 2017 .

[31]  Yanlei Diao,et al.  SASE: Complex Event Processing over Streams , 2006, ArXiv.

[32]  Ying Xing,et al.  A Cooperative, Self-Configuring High-Availability Solution for Stream Processing , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[33]  Johannes Gehrke,et al.  Cayuga: A General Purpose Event Monitoring System , 2007, CIDR.

[34]  Mostafa S. Haghjoo,et al.  Dispatching stream operators in parallel execution of continuous queries , 2011, The Journal of Supercomputing.

[35]  Magdalena Balazinska,et al.  A latency and fault-tolerance optimizer for online parallel query plans , 2011, SIGMOD '11.

[36]  Mostafa S. Haghjoo,et al.  Parallel processing of continuous queries over data streams , 2010, Distributed and Parallel Databases.

[37]  Fuyuan Xiao,et al.  Economical and Fault-Tolerant Load Balancing in Distributed Stream Processing Systems , 2012, IEICE Trans. Inf. Syst..

[38]  Fuyuan Xiao,et al.  Nested Pattern Queries Processing Optimization over Multi-dimensional Event Streams , 2013, 2013 IEEE 37th Annual Computer Software and Applications Conference.

[39]  Ugur Çetintemel,et al.  Plan-based complex event detection across distributed sources , 2008, Proc. VLDB Endow..

[40]  Beng Chin Ooi,et al.  Parallelizing stateful operators in a distributed stream processing system: how, should you and how much? , 2012, DEBS.

[41]  Chetan Gupta,et al.  E-Cube: multi-dimensional event sequence analysis using hierarchical pattern query sharing , 2011, SIGMOD '11.