Probabilistic and fuzzy logic based event processing for effective business intelligence

This paper focuses on Probabilistic Complex Event Processing (PCEP) in the context of real world event sources of data streams. PCEP executes complex event pattern queries on the continuously streaming probabilistic data with uncertainty. The methodology consists of two phases: Efficient Generic Event Filtering (EGEF) and probabilistic event sequence prediction paradigm. In the first phase, a Non1deterministic Finite Automaton (NFA) based event matching allows to filter the relevant events by discovering the occurrences of the user defined event patterns in a large volume of continuously arriving data streams. In order to, express the complex event patterns in a more efficient form, a Complex Event Processing (CEP) language named as Complex Event Pattern Subscription Language (CEPSL) is developed by extending the existing high level event query languages. Furthermore, query plan1based approach is used to compile the specified event patterns into the NFA automaton and to distribute to a cluster of state machines to improve the scalability. In the second phase, an effective Dynamic Fuzzy Probabilistic Relational Model (DFPRM) is proposed to construct the probability space in the form of event hierarchy. The proposed system deploys a Probabilistic Fuzzy Logic (PFL) based inference engine to derive the composite of event sequence approximately with the reduced probability space. To determine the effectiveness of the proposed approach, a detailed performance analysis is performed using a prototype implementation.

[1]  Alfian Abdul Halin,et al.  Soccer event detection via collaborative multimodal feature analysis and candidate ranking , 2013, Int. Arab J. Inf. Technol..

[2]  Johannes Gehrke,et al.  Distributed event stream processing with non-deterministic finite automata , 2009, DEBS '09.

[3]  Daisy Zhe Wang,et al.  Probabilistic Complex Event Triggering , 2009 .

[4]  Peter R. Pietzuch,et al.  Distributed complex event processing with query rewriting , 2009, DEBS '09.

[5]  Fei Hu,et al.  Quality of Service , 2014 .

[6]  Dan Suciu,et al.  Efficient query evaluation on probabilistic databases , 2004, The VLDB Journal.

[7]  Michael Eckert,et al.  Complex Event Processing (CEP) , 2009, Informatik-Spektrum.

[8]  Neil Immerman,et al.  Efficient pattern matching over event streams , 2008, SIGMOD Conference.

[9]  Feifei Li,et al.  Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations , 2008, IEEE Transactions on Knowledge and Data Engineering.

[10]  Mohamed A. Soliman,et al.  Top-k Query Processing in Uncertain Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[11]  Georges Hébrail,et al.  Data stream management and mining , 2007, NATO ASI Mining Massive Data Sets for Security.

[12]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[13]  Dennis Shasha,et al.  Filtering algorithms and implementation for very fast publish/subscribe systems , 2001, SIGMOD '01.

[14]  Pedro M. Domingos,et al.  Dynamic Probabilistic Relational Models , 2003, IJCAI.

[15]  Florian Matthes,et al.  Proceedings of the 10th international conference on Advances in Database Technology , 2006 .

[16]  Alejandro P. Buchmann,et al.  Complex Event Processing , 2009, it Inf. Technol..

[17]  Alessandro Margara,et al.  Processing flows of information: From data stream to complex event processing , 2012, CSUR.

[18]  Johannes Gehrke,et al.  Towards Expressive Publish/Subscribe Systems , 2006, EDBT.

[19]  Umeshwar Dayal,et al.  The architecture of an active database management system , 1989, SIGMOD '89.

[20]  Helmut Veith,et al.  Efficient filtering in publish-subscribe systems using binary decision diagrams , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[21]  Christopher Ré,et al.  Event queries on correlated probabilistic streams , 2008, SIGMOD Conference.

[22]  Isabel L. Nunes,et al.  Handling Human-Centered Systems Uncertainty Using Fuzzy Logics - A Review~!2010-04-16~!2010-06-30~!2010-08-07~! , 2010 .

[23]  Kyoung Soo Bok,et al.  Efficient Complex Event Processing over RFID Streams , 2012, Int. J. Distributed Sens. Networks.

[24]  David L. Hicks,et al.  Mining Massive Data Sets for Security , 2008 .

[25]  Zhanhuai Li,et al.  Complex Event Processing over Unreliable RFID Data Streams , 2011, APWeb.

[26]  E. Al-Shaer High-performance Event Filtering for Distributed Dynamic Multi-point Applications : Survey and Evaluation , 2007 .

[27]  Angelo CORSARO,et al.  Quality of service in publish/subscribe middleware , 2012 .

[28]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[29]  Ben Taskar,et al.  Selectivity estimation using probabilistic models , 2001, SIGMOD '01.

[30]  Yanlei Diao,et al.  SASE: Complex Event Processing over Streams , 2006, ArXiv.

[31]  Jennifer Widom,et al.  Working Models for Uncertain Data , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[32]  Johannes Gehrke,et al.  Cayuga: A General Purpose Event Monitoring System , 2007, CIDR.