Determination of Rule Patterns in Complex Event Processing Using Machine Learning Techniques

Abstract Complex Event Processing (CEP) is a novel and promising methodology that enables the real-time analysis of stream event data. The main purpose of CEP is detection of the complex event patterns from the atomic and semantically low-level events such as sensor, log, or RFID data. Determination of the rule patterns for matching these simple events based on the temporal, semantic, or spatial correlations is the central task of CEP systems. In the current design of the CEP systems, experts provide event rule patterns. Having reached maturity, the Big Data Systems and Internet of Things (IoT) technology require the implementation of advanced machine learning approaches for automation in the CEP domain. The goal of this research is proposing a machine learning model to replace the manual identification of rule patterns. After a pre-processing stage (dealing with missing values, data outliers, etc.), various rule-based machine learning approaches were applied to detect complex events. Promising results with high preciseness were obtained. A comparative analysis of the performance of classifiers is discussed.

[1]  Yushun Fan,et al.  Complex event processing in enterprise information systems based on RFID , 2007, Enterp. Inf. Syst..

[2]  Brent Martin,et al.  INSTANCE-B ASED LEARNING: Nearest Neighbour with Generalisation , 1995 .

[3]  Daeyoung Kim,et al.  Complex Event Processing in EPC Sensor Network Middleware for Both RFID and WSN , 2008, 2008 11th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC).

[4]  Fusheng Wang,et al.  Bridging Physical and Virtual Worlds: Complex Event Processing for RFID Data Streams , 2006, EDBT.

[5]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[6]  Dong Wang,et al.  Design of RFID Middleware Based on Complex Event Processing , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.

[7]  Gary M. Weiss,et al.  Activity recognition using cell phone accelerometers , 2011, SKDD.

[8]  Sun Jiang ' hong,et al.  Large Rotating Machinery Fault Diagnosis and Knowledge Rules Acquiring Based on Improved RIPPER , 2009 .

[9]  Salvatore J. Stolfo,et al.  A data mining framework for building intrusion detection models , 1999, Proceedings of the 1999 IEEE Symposium on Security and Privacy (Cat. No.99CB36344).

[10]  Alessandro Margara,et al.  Complex event processing with T-REX , 2012, J. Syst. Softw..

[11]  Adrian Paschke,et al.  Knowledge-based processing of complex stock market events , 2012, EDBT '12.

[12]  Yanlei Diao,et al.  High-performance complex event processing over streams , 2006, SIGMOD Conference.

[13]  Jun Wu,et al.  A new approach for classification of fingerprint image quality , 2008, 2008 7th IEEE International Conference on Cognitive Informatics.

[14]  Bogdan Gabrys,et al.  Data-driven Soft Sensors in the process industry , 2009, Comput. Chem. Eng..

[15]  Tapio Elomaa,et al.  In Defense of C4.5: Notes in Learning One-Level Decision Trees , 1994, ICML.

[16]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[17]  Opher Etzion,et al.  Event Processing in Action , 2010 .

[18]  Emiel Krahmer,et al.  Detecting Problematic Turns in Human-Machine Interactions: Rule-induction Versus Memory-based Learning Approaches , 2001, ACL.

[19]  Brian R. Gaines,et al.  Induction of ripple-down rules applied to modeling large databases , 1995, Journal of Intelligent Information Systems.

[20]  Tatiana Jaworska Application of fuzzy rule-based classifier to CBIR in comparison with other classifiers , 2014, 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[21]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[22]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[23]  Eibe Frank,et al.  Combining Naive Bayes and Decision Tables , 2008, FLAIRS.

[24]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[25]  Charu C. Aggarwal An Introduction to Sensor Data Analytics , 2013, Managing and Mining Sensor Data.

[26]  Antonio F. Gómez-Skarmeta,et al.  CEP-traj: An event-based solution to process trajectory data , 2015, Inf. Syst..