A Novel Complex-Events Analytical System Using Episode Pattern Mining Techniques

Along with the rapid development of IoT Internet of Things, there comes the 'Big Data' era with the fast growth of digital data and the requirements rise for gaining useful knowledge by analyzing the rich data of complex types. How to effectively and efficiently apply data mining techniques to analyze the big data plays a crucial role in real-world use cases. In this paper, we propose a novel complex-events analytical system based on episode pattern mining techniques. The proposed system consists of four major components, including data preprocessing, pattern mining, rules management and prediction modules. For the core mining process, we proposed a new algorithm named EM-CESEpisode Mining over Complex Event Sequences based on the sliding window approach. We also make the proposed system integrable with other application platform for complex event analysis, such that users can easily and quickly make use of it to gain the valuable information from complex data. Finally, excellent experimental results on a real-life dataset for electric power consumption monitoring validate the efficiency and effectiveness of the proposed system.

[1]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[2]  Heikki Mannila,et al.  Discovering Generalized Episodes Using Minimal Occurrences , 1996, KDD.

[3]  Kian-Lee Tan,et al.  Finding constrained frequent episodes using minimal occurrences , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[4]  Alaaeldin M. Hafez,et al.  An efficient stream mining technique , 2008 .

[5]  Heikki Mannila,et al.  Discovering Frequent Episodes in Sequences , 1995, KDD.

[6]  Vincent S. Tseng,et al.  RuleGrowth: mining sequential rules common to several sequences by pattern-growth , 2011, SAC.

[7]  Philip S. Yu,et al.  Mining high utility episodes in complex event sequences , 2013, KDD.

[8]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[9]  Vincent S. Tseng,et al.  Effective temporal data classification by integrating sequential pattern mining and probabilistic induction , 2009, Expert Syst. Appl..

[10]  Vincent S. Tseng,et al.  Efficient Mining of Frequent Target Episodes from Complex Event Sequences , 2014, ICS.

[11]  Gemma Casas-Garriga Discovering Unbounded Episodes in Sequential Data , 2003 .

[12]  Vincent S. Tseng,et al.  A Novel Episode Mining Methodology for Stock Investment , 2014, J. Inf. Sci. Eng..

[13]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.