A unified view of Automata-based algorithms for Frequent Episode Discovery

Frequent Episode Discovery framework is a popular framework in Temporal Data Mining with many applications. Over the years many different notions of frequencies of episodes have been proposed along with different algorithms for episode discovery. In this paper we present a unified view of all such frequency counting algorithms. We present a generic algorithm such that all current algorithms are special cases of it. This unified view allows one to gain insights into different frequencies and we present quantitative relationships among different frequencies. Our unified view also helps in obtaining correctness proofs for various algorithms as we show here. We also point out how this unified view helps us to consider generalization of the algorithm so that they can discover episodes with general partial orders.

[1]  Gemma Casas-Garriga Discovering Unbounded Episodes in Sequential Data , 2003 .

[2]  Christopher D. Carothers,et al.  VOGUE: A Novel Variable Order-Gap State Machine for Modeling Sequences , 2006, PKDD.

[3]  P. S. Sastry,et al.  A fast algorithm for finding frequent episodes in event streams , 2007, KDD '07.

[4]  Debprakash Patnaik,et al.  Inferring neuronal network connectivity from spike data: A temporal data mining approach , 2008, Sci. Program..

[5]  K. Iwanuma,et al.  On anti-monotone frequency measures for extracting sequential patterns from a single very-long data sequence , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[6]  Fabian Mörchen,et al.  Unsupervised pattern mining from symbolic temporal data , 2007, SKDD.

[7]  Heikki Mannila,et al.  Discovering Generalized Episodes Using Minimal Occurrences , 1996, KDD.

[8]  Srivatsan Laxman Discovering Frequent Episodes : Fast Algorithms, Connections With HMMs And Generalizations , 2006 .

[9]  Susan M. Bridges,et al.  Mining fuzzy association rules and fuzzy frequency episodes for intrusion detection , 2000, Int. J. Intell. Syst..

[10]  Ryen W. White,et al.  Stream prediction using a generative model based on frequent episodes in event sequences , 2008, KDD.

[11]  Christophe Rigotti,et al.  Constraint-Based Mining of Episode Rules and Optimal Window Sizes , 2004, PKDD.

[12]  P. S. Sastry,et al.  Discovering frequent episodes and learning hidden Markov models: a formal connection , 2005, IEEE Transactions on Knowledge and Data Engineering.

[13]  Dimitrios Gunopulos,et al.  Episode Matching , 1997, CPM.

[14]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[15]  Ada Wai-Chee Fu,et al.  Mining Frequent Episodes for Relating Financial Events and Stock Trends , 2003, PAKDD.

[16]  Chia-Hui Chang,et al.  Efficient mining of frequent episodes from complex sequences , 2008, Inf. Syst..

[17]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[18]  Meng-Feng Tsai,et al.  Exploiting Frequent Episodes in Weighted Suffix Tree to Improve Intrusion Detection System , 2008, 22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008).

[19]  Avinash Achar,et al.  Discovering general partial orders in event streams , 2009, ArXiv.

[20]  Mikhail J. Atallah,et al.  Reliable detection of episodes in event sequences , 2004, Knowledge and Information Systems.