A tree structure for event-based sequence mining

The incorporation of temporal semantics into traditional data mining techniques has led to the development of a new field called temporal data mining. This is especially necessary for extracting useful knowledge from dynamic domains, which by nature are time-varying. However, in practical terms, this is a computationally intractable problem, and therefore, it poses more challenges to efficient processing than non-temporal techniques. In this paper, we present a tree-based structure and a handling algorithm, called TSET-Miner, for frequent temporal pattern mining from time-stamped datasets. The algorithm is based on mining inter-transaction association, and is mainly characterized by the use of a single tree-based data structure for generation and storage of all frequent sequences discovered by mining. Given the versatility involved in the use of a single data structure, it may be extended an adapted to extract other types of patterns with relative little effort. To demonstrate this, we also present TSET^m^a^x-Miner, an algorithm based on the TSET structure, designed to extract maximal frequent event-based sequences.

[1]  Roberto J. Bayardo,et al.  Efficiently mining long patterns from databases , 1998, SIGMOD '98.

[2]  Xun Xu,et al.  Recent development of knowledge-based systems, methods and tools for One-of-a-Kind Production , 2011, Knowl. Based Syst..

[3]  Roque Marín,et al.  Extracting Uncertain Temporal Relations from Mined Frequent Sequences , 2006, Thirteenth International Symposium on Temporal Representation and Reasoning (TIME'06).

[4]  Anthony K. H. Tung,et al.  Breaking the barrier of transactions: mining inter-transaction association rules , 1999, KDD '99.

[5]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[6]  Joong Hyuk Chang,et al.  Mining weighted sequential patterns in a sequence database with a time-interval weight , 2011, Knowl. Based Syst..

[7]  Henry A. Kautz,et al.  Constraint Propagation Algorithms for Temporal Reasoning , 1986, AAAI.

[8]  Maria E. Orlowska,et al.  Finding Negative Event-Oriented Patterns in Long Temporal Sequences , 2004, PAKDD.

[9]  Sushil Jajodia,et al.  Testing complex temporal relationships involving multiple granularities and its application to data mining (extended abstract) , 1996, PODS.

[10]  Heikki Mannila,et al.  Local and Global Methods in Data Mining: Basic Techniques and Open Problems , 2002, ICALP.

[11]  Anthony K. H. Tung,et al.  Efficient Mining of Intertransaction Association Rules , 2003, IEEE Trans. Knowl. Data Eng..

[12]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[13]  Anthony J. T. Lee,et al.  An efficient algorithm for mining closed inter-transaction itemsets , 2008, Data Knowl. Eng..

[14]  Chia-Hui Chang,et al.  ClosedPROWL: Efficient Mining of Closed Frequent Continuities by Projected Window List Technology , 2005, SDM.

[15]  Ramesh C Agarwal,et al.  Depth first generation of long patterns , 2000, KDD '00.

[16]  Qing Li,et al.  Mining Inter-transactional Association Rules: Generalization and Empirical Evaluation , 2001, DaWaK.

[17]  Gustavo Rossi,et al.  An approach to discovering temporal association rules , 2000, SAC '00.

[18]  Marie-Odile Cordier,et al.  An Inductive Database for Mining Temporal Patterns in Event Sequences , 2005, IJCAI.

[19]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[20]  G. P. Bhattacharjee,et al.  Temporal representation and reasoning in artificial intelligence: A review , 2001 .

[21]  Hongjun Lu,et al.  Beyond intratransaction association analysis: mining multidimensional intertransaction association rules , 2000, TOIS.

[22]  Ming-Syan Chen,et al.  On mining general temporal association rules in a publication database , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[23]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[24]  Mark Weiss Data Structures and Problem Solving Using C , 1999 .

[25]  X.S. Wang,et al.  Discovering Frequent Event Patterns with Multiple Granularities in Time Sequences , 1998, IEEE Trans. Knowl. Data Eng..

[26]  Carlos Ordonez,et al.  Discovering Interesting Association Rules in Medical Data , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[27]  Sushil Jajodia,et al.  Discovering Temporal Patterns in Multiple Granularities , 2000, TSDM.

[28]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[29]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[30]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[31]  Hongjun Lu,et al.  A template model for multidimensional inter-transactional association rules , 2002, The VLDB Journal.

[32]  Sridhar Ramaswamy,et al.  Cyclic association rules , 1998, Proceedings 14th International Conference on Data Engineering.

[33]  Chia-Hui Chang,et al.  PROWL: An Efficient Frequent continuity Mining Algorithm on Event Sequences , 2004, DaWaK.

[34]  Zvi M. Kedem,et al.  Pincer-Search: An Efficient Algorithm for Discovering the Maximum Frequent Set , 2002, IEEE Trans. Knowl. Data Eng..

[35]  Anthony J. T. Lee,et al.  An efficient algorithm for mining frequent inter-transaction patterns , 2007, Inf. Sci..

[36]  Ron Rymon,et al.  Search through Systematic Set Enumeration , 1992, KR.

[37]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[38]  Ming-Tat Ko,et al.  Discovering time-interval sequential patterns in sequence databases , 2003, Expert Syst. Appl..

[39]  Maria E. Orlowska,et al.  Finding Temporal Features of Event-Oriented Patterns , 2005, PAKDD.

[40]  Aomar Osmani,et al.  Mining Association Rules in Temporal Sequences , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[41]  Philip S. Yu,et al.  An effective hash-based algorithm for mining association rules , 1995, SIGMOD '95.

[42]  Roque Marín,et al.  On the Evaluation of Mined Frequent Sequences - An Evidence Theory-based Method , 2010, HEALTHINF.

[43]  Sushil Jajodia,et al.  Discovering calendar-based temporal association rules , 2003 .

[44]  Charu C. Aggarwal,et al.  A Tree Projection Algorithm for Generation of Frequent Item Sets , 2001, J. Parallel Distributed Comput..