An Apriori-based Approach for First-Order Temporal Pattern

Previous studies on mining sequential patterns have focused on temporal patterns specified by some form of propositional temporal logic. However, there are some interesting sequential patterns whose specification needs a more expressive formalism, the first-order temporal logic. In this article, we focus on the problem of mining multi-sequential patterns which are first-order temporal patterns (not expressible in propositional temporal logic). We propose two Apriori-based algorithms to perform this mining task. The first one, the PM (Projection Miner) Algorithm adapts the key idea of the classical GSP algorithm for propositional sequential pattern mining by projecting the first-order pattern in two propositional components during the candidate generation and pruning phases. The second algorithm, the SM (Simultaneous Miner) Algorithm, executes the candidate generation and pruning phases without decomposing the pattern, that is, the mining process, in some extent, does not reduce itself to its propositional counterpart. Our extensive experiments shows that SM scales up far better than PM.

[1]  Sushil Jajodia,et al.  Testing complex temporal relationships involving multiple granularities and its application to data mining (extended abstract) , 1996, PODS.

[2]  James F. Allen,et al.  Actions and Events in Interval Temporal Logic , 1994, J. Log. Comput..

[3]  Arbee L. P. Chen,et al.  An efficient algorithm for mining frequent sequences by a new strategy without support counting , 2004, Proceedings. 20th International Conference on Data Engineering.

[4]  Luc De Raedt,et al.  Constraint Based Mining of First Order Sequences in SeqLog , 2004, Database Support for Data Mining Applications.

[5]  Maguelonne Teisseire,et al.  HYPE: mining hierarchical sequential patterns , 2006, DOLAP '06.

[6]  Zhenglu Yang,et al.  PAID: Mining Sequential Patterns by Passed Item Deduction in Large Databases , 2006, 2006 10th International Database Engineering and Applications Symposium (IDEAS'06).

[7]  Frank Höppner Discovery of Temporal Patterns. Learning Rules about the Qualitative Behaviour of Time Series , 2001, PKDD.

[8]  George Karypis,et al.  A Universal Formulation of Sequential Patterns , 1999 .

[9]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[10]  Simon Fraser MULTI-DIMENSIONAL SEQUENTIAL PATTERN MINING , 2001 .

[11]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[12]  Yen-Liang Chen,et al.  Mining sequential patterns from multidimensional sequence data , 2005, IEEE Transactions on Knowledge and Data Engineering.

[13]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  François Jacquenet,et al.  Mining Frequent Logical Sequences with SPIRIT-LoG , 2002, ILP.

[15]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[16]  Alexander Tuzhilin,et al.  Discovering Unexpected Patterns in Temporal Data Using Temporal Logic , 1997, Temporal Databases, Dagstuhl.

[17]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[18]  Jerzy Stefanowski,et al.  Mining Context Based Sequential Patterns , 2005, AWIC.

[19]  Heikki Mannila,et al.  Discovery of Frequent Episodes in Event Sequences , 1997, Data Mining and Knowledge Discovery.

[20]  Hendrik Blockeel,et al.  From Shell Logs to Shell Scripts , 2001, ILP.

[21]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[22]  Mohammed J. Zaki Efficiently mining frequent trees in a forest , 2002, KDD.

[23]  Hongjun Lu,et al.  Beyond intratransaction association analysis: mining multidimensional intertransaction association rules , 2000, TOIS.

[24]  Balaji Padmanabhan,et al.  Pattern Discovery in Temporal Databases: A Temporal Logic Approach , 1996, KDD.

[25]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[26]  A. Akhmetova Discovery of Frequent Episodes in Event Sequences , 2006 .

[27]  Anne Laurent,et al.  M2SP: Mining Sequential Patterns Among Several Dimensions , 2005, PKDD.

[28]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[29]  Kyuseok Shim,et al.  SPIRIT: Sequential Pattern Mining with Regular Expression Constraints , 1999, VLDB.

[30]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.