Extended Time Constraints for Sequence Mining

Many applications require techniques for temporal knowledge discovery. Some of these techniques can handle time constraints between events, and in particular some work has addressed the mining of generalized sequential patterns. However, such constraints are often too crisp, or must be set very precisely to avoid erroneous information. In this paper we therefore propose to soften the temporal constraints used for generalized sequential pattern mining. To handle these constraints during mining, we design an algorithm based on sequence graphs. Moreover, because the relaxed constraints may yield more generalized patterns, we propose a temporal accuracy measure to help analyze the numerous discovered patterns.
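To make the idea of softening a crisp time constraint concrete, the following minimal sketch contrasts a hard maximum-gap test with a relaxed one that returns a degree of satisfaction. Everything in it is an illustrative assumption rather than the definition used in the paper: the names `crisp_maxgap`, `soft_maxgap`, and `pattern_accuracy`, the `tolerance` parameter, the linear decay, and the use of the minimum degree as a stand-in for a temporal accuracy score.

```python
from typing import Sequence


def crisp_maxgap(gap: float, maxgap: float) -> bool:
    """Crisp constraint: the gap between two events is accepted or rejected."""
    return gap <= maxgap


def soft_maxgap(gap: float, maxgap: float, tolerance: float) -> float:
    """Softened constraint (assumed form): degree in [0, 1].

    Gaps up to maxgap are fully accepted; beyond that the degree decays
    linearly and reaches 0 at maxgap + tolerance.
    """
    if gap <= maxgap:
        return 1.0
    if tolerance <= 0 or gap >= maxgap + tolerance:
        return 0.0
    return (maxgap + tolerance - gap) / tolerance


def pattern_accuracy(gaps: Sequence[float], maxgap: float, tolerance: float) -> float:
    """Hypothetical accuracy of one pattern occurrence: its weakest gap degree."""
    return min((soft_maxgap(g, maxgap, tolerance) for g in gaps), default=1.0)


if __name__ == "__main__":
    # A gap of 3 time units violates a crisp maxgap of 2 ...
    print(crisp_maxgap(3, 2))
    # ... but is partially accepted once the constraint is relaxed by 2 units.
    print(soft_maxgap(3, 2, 2))
    print(pattern_accuracy([1, 3, 2], maxgap=2, tolerance=2))
```

Under these assumptions, an occurrence that would be discarded by the crisp test is kept with a reduced score, which is the kind of information an accuracy measure could use to rank the larger set of patterns that relaxed constraints produce.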
