Softening the blow of frequent sequence analysis: soft constraints and temporal accuracy

Mining temporal knowledge has many applications. Such knowledge can be all the more interesting as some time constraints between events can be integrated during the mining task. Both in data mining and machine learning, some methods have been proposed to extract and manage such knowledge using temporal constraints. In particular, some work has been done to mine Generalised Sequential Patterns (GSPs). However, such constraints are often too crisp or need a very precise assessment to avoid erroneous information. Within this context, we propose an approach based on sequence graphs derived from soft temporal constraints. These relaxed constraints enable us to find more GSPs. We also propose a temporal accuracy measure to provide the user with a tool for analysing the numerous extracted patterns.

[1]  Florent Masseglia,et al.  An efficient algorithm for Web usage mining , 1999 .

[2]  Jean-François Boulicaut,et al.  Constraint-Based Mining of Sequential Patterns over Datasets with Consecutive Repetitions , 2003, PKDD.

[3]  Jean-François Boulicaut,et al.  Mining Frequent Sequential Patterns under a Similarity Constraint , 2002, IDEAL.

[4]  Jiawei Han,et al.  Discovering Web access patterns and trends by applying OLAP and data mining technology on Web logs , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[5]  Florent Masseglia,et al.  The PSP Approach for Mining Sequential Patterns , 1998, PKDD.

[6]  Christophe Rigotti,et al.  Constraint-Based Mining of Episode Rules and Optimal Window Sizes , 2004, PKDD.

[7]  A. Akhmetova Discovery of Frequent Episodes in Event Sequences , 2006 .

[8]  Maguelonne Teisseire,et al.  Pre-processing time constraints for efficiently mining generalized sequential patterns , 2004, Proceedings. 11th International Symposium on Temporal Representation and Reasoning, 2004. TIME 2004..

[9]  Mohammed J. Zaki Sequence mining in categorical domains: incorporating constraints , 2000, CIKM '00.

[10]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[11]  Umeshwar Dayal,et al.  From User Access Patterns to Dynamic Hypertext Linking , 1996, Comput. Networks.

[12]  Suh-Yin Lee,et al.  DELISP: Efficient Discovery of Generalized Sequential Patterns by Delimited Pattern-Growth Technology , 2002, PAKDD.

[13]  Myra Spiliopoulou,et al.  WUM: A tool for Web Utilization analysis , 1999 .

[14]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[15]  Céline Fiot,et al.  Why Fuzzy Sequential Patterns can Help Data Summarization: An Application to the INPI Trade Mark Database , 2006, 2006 IEEE International Conference on Fuzzy Systems.

[16]  Myra Spiliopoulou,et al.  WUM - A Tool for WWW Ulitization Analysis , 1998, WebDB.