Incremental Mining of Sequential Patterns over a Stream Sliding Window

Incremental mining of sequential patterns from data streams is one of the most challenging problems in data stream mining. However, previous work on mining sequential patterns from data streams has focused almost exclusively on streams of item-sequences rather than streams of itemset-sequences. In this paper, we propose an efficient single-pass algorithm, called IncSPAM, to maintain the set of sequential patterns from itemset-sequence streams with a transaction-sensitive sliding window. An effective bit-sequence representation of items is used in the proposed algorithm to reduce the time and memory needed to slide the windows. Experiments show that the proposed IncSPAM algorithm is efficient for mining sequential patterns over data streams.
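To make the bit-sequence idea concrete, the following is a minimal sketch of how per-item bit-sequences might be maintained over a transaction-sensitive sliding window. It is not the IncSPAM algorithm itself: it assumes a single stream of itemsets rather than per-customer itemset-sequences, and the class name `SlidingBitWindow` and its methods `slide`, `support`, and `co_support` are hypothetical names chosen for illustration. Each item is assumed to carry one bit per transaction in the window, so that sliding reduces to a shift and a mask.

```python
from collections import defaultdict


class SlidingBitWindow:
    """Per-item bit-sequences over the last `width` transactions.

    Illustrative helper only (an assumption, not taken from the paper).
    Bit 0 of each sequence refers to the newest transaction in the window.
    """

    def __init__(self, width: int) -> None:
        if width < 1:
            raise ValueError("width must be at least 1")
        self.width = width
        self.mask = (1 << width) - 1      # keeps only the last `width` bits
        self.bits = defaultdict(int)      # item -> integer used as a bit-sequence

    def slide(self, itemset) -> None:
        """Append one transaction and expire the oldest one if the window is full."""
        # Age every bit-sequence by one position; the oldest bit falls off the mask.
        for item in list(self.bits):
            shifted = (self.bits[item] << 1) & self.mask
            if shifted:
                self.bits[item] = shifted
            else:
                del self.bits[item]       # item no longer occurs in the window
        # Mark the items of the incoming transaction in the newest position.
        for item in itemset:
            self.bits[item] |= 1

    def support(self, item) -> int:
        """Number of transactions in the window that contain `item`."""
        return bin(self.bits.get(item, 0)).count("1")

    def co_support(self, a, b) -> int:
        """Number of transactions in the window that contain both `a` and `b`."""
        return bin(self.bits.get(a, 0) & self.bits.get(b, 0)).count("1")


if __name__ == "__main__":
    window = SlidingBitWindow(width=3)
    for transaction in [{"a", "b"}, {"a"}, {"b", "c"}, {"a", "c"}]:
        window.slide(transaction)
    # The window now covers the last three transactions only.
    print(window.support("a"), window.support("b"), window.co_support("a", "c"))
```

In this sketch, sliding the window costs one shift and one mask per item currently in the window, and no transaction is ever rescanned, which is the kind of saving in time and memory that a bit-sequence representation aims for.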