Incremental Mining of Closed Sequential Patterns in Multiple Data Streams

Sequential pattern mining searches for the relative sequence of events, allowing users to make predictions on discovered sequential patterns. Due to drastically advanced information technology over recent years, data have rapidly changed, growth in data amount has exploded and real-time demand is increasing, leading to the data stream environment. Data in this environment cannot be fully stored and ineptitude in traditional mining techniques has led to the emergence of data stream mining technology. Multiple data streams are a branch of the data stream environment. The MILE algorithm cannot preserve previously mined sequential patterns when new data are entered because of the concept of one-time fashion mining. To address this problem, we propose the ICspan algorithm to continue mining sequential patterns through an incremental approach and to acquire a more accurate mining result. In addition, due to the algorithm constraint in closed sequential patterns mining, the generation and records for sequential patterns will be reduced, leading to a decrease of memory usage and to an effective increase of execution efficiency.

[1]  Suh-Yin Lee,et al.  Incremental Mining of Sequential Patterns over a Stream Sliding Window , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[2]  Won Suk Lee,et al.  Decaying Obsolete Information in Finding Recent Frequent Itemsets over Data Streams , 2004, IEICE Trans. Inf. Syst..

[3]  Christie I. Ezeife,et al.  SSM : A Frequent Sequential Data Stream Patterns Miner , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[4]  Xindong Wu,et al.  Sequential pattern mining in multiple streams , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[5]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[6]  M. Teisseire,et al.  SPEED : Mining Maximal Sequential Patterns over Data Streams , 2022 .

[7]  Paul R. Cohen,et al.  Searching for Structure in Multiple Streams of Data , 1996, ICML.

[8]  Won Suk Lee,et al.  Efficient mining method for retrieving sequential patterns over online data streams , 2005, J. Inf. Sci..

[9]  Philip S. Yu,et al.  Mining long sequential patterns in a noisy environment , 2002, SIGMOD '02.

[10]  Maguelonne Teisseire,et al.  Sequential Pattern Mining , 2009, Encyclopedia of Data Warehousing and Mining.