An Efficient Approach for Incrementally Mining Frequent Closed Itemsets

Mining frequent item sets is to discover the groups of items appearing always together excess of a user specified threshold from a transaction database. However, there may be many frequent item sets existing in a transaction database, such that it is difficult to make a decision for a decision maker. Recently, mining frequent closed item sets becomes a major research issue, since a set of the frequent closed item sets is a condensed and complete representation of the frequentitemsets and all the frequent item sets can be derived from the frequent closed item sets. However, the transactions in a transaction database will grow rapidly in a short time, and the frequent closed item sets may be changed due to the addition of the new transactions. This paper proposes an efficient algorithm for incrementally mining frequent closed item sets without scanning the original database and searching the previous closed item sets. The experimental results also show that our algorithm significantly outperforms the previous approaches which need to take a lot of time to search the previous closed item sets.

[1]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[2]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[3]  Jian Pei,et al.  CLOSET+: searching for the best strategies for mining frequent closed itemsets , 2003, KDD '03.

[4]  Salvatore Orlando,et al.  Fast and memory efficient mining of frequent closed itemsets , 2006, IEEE Transactions on Knowledge and Data Engineering.

[5]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[6]  Reda Alhajj,et al.  DRFP-tree: disk-resident frequent pattern tree , 2009, Applied Intelligence.

[7]  Jia-Ling Koh,et al.  An Efficient Approach for Maintaining Association Rules Based on Adjusting FP-Tree Structures1 , 2004, DASFAA.

[8]  Nan Jiang,et al.  CFI-Stream: mining closed frequent itemsets in data streams , 2006, KDD '06.

[9]  Philip S. Yu,et al.  Scoring the Data Using Association Rules , 2003, Applied Intelligence.

[10]  Rajeev Motwani,et al.  Approximate Frequency Counts over Data Streams , 2012, VLDB.

[11]  Jie Dong,et al.  BitTableFI: An efficient mining frequent itemsets algorithm , 2007, Knowl. Based Syst..

[12]  Nicolas Pasquier,et al.  Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[13]  Ruoming Jin,et al.  An algorithm for in-core frequent itemset mining on streaming data , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[14]  Yue-Shi Lee,et al.  An efficient approach for updating the structure for mining frequent patterns , 2012, 2012 IEEE International Conference on Industrial Engineering and Engineering Management.

[15]  Philip S. Yu,et al.  Moment: maintaining closed frequent itemsets over a stream sliding window , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[16]  Wilfred Ng,et al.  A survey on algorithms for mining frequent itemsets over data streams , 2008, Knowledge and Information Systems.

[17]  Bingru Yang,et al.  An Adaptive Frequent Itemset Mining Algorithm for Data Stream with Concept Drifts , 2008, 2008 International Conference on Computer Science and Software Engineering.

[18]  Maguelonne Teisseire,et al.  Towards a new approach for mining frequent itemsets on data stream , 2007, Journal of Intelligent Information Systems.

[19]  Suh-Yin Lee,et al.  Mining frequent itemsets over data streams using efficient window sliding techniques , 2009, Expert Syst. Appl..