Paper Information - Using Loose and Tight Bounds to Mine Frequent Itemsets

Using Loose and Tight Bounds to Mine Frequent Itemsets

Mining frequent itemsets is a core operation in many data mining problems. The operation, however, is data intensive, produces a large output, and requires many scans of the database. In this paper, we propose using loose and tight bounds to mine frequent itemsets. We first use loose bounds to remove candidate itemsets whose support cannot satisfy the preset threshold. We then check whether the tight bounds determine the frequency of the remaining candidate itemsets. For the itemsets that cannot be resolved this way, we scan the database. This method reduces both the number of candidate frequent itemsets that have to be tested and the number of database scans.
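The abstract does not define the two kinds of bounds, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes the loose bound is the monotonicity upper bound (a candidate's support cannot exceed the support of any of its subsets) and that the tight bounds are the inclusion-exclusion deduction rules of Calders et al. [3, 8]. The names `loose_upper_bound`, `tight_bounds`, `classify`, and the `support` dictionary are hypothetical and chosen for illustration; `support` must hold the support of every proper subset of the candidate, including the empty set (the number of transactions).

```python
from itertools import combinations


def subsets_between(x, itemset):
    """Yield every J with x <= J < itemset (J is a proper subset of itemset)."""
    rest = sorted(itemset - x)
    for r in range(len(rest)):  # r < len(rest) keeps J strictly smaller
        for extra in combinations(rest, r):
            yield x | frozenset(extra)


def loose_upper_bound(itemset, support):
    """Assumed loose bound: minimum support over the immediate subsets."""
    return min(support[itemset - {item}] for item in itemset)


def tight_bounds(itemset, support):
    """Assumed tight bounds: inclusion-exclusion rules, one per subset X."""
    lower, upper = 0, float("inf")
    for r in range(len(itemset) + 1):
        for combo in combinations(sorted(itemset), r):
            x = frozenset(combo)
            total = sum(
                (-1) ** (len(itemset - j) + 1) * support[j]
                for j in subsets_between(x, itemset)
            )
            if len(itemset - x) % 2 == 1:
                upper = min(upper, total)  # odd |I \ X| gives an upper bound
            else:
                lower = max(lower, total)  # even |I \ X| gives a lower bound
    return lower, upper


def classify(itemset, support, min_support):
    """Decide a candidate's fate without touching the database if possible."""
    if loose_upper_bound(itemset, support) < min_support:
        return "pruned"        # removed by the loose bound
    lower, upper = tight_bounds(itemset, support)
    if upper < min_support:
        return "infrequent"    # decided by the tight upper bound
    if lower >= min_support:
        return "frequent"      # decided by the tight lower bound
    return "scan"              # undecided: count it in the database


fs = frozenset
# Supports from a hypothetical six-transaction database:
# {a,b,c}, {a,b}, {a,c}, {b,c}, {a,b,c}, {a}
support = {
    fs(): 6,
    fs("a"): 5, fs("b"): 4, fs("c"): 4,
    fs("ab"): 3, fs("ac"): 3, fs("bc"): 3,
}

print(tight_bounds(fs("abc"), support))
print(classify(fs("abc"), support, min_support=3))
print(classify(fs("ab"), support, min_support=4))
```

In this toy data the loose bound alone cannot reject {a, b, c} at a threshold of 3, while the tight bounds can, so no scan is needed for it. The candidate {a, b} at a threshold of 4 stays undecided and would have to be counted in the database.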

Lei Jia | Jun Yao | Renqing Pei

[1] Jiawei Han, et al. Discovery of Multiple-Level Association Rules from Large Databases, 1995, VLDB.

[2] Tomasz Imielinski, et al. Mining association rules between sets of items in large databases, 1993, SIGMOD Conference.

[3] Toon Calders, et al. Deducing Bounds on the Frequency of Itemsets, 2002.

[4] Laks V. S. Lakshmanan, et al. Mining frequent itemsets with convertible constraints, 2001, Proceedings 17th International Conference on Data Engineering.

[5] Ramakrishnan Srikant, et al. Fast Algorithms for Mining Association Rules in Large Databases, 1994, VLDB.

[6] Nicolas Pasquier, et al. Efficient Mining of Association Rules Using Closed Itemset Lattices, 1999, Inf. Syst.

[7] Jean-François Boulicaut, et al. Frequent Closures as a Concise Representation for Binary Data Mining, 2000, PAKDD.

[8] Toon Calders, et al. Mining All Non-derivable Frequent Itemsets, 2002, PKDD.

[9] Jian Pei, et al. Mining frequent patterns without candidate generation, 2000, SIGMOD 2000.