Summary In this paper, we propose a new algorithm which associates weight to each item in the transaction database based on the significance of the corresponding item. Weighted support is calculated using the weight and the frequency of occurrence of the item in the transactions. This weighted support is used to find the frequent itemsets. We partition the database among ‘N’ processors and generate closed frequent itemsets in parallel. The parallel algorithm used minimizes communication by exchanging only weighted supports among the processors. We generate closed frequent itemsets to reduce the number of itemsets and also as all frequent itemsets can be obtained from closed frequent itemsets, we are not losing any interesting and significant itemsets. The performance of the proposed algorithm is compared to count distribution algorithm in terms of scaleup, speedup, sizeup and is shown that the proposed algorithm performs better.
[1]
Rakesh Agarwal,et al.
Fast Algorithms for Mining Association Rules
,
1994,
VLDB 1994.
[2]
Tomasz Imielinski,et al.
Mining association rules between sets of items in large databases
,
1993,
SIGMOD Conference.
[3]
Nicolas Pasquier,et al.
Efficient Mining of Association Rules Using Closed Itemset Lattices
,
1999,
Inf. Syst..
[4]
Rakesh Agrawal,et al.
Parallel Mining of Association Rules
,
1996,
IEEE Trans. Knowl. Data Eng..
[5]
Philip S. Yu,et al.
Efficient mining of weighted association rules (WAR)
,
2000,
KDD '00.
[6]
Dimitris Kanellopoulos,et al.
Association Rules Mining: A Recent Overview
,
2006
.
[7]
Fionn Murtagh,et al.
Weighted Association Rule Mining using weighted support and significance framework
,
2003,
KDD '03.