Data mining, also known as Knowledge Discovery in Databases (KDD) is one of the most important and interesting research areas in 21st century. Frequent pattern discovery is one of the important techniques in data mining. The application includes Medicine, Telecommunications and World Wide Web. Nowadays frequent pattern discovery research focuses on finding co-occurrence relationships between items. Apriori algorithm is a classical algorithm for association rule mining. Lots of algorithms for mining association rules and their mutations are proposed on the basis of Apriori algorithm. Most of the previous algorithms Apriori-like algorithm which generates candidates and improving algorithm strategy and structure but at the same time many of the researchers not concentrate on the structure of database. In this research paper, it has been proposed an improved algorithm for mining frequent patterns in large datasets using transposition of the database with minor modification of the Apriori-like algorithm. The main advantage of the proposed method is the database stores in transposed form and in each iteration database is filtered and reduced by generating the transaction id for each pattern. The proposed method reduces the huge computing time and also decreases the database size. Several experiments on real-life data show that the proposed algorithm is very much faster than existing Apriori-like algorithms. Hence the proposed method is very much suitable for the discovering frequent patterns from large datasets.
[1]
Albert,et al.
Emergence of scaling in random networks
,
1999,
Science.
[2]
Rajeev Motwani,et al.
Dynamic itemset counting and implication rules for market basket data
,
1997,
SIGMOD '97.
[3]
D. Watts.
The “New” Science of Networks
,
2004
.
[4]
Ron Kohavi,et al.
Real world performance of association rule algorithms
,
2001,
KDD '01.
[5]
Jian Pei,et al.
Mining frequent patterns without candidate generation
,
2000,
SIGMOD '00.
[6]
Ramakrishnan Srikant,et al.
Fast Algorithms for Mining Association Rules in Large Databases
,
1994,
VLDB.
[7]
Mohammed J. Zaki,et al.
Theoretical Foundations of Association Rules
,
2007
.
[8]
Roberto J. Bayardo,et al.
Efficiently mining long patterns from databases
,
1998,
SIGMOD '98.
[9]
Dennis P. Groth,et al.
Average-Case Performance of the Apriori Algorithm
,
2004,
SIAM J. Comput..