GM-Tree: An efficient frequent pattern mining technique for dynamic database

Since its inception, mining frequent patterns have become an imperative issue in data mining. The main problem in this area is to find out the association rule that identifies the relationships among a set of items. But the most expensive step in association rule is finding frequent itemsets and hence it draw the attention of many important research. In this paper, we propose a novel tree structure, called GM(Generate and Merge)Tree, which is a combination of prefix based incremental mining using canonical ordering and batch incrementing techniques. Our approach makes the tree structure more compact, canonically ordered of nodes and avoids sequential incrementing of transactions. It also helps to give a scalable algorithm with minimum overheads of modifying the tree structure during update operations. This algorithm is especially expected to give better results in case of extremely large transaction database in a dynamic environment. The experimental work has been carried out on two large datasets. Test results show the efficiency and effectiveness of the proposed approach by outperforming the traditional FP-Tree, CanTree(Canonical-order Tree) and BIT(Batch Incremental Tree).

[1]  Mohammed J. Zaki,et al.  CHARM: An Efficient Algorithm for Closed Itemset Mining , 2002, SDM.

[2]  Reda Alhajj,et al.  Alternative Method for Increnentally Constructing the FP-Tree , 2006 .

[3]  R. S. Thakur,et al.  Frequent Pattern Mining Using Modified CP-Tree for Knowledge Discovery , 2010, ADMA.

[4]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[5]  Carson Kai-Sang Leung,et al.  CanTree: a tree structure for efficient incremental mining of frequent patterns , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[6]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[7]  Guang-shun Yao,et al.  Mining algorithm for maximal frequent itemsets based on improved FP-tree: Mining algorithm for maximal frequent itemsets based on improved FP-tree , 2013 .

[8]  Jiawei Han,et al.  IncSpan: incremental mining of sequential patterns in large database , 2004, KDD.

[9]  Reda Alhajj,et al.  Alternative Method for Increnentally Constructing the FP-Tree , 2006, 2006 3rd International IEEE Conference Intelligent Systems.

[10]  Cheng-Fa Tsai,et al.  A new fast algorithms for mining association rules in large databases , 2002, IEEE International Conference on Systems, Man and Cybernetics.

[11]  Hussein H. Aly,et al.  Fast mining of association rules in large-scale problems , 2001, Proceedings. Sixth IEEE Symposium on Computers and Communications.

[12]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[13]  Mohammed J. Zaki,et al.  Efficient algorithms for mining closed itemsets and their lattice structure , 2005, IEEE Transactions on Knowledge and Data Engineering.

[14]  Yang Chuan-jian Mining algorithm for maximal frequent itemsets based on improved FP-tree , 2012 .

[15]  Carson Kai-Sang Leung,et al.  Efficient algorithms for the mining of constrained frequent patterns from uncertain data , 2010, SKDD.

[16]  Lei Ma,et al.  Improved algorithm for mining maximum frequent patterns based on FP-Tree , 2012 .

[17]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[18]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[19]  P. V. G. D. Prasad Reddy,et al.  Batch incremental processing for FP-tree construction using FP-Growth algorithm , 2012, Knowledge and Information Systems.

[20]  Janusz Kacprzyk,et al.  Intelligent Techniques and Tools for Novel System Architectures , 2008 .

[21]  Tzung-Pei Hong,et al.  Incrementally fast updated frequent pattern trees , 2008, Expert Syst. Appl..

[22]  Reda Alhajj,et al.  Constructing Complete FP-Tree for Incremental Mining of Frequent Patterns in Dynamic Databases , 2006, IEA/AIE.

[23]  Ramakrishnan Srikant,et al.  Mining quantitative association rules in large relational tables , 1996, SIGMOD '96.

[24]  Zhan Li,et al.  Knowledge and Information Systems , 2007 .

[25]  P. V. G. D. Prasad Reddy,et al.  Batch Processing for Incremental FP-tree Construction , 2010 .

[26]  Jia-Ling Koh,et al.  An Efficient Approach for Maintaining Association Rules Based on Adjusting FP-Tree Structures1 , 2004, DASFAA.

[27]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[28]  Osmar R. Zaïane,et al.  Incremental mining of frequent patterns without candidate generation or support constraint , 2003, Seventh International Database Engineering and Applications Symposium, 2003. Proceedings..