Mining Frequent Patterns based on Compressed FP-tree without Conditional FP-tree Generation

Frequent patterns mining are widely used in many practical data mining applications. Therefore, current research focuses on developing frequent patterns mining algorithms of high performances and FP-growth is proved as an important and efficient frequent patterns mining algorithm. In this paper, a new algorithm Temporary Root growth based on Compressed FP-tree, i.e. TR-CFP, is proposed. This algorithm employs a temporary root constructing thought during mining on a CFP- tree without conditional FP-tree generation. TR-CFP saves large memory space occupied by FP-tree and the cost of constructing many conditional FP-trees. Experiments show that the time and space for TR-CFP have reduced significantly compared to FP- growth mining based on FP-tree. Furthermore, TR-CFP has a particular character other than Apriori and FP-growth, that it can specially mine frequent patterns of the designated length dynamically and efficiently. In further experiments, even at a low support threshold, when the whole mining process takes a long time, special mining of TR-CFP is still very quick. This particular character may be very useful in applications.

[1]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[2]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[3]  Laks V. S. Lakshmanan,et al.  Efficient mining of constrained correlated sets , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[4]  Tomasz Imielinski,et al.  Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[6]  Raj P. Gopalan,et al.  CT-PRO: A Bottom-Up Non Recursive Frequent Itemset Mining Algorithm Using Compressed FP-Tree Data Structure , 2004, FIMI.

[7]  R. Gopalan,et al.  High Performance Frequent Patterns Extraction using Compressed FP-Tree , .

[8]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[9]  Yong Qiu,et al.  An improved algorithm of mining from FP-tree , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).