An efficient algorithm for mining top-rank-K frequent patterns from uncertain databases

The analysis and management of uncertain data have gained considerable importance in recent years because of their role in a wide variety of applications, such as sensor networks and privacy-preserving data mining. Many algorithms have been proposed to mine frequent patterns over uncertain databases. However, the existing algorithms for uncertain data generate a large number of candidate patterns and require the user to specify an appropriate support threshold, which is a challenging task. In this paper, we propose a new algorithm, UFAE (uncertain filtering and extending), to mine top-rank-k frequent itemsets (patterns). Mining only the top-rank-k frequent patterns greatly decreases the number of candidate patterns generated and therefore reduces the mining time. Many algorithms exist to mine top-rank-k frequent itemsets from precise data, but none for uncertain databases. Experiments are performed to evaluate the performance of the algorithm on various datasets.
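To make the problem statement concrete, the following is a minimal brute-force sketch of what a top-rank-k query over an uncertain database returns. It is not the UFAE algorithm, whose filtering and extending steps are not described in this abstract. It assumes the common expected-support model, in which each transaction maps items to existence probabilities and the expected support of an itemset is the sum, over transactions, of the product of its items' probabilities. The function names, the dictionary representation, and the rounding used to group equal supports are all illustrative assumptions.

```python
from itertools import combinations
from math import prod


def expected_support(itemset, database):
    """Expected support under the (assumed) independent-item model:
    sum over transactions of the product of item probabilities."""
    return sum(prod(t.get(item, 0.0) for item in itemset) for t in database)


def top_rank_k(database, k, ndigits=6):
    """Brute-force illustration only; NOT the UFAE algorithm.

    Returns (rank, itemset, expected_support) triples for every itemset
    whose expected support is among the k largest distinct values.
    Supports are rounded to `ndigits` so that equal values share a rank.
    """
    items = sorted({item for t in database for item in t})
    supports = {}
    # Enumerate every non-empty itemset; exponential, fine for a toy example.
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            s = round(expected_support(itemset, database), ndigits)
            if s > 0:
                supports[itemset] = s
    # The k largest distinct support values define ranks 1..k.
    top_values = sorted(set(supports.values()), reverse=True)[:k]
    rank_of = {value: r for r, value in enumerate(top_values, start=1)}
    return sorted(
        (rank_of[s], itemset, s) for itemset, s in supports.items() if s in rank_of
    )


# Hypothetical toy database: each transaction maps item -> existence probability.
db = [
    {"a": 0.9, "b": 0.5},
    {"a": 0.6, "c": 0.4},
    {"b": 0.5, "c": 1.0},
]
result = top_rank_k(db, k=2)
```

Note that the user supplies only k, not a support threshold, and that several patterns can share one rank when their expected supports tie. A practical algorithm would avoid enumerating all itemsets; the sketch only fixes the semantics of the output.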
