Uncertain Frequent Itemsets Mining Algorithm on Data Streams with Constraints

Nowadays, many emerging applications in real-life can produce amount of uncertain data streams, while people are often interested in some aspects. To mine constrained frequent itemsets on uncertain data streams, this paper presents a method. First, determining the order of items in the transactions of data streams according to the properties of constraints; then, inserting items into the tree in order; finally, mining constrained frequent itemsets from the tree. Existing algorithms are compared with the proposed method and the performances are analyzed. Results indicate that the proposed method is effective and efficient, which mines constrained frequent itemsets when users request for the mining results and need no additional memory.

[1]  Carson Kai-Sang Leung,et al.  Distributed Uncertain Data Mining for Frequent Patterns Satisfying Anti-monotonic Constraints , 2014, 2014 28th International Conference on Advanced Information Networking and Applications Workshops.

[2]  Carson Kai-Sang Leung,et al.  PUF-Tree: A Compact Tree Structure for Frequent Pattern Mining of Uncertain Data , 2013, PAKDD.

[3]  Luigi Troiano,et al.  Mining frequent itemsets in data streams within a time horizon , 2014, Data Knowl. Eng..

[4]  Carson Kai-Sang Leung,et al.  Mining of Frequent Itemsets from Streams of Uncertain Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[5]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[6]  Laks V. S. Lakshmanan,et al.  Exploratory mining and pruning optimizations of constrained associations rules , 1998, SIGMOD '98.

[7]  Edward Hung,et al.  Mining Frequent Itemsets from Uncertain Data , 2007, PAKDD.

[8]  Charu C. Aggarwal,et al.  Frequent pattern mining with uncertain data , 2009, KDD.

[9]  Alfredo Cuzzocrea,et al.  Mining constrained frequent itemsets from distributed uncertain data , 2014, Future Gener. Comput. Syst..

[10]  Carson Kai-Sang Leung,et al.  Constrained frequent itemset mining from uncertain data streams , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[11]  Reza Akbarinia,et al.  Fast and Exact Mining of Probabilistic Data Streams , 2013, ECML/PKDD.

[12]  M. H. Margahny,et al.  FAST ALGORITHM FOR MINING ASSOCIATION RULES , 2014 .

[13]  Carson Kai-Sang Leung,et al.  Efficient algorithms for mining constrained frequent patterns from uncertain data , 2009, U '09.

[14]  Carson Kai-Sang Leung,et al.  BigSAM: Mining Interesting Patterns from Probabilistic Databases of Uncertain Big Data , 2014, PAKDD Workshops.

[15]  Philip S. Yu,et al.  A Framework for Clustering Uncertain Data Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[16]  Mohammad Hadi Sadreddini,et al.  A sliding window based algorithm for frequent closed itemset mining over data streams , 2013, J. Syst. Softw..

[17]  Philip S. Yu,et al.  Mining Frequent Patterns in Data Streams at Multiple Time Granularities , 2002 .

[18]  Yuni Xia,et al.  Hyper-structure mining of frequent patterns in uncertain data streams , 2012, Knowledge and Information Systems.

[19]  Carson Kai-Sang Leung,et al.  Fast Tree-Based Mining of Frequent Itemsets from Uncertain Data , 2012, DASFAA.

[20]  Christophe G. Giraud-Carrier,et al.  Efficient mining of high-speed uncertain data streams , 2015, Applied Intelligence.

[21]  Carson Kai-Sang Leung,et al.  A Tree-Based Approach for Frequent Pattern Mining from Uncertain Data , 2008, PAKDD.

[22]  Alfredo Cuzzocrea,et al.  Approximation to Expected Support of Frequent Itemsets in Mining Probabilistic Sets of Uncertain Data , 2015, KES.

[23]  Won Suk Lee,et al.  CP-tree: An adaptive synopsis structure for compressing frequent itemsets over online data streams , 2014, Inf. Sci..

[24]  Tzung-Pei Hong,et al.  A new mining approach for uncertain databases using CUFP trees , 2012, Expert Syst. Appl..

[25]  Alfredo Cuzzocrea,et al.  Distributed Mining of Constrained Frequent Sets from Uncertain Data , 2011, ICA3PP.

[26]  Carson Kai-Sang Leung,et al.  A Scalable Data Analytics Algorithm for Mining Frequent Patterns from Uncertain Data , 2014, PAKDD Workshops.