Analysis of tree-based uncertain frequent pattern mining techniques without pattern losses

Various large-scale data have been generated in a variety of application fields, since the Internet began to be widely used. Accordingly, researchers have developed various data mining methods for pervasive human-centric computing to deal with the data and discover interesting knowledge. Frequent pattern mining is one of the main issues in data mining, which finds meaningful pattern information from databases. In this area, not only precise data but also uncertain data can be generated depending on environments of data generation. Since the concept of uncertain frequent pattern mining was proposed to overcome the limitations of traditional approaches that cannot deal with uncertain data with existential probabilities of items, several relevant methods have been developed. In this paper, we introduce and analyze state-of-the-art methods based on tree structures, and propose a new uncertain frequent pattern mining approach. We also compare algorithm performance and discuss characteristics of them.

[1]  Keun Ho Ryu,et al.  Fast algorithm for high utility pattern mining with the sum of item quantities , 2016, Intell. Data Anal..

[2]  Tzung-Pei Hong,et al.  A new mining approach for uncertain databases using CUFP trees , 2012, Expert Syst. Appl..

[3]  Fan Zhang,et al.  Accelerating frequent itemset mining on graphics processing units , 2013, The Journal of Supercomputing.

[4]  Reynold Cheng,et al.  Efficient Mining of Frequent Item Sets on Large Uncertain Databases , 2012, IEEE Transactions on Knowledge and Data Engineering.

[5]  Carson Kai-Sang Leung,et al.  Efficient Mining of Frequent Patterns from Uncertain Data , 2007 .

[6]  Lin Feng,et al.  AT-Mine: An Efficient Algorithm of Frequent Itemset Mining on Uncertain Dataset , 2013, J. Comput..

[7]  Ming-Yang Su,et al.  A real-time network intrusion detection system for large-scale attacks based on an incremental mining approach , 2009, Comput. Secur..

[8]  Jiang-hui Cai,et al.  Association rule mining method based on weighted frequent pattern tree in mobile computing environment , 2013, Int. J. Wirel. Mob. Comput..

[9]  Charu C. Aggarwal,et al.  Frequent pattern mining with uncertain data , 2009, KDD.

[10]  Xiaoyang Yu,et al.  Mining community and inferring friendship in mobile social networks , 2016, Neurocomputing.

[11]  Reynold Cheng,et al.  Evaluating Continuous Probabilistic Queries Over Imprecise Sensor Data , 2010, DASFAA.

[12]  Soon Myoung Chung,et al.  Parallel mining of maximal sequential patterns using multiple samples , 2010, The Journal of Supercomputing.

[13]  Carson Kai-Sang Leung,et al.  A Tree-Based Approach for Frequent Pattern Mining from Uncertain Data , 2008, PAKDD.

[14]  Heungmo Ryang,et al.  An uncertainty-based approach: Frequent itemset mining from uncertain data with different item importance , 2015, Knowl. Based Syst..

[15]  Heungmo Ryang,et al.  Incremental high utility pattern mining with static and dynamic databases , 2014, Applied Intelligence.

[16]  Liming Liu,et al.  An Approximation Algorithm Of Mining Frequent Itemsets From Uncertain Dataset , 2012 .

[17]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[18]  Heungmo Ryang,et al.  Top-k high utility pattern mining with effective threshold raising strategies , 2015, Knowl. Based Syst..

[19]  Azriel Rosenfeld,et al.  Face recognition: A literature survey , 2003, CSUR.

[20]  Guodong Fang,et al.  Network Traffic Monitoring Based on Mining Frequent Patterns , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[21]  Unil Yun,et al.  A fast perturbation algorithm using tree structure for privacy preserving utility mining , 2015, Expert Syst. Appl..

[22]  Keun Ho Ryu,et al.  Mining Frequent Weighted Itemsets without Storing Transaction IDs and Generating Candidates , 2017, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[23]  Heungmo Ryang,et al.  Multiple Minimum Support-Based Rare Graph Pattern Mining Considering Symmetry Feature-Based Growth Technique and the Differing Importance of Graph Elements , 2015, Symmetry.

[24]  Yufei Tao,et al.  Efficient Evaluation of Probabilistic Advanced Spatial Queries on Existentially Uncertain Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[25]  Heungmo Ryang,et al.  Mining weighted erasable patterns by using underestimated constraint-based pruning technique , 2015, J. Intell. Fuzzy Syst..

[26]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[27]  Heungmo Ryang,et al.  Approximate Maximal Frequent Pattern Mining with Weight Conditions and Error Tolerance , 2016, Int. J. Pattern Recognit. Artif. Intell..

[28]  Sunil Prabhakar,et al.  Querying imprecise data in moving object environments , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[29]  Edward Hung,et al.  Mining Frequent Itemsets from Uncertain Data , 2007, PAKDD.

[30]  Stuart Barber,et al.  Classification of multiple time signals using localized frequency characteristics applied to industrial process monitoring , 2016, Comput. Stat. Data Anal..

[31]  Aleksandra Slavkovic,et al.  "Secure" Logistic Regression of Horizontally and Vertically Partitioned Distributed Databases , 2007 .

[32]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[33]  Maguelonne Teisseire,et al.  Sequential patterns mining and gene sequence visualization to discover novelty from microarray data , 2011, J. Biomed. Informatics.

[34]  Unil Yun,et al.  Efficient Mining of Robust Closed Weighted Sequential Patterns Without Information Loss , 2015, Int. J. Artif. Intell. Tools.

[35]  Unil Yun,et al.  Incremental mining of weighted maximal frequent itemsets from dynamic databases , 2016, Expert Syst. Appl..

[36]  Dinh Que Tran,et al.  A fusion of data mining techniques for predicting movement of mobile users , 2015, Journal of Communications and Networks.

[37]  Wilfred Ng,et al.  Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases , 2014, IEEE Transactions on Knowledge and Data Engineering.