Single-pass based efficient erasable pattern mining using list data structure on dynamic incremental databases

Abstract Many different approaches of data mining have been proposed to satisfy various demands of users. Erasable pattern mining is one of the interesting areas in frequent pattern mining, which was proposed to diagnose and solve financial problems caused in industrial fields. Since its original concept emerged, various relevant approaches have been devised. Analyzing incremental data becomes more important because interesting data are continually accumulated in various application fields including industrial areas. For this reason, an incremental method for erasable pattern mining has also been suggested in order to reflect such a trend. Since incremental data become gradually larger and more complicated with the passage of time, it is important to process such data as quickly and efficiently as possible. However, the previous method has limitations in this respect. Motivated by this challenge, we propose a new incremental erasable pattern mining algorithm including new data structures and mining techniques for efficient incremental data processing. We also demonstrate that the proposed method outperforms previous state-of-the-art approaches through extensive, empirical performance tests.

[1]  Ho-Jin Choi,et al.  Single-pass incremental and interactive mining for weighted frequent patterns , 2012, Expert Syst. Appl..

[2]  Shih-Ying Chen,et al.  An efficient algorithm of frequent XML query pattern mining for ebXML applications in e-commerce , 2012, Expert Syst. Appl..

[3]  Václav Snásel,et al.  An efficient method for mining frequent sequential patterns using multi-Core processors , 2016, Applied Intelligence.

[4]  Bay Vo,et al.  A novel approach for mining maximal frequent patterns , 2017, Expert Syst. Appl..

[5]  Tzung-Pei Hong,et al.  Efficient Algorithms for Mining Erasable Closed Patterns From Product Datasets , 2017, IEEE Access.

[6]  Justin Zhijun Zhan,et al.  Mining of frequent patterns with multiple minimum supports , 2017, Eng. Appl. Artif. Intell..

[7]  Nada Lavrac,et al.  ClowdFlows: Online workflows for distributed big data mining , 2017, Future Gener. Comput. Syst..

[8]  Xiaoran Xu,et al.  Fast mining erasable itemsets using NC_sets , 2012, Expert Syst. Appl..

[9]  Heungmo Ryang,et al.  High utility pattern mining over data streams with sliding window technique , 2016, Expert Syst. Appl..

[10]  Ahmed Guessoum,et al.  Complex diseases SNP selection and classification by hybrid Association Rule Mining and Artificial Neural Network - based Evolutionary Algorithms , 2016, Eng. Appl. Artif. Intell..

[11]  Frans Coenen,et al.  A new method for mining Frequent Weighted Itemsets based on WIT-trees , 2013, Expert Syst. Appl..

[12]  Wei Liu,et al.  Frequent patterns mining in multiple biological sequences , 2013, Comput. Biol. Medicine.

[13]  Francesco Calabrese,et al.  ABACUS: frequent pAttern mining-BAsed Community discovery in mUltidimensional networkS , 2013, Data Mining and Knowledge Discovery.

[14]  Yue-Shi Lee,et al.  An efficient algorithm for incrementally mining frequent closed itemsets , 2013, Applied Intelligence.

[15]  Unil Yun,et al.  Sliding window based weighted erasable stream pattern mining for stream data applications , 2016, Future Gener. Comput. Syst..

[16]  Tzung-Pei Hong,et al.  Using Soft Set Theory for Mining Maximal Association Rules in Text Data , 2016, J. Univers. Comput. Sci..

[17]  Heungmo Ryang,et al.  Mining weighted erasable patterns by using underestimated constraint-based pruning technique , 2015, J. Intell. Fuzzy Syst..

[18]  Abdullah Gani,et al.  Hadith data mining and classification: a comparative analysis , 2016, Artificial Intelligence Review.

[19]  Bay Vo,et al.  MEI: An efficient algorithm for mining erasable itemsets , 2014, Eng. Appl. Artif. Intell..

[20]  Heungmo Ryang,et al.  Erasable itemset mining over incremental databases with weight conditions , 2016, Eng. Appl. Artif. Intell..

[21]  Bay Vo,et al.  A New Approach for Mining Top-Rank-k Erasable Itemsets , 2014, ACIIDS.

[22]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[23]  Zhi-Hong Deng,et al.  Mining erasable itemsets , 2009, 2009 International Conference on Machine Learning and Cybernetics.

[24]  Frans Coenen,et al.  Finding "interesting" trends in social networks using frequent pattern mining and self organizing maps , 2012, Knowl. Based Syst..

[25]  Kawuu W. Lin,et al.  A fast and resource efficient mining algorithm for discovering frequent patterns in distributed computing environments , 2015, Future Gener. Comput. Syst..

[26]  Ruixuan Li,et al.  Role mining using answer set programming , 2016, Future Gener. Comput. Syst..

[27]  Byeong-Soo Jeong,et al.  Efficient Mining of Weighted Frequent Patterns over Data Streams , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[28]  Young-Koo Lee,et al.  Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases , 2009, IEEE Transactions on Knowledge and Data Engineering.

[29]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[30]  Witold Pedrycz,et al.  Mining erasable itemsets with subset and superset itemset constraints , 2017, Expert Syst. Appl..

[31]  Frans Coenen,et al.  An Efficient Algorithm for Mining Erasable Itemsets Using the Difference of NC-Sets , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[32]  Alexi Delgado,et al.  Environmental conflict analysis using an integrated grey clustering and entropy-weight method: A case study of a mining project in Peru , 2016, Environ. Model. Softw..

[33]  Keiji Yanai,et al.  Event photo mining from Twitter using keyword bursts and image clustering , 2016, Neurocomputing.

[34]  Unil Yun,et al.  A new efficient approach for mining uncertain frequent patterns using minimum data structure without false positives , 2017, Future Gener. Comput. Syst..

[35]  Heungmo Ryang,et al.  Top-k high utility pattern mining with effective threshold raising strategies , 2015, Knowl. Based Syst..

[36]  Xiaoran Xu,et al.  An Efficient Algorithm for Mining Erasable Itemsets , 2010, ADMA.