Advanced approach of sliding window based erasable pattern mining with list structure of industrial fields

Abstract As one of the various data mining areas, erasable pattern mining is a method for solving financial problems that can be caused in various industrial fields. Since the concept of erasable pattern mining was proposed, various relevant techniques have been developed. Dealing with dynamic stream data is an important issue in different data mining areas including erasable pattern mining. In order to deal with partial data with the latest information, a sliding window-based approach was proposed. However, the method still has performance limits in many cases because of its inefficient data structures and mining techniques. Motivated by this problem, we propose a new efficient erasable pattern mining algorithm based on the sliding window that can guarantee robustness and reliability and deal with the latest stream data more effectively. In addition, the newly proposed list data structure and mining techniques allow the proposed method to extract erasable patterns in a more efficient way. Results of empirical experiments using well-known benchmark datasets guarantee that the proposed algorithm outperforms previous state-of-the-art approaches in various aspects.

[1]  Heungmo Ryang,et al.  High utility pattern mining over data streams with sliding window technique , 2016, Expert Syst. Appl..

[2]  Unil Yun,et al.  Mining of high average-utility itemsets using novel list structure and pruning strategy , 2017, Future Gener. Comput. Syst..

[3]  Lei Shu,et al.  Reality mining: A prediction algorithm for disease dynamics based on mobile big data , 2017, Inf. Sci..

[4]  Heungmo Ryang,et al.  Mining weighted erasable patterns by using underestimated constraint-based pruning technique , 2015, J. Intell. Fuzzy Syst..

[5]  Keun Ho Ryu,et al.  Sliding window based weighted maximal frequent pattern mining over data streams , 2014, Expert Syst. Appl..

[6]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[7]  Witold Pedrycz,et al.  Mining erasable itemsets with subset and superset itemset constraints , 2017, Expert Syst. Appl..

[8]  Frans Coenen,et al.  An Efficient Algorithm for Mining Erasable Itemsets Using the Difference of NC-Sets , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[9]  Alexi Delgado,et al.  Environmental conflict analysis using an integrated grey clustering and entropy-weight method: A case study of a mining project in Peru , 2016, Environ. Model. Softw..

[10]  Carson Kai-Sang Leung,et al.  Mining interesting patterns from uncertain databases , 2016, Inf. Sci..

[11]  Ahmed Guessoum,et al.  Complex diseases SNP selection and classification by hybrid Association Rule Mining and Artificial Neural Network - based Evolutionary Algorithms , 2016, Eng. Appl. Artif. Intell..

[12]  Iqbal Gondal,et al.  Dependable large scale behavioral patterns mining from sensor data using Hadoop platform , 2017, Inf. Sci..

[13]  Abdullah Gani,et al.  Hadith data mining and classification: a comparative analysis , 2016, Artificial Intelligence Review.

[14]  Xiaoran Xu,et al.  An Efficient Algorithm for Mining Erasable Itemsets , 2010, ADMA.

[15]  Keun Ho Ryu,et al.  Fast algorithm for high utility pattern mining with the sum of item quantities , 2016, Intell. Data Anal..

[16]  Keiji Yanai,et al.  Event photo mining from Twitter using keyword bursts and image clustering , 2016, Neurocomputing.

[17]  Heungmo Ryang,et al.  Mining recent high average utility patterns based on sliding window from stream data , 2016, J. Intell. Fuzzy Syst..

[18]  Yue-Shi Lee,et al.  An efficient algorithm for incrementally mining frequent closed itemsets , 2013, Applied Intelligence.

[19]  Unil Yun,et al.  Sliding window based weighted erasable stream pattern mining for stream data applications , 2016, Future Gener. Comput. Syst..

[20]  Bay Vo,et al.  MEI: An efficient algorithm for mining erasable itemsets , 2014, Eng. Appl. Artif. Intell..

[21]  Ho-Jin Choi,et al.  Single-pass incremental and interactive mining for weighted frequent patterns , 2012, Expert Syst. Appl..

[22]  Heungmo Ryang,et al.  Erasable itemset mining over incremental databases with weight conditions , 2016, Eng. Appl. Artif. Intell..

[23]  Ruixuan Li,et al.  Role mining using answer set programming , 2016, Future Gener. Comput. Syst..

[24]  Nada Lavrac,et al.  ClowdFlows: Online workflows for distributed big data mining , 2017, Future Gener. Comput. Syst..

[25]  Yan Li,et al.  A distributed ensemble approach for mining healthcare data under privacy constraints , 2016, Inf. Sci..

[26]  Byeong-Soo Jeong,et al.  Efficient Mining of Weighted Frequent Patterns over Data Streams , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[27]  Xiaoran Xu,et al.  Fast mining erasable itemsets using NC_sets , 2012, Expert Syst. Appl..

[28]  Frans Coenen,et al.  A new method for mining Frequent Weighted Itemsets based on WIT-trees , 2013, Expert Syst. Appl..

[29]  Unil Yun,et al.  Efficient Mining of Robust Closed Weighted Sequential Patterns Without Information Loss , 2015, Int. J. Artif. Intell. Tools.

[30]  Shih-Ying Chen,et al.  An efficient algorithm of frequent XML query pattern mining for ebXML applications in e-commerce , 2012, Expert Syst. Appl..

[31]  Young-Koo Lee,et al.  Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases , 2009, IEEE Transactions on Knowledge and Data Engineering.

[32]  Unil Yun,et al.  Incremental mining of weighted maximal frequent itemsets from dynamic databases , 2016, Expert Syst. Appl..

[33]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[34]  Keun Ho Ryu,et al.  Discovering high utility itemsets with multiple minimum supports , 2014, Intell. Data Anal..

[35]  Heungmo Ryang,et al.  Monitoring vehicle outliers based on clustering technique , 2016, Appl. Soft Comput..

[36]  Zhi-Hong Deng,et al.  Mining erasable itemsets , 2009, 2009 International Conference on Machine Learning and Cybernetics.

[37]  Frans Coenen,et al.  Finding "interesting" trends in social networks using frequent pattern mining and self organizing maps , 2012, Knowl. Based Syst..

[38]  Wei Liu,et al.  Frequent patterns mining in multiple biological sequences , 2013, Comput. Biol. Medicine.

[39]  Bay Vo,et al.  A New Approach for Mining Top-Rank-k Erasable Itemsets , 2014, ACIIDS.

[40]  Tzung-Pei Hong,et al.  A novel method for constrained class association rule mining , 2015, Inf. Sci..

[41]  Unil Yun,et al.  A new efficient approach for mining uncertain frequent patterns using minimum data structure without false positives , 2017, Future Gener. Comput. Syst..

[42]  Heungmo Ryang,et al.  Top-k high utility pattern mining with effective threshold raising strategies , 2015, Knowl. Based Syst..

[43]  Heungmo Ryang,et al.  Approximate Maximal Frequent Pattern Mining with Weight Conditions and Error Tolerance , 2016, Int. J. Pattern Recognit. Artif. Intell..