High-Utility Itemset Mining in Big Dataset

High-utility itemset mining (HUIM) extends frequent itemset mining (FIM). Rather than counting only how often an itemset occurs, it weighs factors that matter more in commercial applications, such as the profit or weight of an itemset. In this paper, we assume the dataset is too large to be loaded into memory and propose a MapReduce framework for this setting. The framework aims to minimize the number of dataset scans and to maximize the parallelism of the mining process.
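To make the setting concrete, the following is a minimal sketch of one map/reduce pass that computes the transaction-weighted utility (TWU) of each item, a standard upper bound used in HUIM to discard unpromising items. It is not the framework proposed in the paper. The profit table, the partitioned transactions, the threshold `MIN_UTIL`, and all function names are hypothetical and chosen only for illustration. Each partition stands in for a chunk of the dataset that one worker can hold in memory.

```python
from collections import Counter
from functools import reduce

# Hypothetical unit profit of each item (illustrative values only).
PROFIT = {"a": 5, "b": 2, "c": 1}

# Hypothetical dataset, already split into partitions small enough for one worker.
# Each transaction maps an item to its purchased quantity.
PARTITIONS = [
    [{"a": 1, "b": 2}, {"b": 1, "c": 3}],
    [{"a": 2, "c": 1}, {"a": 1, "b": 1, "c": 1}],
]

MIN_UTIL = 23  # hypothetical minimum-utility threshold


def transaction_utility(tx):
    """Utility of a whole transaction: sum of quantity * unit profit."""
    return sum(qty * PROFIT[item] for item, qty in tx.items())


def map_partition(partition):
    """Map step: scan one partition once and emit a local TWU per item."""
    local = Counter()
    for tx in partition:
        tu = transaction_utility(tx)
        for item in tx:
            local[item] += tu
    return local


def reduce_counts(left, right):
    """Reduce step: merge two partial TWU tables by summing per item."""
    return left + right


def twu(partitions):
    """Combine the per-partition results into the global TWU table."""
    return reduce(reduce_counts, map(map_partition, partitions), Counter())


def promising_items(partitions, min_util):
    """Items whose TWU reaches the threshold; the rest can be pruned."""
    return {item for item, value in twu(partitions).items() if value >= min_util}


if __name__ == "__main__":
    print(dict(twu(PARTITIONS)))
    print(sorted(promising_items(PARTITIONS, MIN_UTIL)))
```

Because every partition is read exactly once and the partial tables are merged by simple addition, the map calls are independent and could run on separate workers. This mirrors the two goals stated above: few dataset scans and a high degree of parallelism.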