Mining high-utility itemsets based on particle swarm optimization

High-utility itemset mining (HUIM) is a critical issue in recent years since it can be used to reveal the profitable products by considering both the quantity and profit factors instead of frequent itemset mining (FIM) or association-rule mining (ARM). Several algorithms have been presented to mine high-utility itemsets (HUIs) and most of the designed algorithms have to handle the exponential search space for discovering HUIs when the number of distinct items and the size of database are very large. In the past, a heuristic HUPEumu-GRAM algorithm was proposed to mine HUIs based on genetic algorithm (GA). For the evolutionary computation (EC) techniques of particle swarm optimization (PSO), it only requires fewer parameters compared to the GA-based approach. Since the traditional PSO mechanism is used to handle the continuous problem, in this paper, the discrete PSO is adopted to encode the particles as the binary variables. An efficient PSO-based algorithm namely HUIM-BPSOsig is proposed to efficiently find HUIs. It first sets the number of discovered high-transaction-weighted utilization 1-itemsets (1-HTWUIs) as the size of a particle based on transaction-weighted utility (TWU) model, which can greatly reduce the combinational problem in evolution process. The sigmoid function is adopted in the updating process of the particles of the designed HUIM-BPSOsig algorithm. Substantial experiments on real-life datasets show that the proposed algorithm has better results compared to the state-of-the-art GA-based algorithm.

[1]  Alicia Troncoso Lora,et al.  Mining quantitative association rules based on evolutionary computation and its application to atmospheric pollution , 2010, Integr. Comput. Aided Eng..

[2]  Vadlamani Ravi,et al.  Association rule mining using binary particle swarm optimization , 2013, Eng. Appl. Artif. Intell..

[3]  Tzung-Pei Hong,et al.  An effective tree structure for mining high utility itemsets , 2011, Expert Syst. Appl..

[4]  Howard J. Hamilton,et al.  Mining itemset utilities from transaction databases , 2006, Data Knowl. Eng..

[5]  Vincent S. Tseng,et al.  Novel Concise Representations of High Utility Itemsets Using Generator Patterns , 2014, ADMA.

[6]  Ansaf Salleb-Aouissi,et al.  QuantMiner: A Genetic Algorithm for Mining Quantitative Association Rules , 2007, IJCAI.

[7]  Qiang Yang,et al.  Mining high utility itemsets , 2003, Third IEEE International Conference on Data Mining.

[8]  Franz Oppacher,et al.  Techniques for evolutionary rule discovery in data mining , 2009, 2009 IEEE Congress on Evolutionary Computation.

[9]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[10]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[11]  Yun Sing Koh,et al.  Weighted Association Rule Mining Using Particle Swarm Optimization , 2011, PAKDD Workshops.

[12]  Mounir Boukadoum,et al.  Particle swarm classification: A survey and positioning , 2013, Pattern Recognit..

[13]  Philip S. Yu,et al.  UP-Growth: an efficient algorithm for high utility itemset mining , 2010, KDD.

[14]  Vincent S. Tseng,et al.  FHM: Faster High-Utility Itemset Mining Using Estimated Utility Co-occurrence Pruning , 2014, ISMIS.

[15]  Tzung-Pei Hong,et al.  An efficient projection-based indexing approach for mining high utility itemsets , 2012, Knowledge and Information Systems.

[16]  Vincent S. Tseng,et al.  EFIM: A Highly Efficient Algorithm for High-Utility Itemset Mining , 2015, MICAI.

[17]  Ying Liu,et al.  A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets , 2005, PAKDD.

[18]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[19]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[20]  Mengchi Liu,et al.  Mining high utility itemsets without candidate generation , 2012, CIKM.

[21]  Philippe Fournier-Viger,et al.  FOSHU: faster on-shelf high utility itemset mining -- with or without negative unit profit , 2015, SAC.

[22]  R. J. Kuo,et al.  Application of particle swarm optimization to association rule mining , 2011, Appl. Soft Comput..

[23]  Aijun An,et al.  Mining top-k high utility patterns over data streams , 2014, Inf. Sci..

[24]  Kandhasamy Premalatha,et al.  Discovery of High Utility Itemsets Using Genetic Algorithm with Ranked Mutation , 2014, Appl. Artif. Intell..

[25]  Philip S. Yu,et al.  Data Mining: An Overview from a Database Perspective , 1996, IEEE Trans. Knowl. Data Eng..

[26]  Minrui Fei,et al.  A Novel Hybrid Binary PSO Algorithm , 2011, ICSI.

[27]  Cory J. Butz,et al.  A Foundational Approach to Mining Itemset Utilities from Databases , 2004, SDM.

[28]  Antonio Gomariz,et al.  The SPMF Open-Source Data Mining Library Version 2 , 2016, ECML/PKDD.