Attribute Index and Uniform Design Based Multiobjective Association Rule Mining with Evolutionary Algorithm

In association rule mining, evaluating an association rule needs to repeatedly scan database to compare the whole database with the antecedent, consequent of a rule and the whole rule. In order to decrease the number of comparisons and time consuming, we present an attribute index strategy. It only needs to scan database once to create the attribute index of each attribute. Then all metrics values to evaluate an association rule do not need to scan database any further, but acquire data only by means of the attribute indices. The paper visualizes association rule mining as a multiobjective problem rather than a single objective one. In order to make the acquired solutions scatter uniformly toward the Pareto frontier in the objective space, elitism policy and uniform design are introduced. The paper presents the algorithm of attribute index and uniform design based multiobjective association rule mining with evolutionary algorithm, abbreviated as IUARMMEA. It does not require the user-specified minimum support and minimum confidence anymore, but uses a simple attribute index. It uses a well-designed real encoding so as to extend its application scope. Experiments performed on several databases demonstrate that the proposed algorithm has excellent performance, and it can significantly reduce the number of comparisons and time consumption.

[1]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[2]  Mahdi Nasiri,et al.  Numeric Multi-Objective Rule Mining Using Simulated Annealing Algorithm , 2011 .

[3]  Jie Zhang,et al.  Parallel Multi-Swarm PSO Based on K-Medoids and Uniform Design , 2013 .

[4]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[5]  Zvi M. Kedem,et al.  Pincer-Search: An Efficient Algorithm for Discovering the Maximum Frequent Set , 2002, IEEE Trans. Knowl. Data Eng..

[6]  Lothar Thiele,et al.  Comparison of Multiobjective Evolutionary Algorithms: Empirical Results , 2000, Evolutionary Computation.

[7]  Arun N. Swami,et al.  Set-oriented mining for association rules in relational databases , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[8]  Sumit Katiyar,et al.  Hybrid Spectral Efficient Cellular Network Deployment to Reduce RF Pollution , 2012 .

[9]  Ulrich Güntzer,et al.  Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[10]  Yuping Wang,et al.  A clustering multi-objective evolutionary algorithm based on orthogonal and uniform design , 2009, 2009 IEEE Congress on Evolutionary Computation.

[11]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[12]  Bilal Alatas,et al.  MODENAR: Multi-objective differential evolution algorithm for mining numeric association rules , 2008, Appl. Soft Comput..

[13]  Erhan Akin,et al.  Rough particle swarm optimization and its applications in data mining , 2008, Soft Comput..

[14]  David E. Goldberg,et al.  A niched Pareto genetic algorithm for multiobjective optimization , 1994, Proceedings of the First IEEE Conference on Evolutionary Computation. IEEE World Congress on Computational Intelligence.

[15]  David W. Coit,et al.  Multi-objective optimization using genetic algorithms: A tutorial , 2006, Reliab. Eng. Syst. Saf..

[16]  Li-Chen Fu,et al.  A two-phase evolutionary algorithm for multiobjective mining of classification rules , 2010, IEEE Congress on Evolutionary Computation.

[17]  Alex A. Freitas,et al.  Discovering comprehensible classification rules with a genetic algorithm , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[18]  Marco Laumanns,et al.  A unified model for multi-objective evolutionary algorithms with elitism , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[19]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[20]  Jesús Alcalá-Fdez,et al.  A multi-objective evolutionary algorithm for mining quantitative association rules , 2011, 2011 11th International Conference on Intelligent Systems Design and Applications.

[21]  Zvi M. Kedem,et al.  Pincer-Search: A New Algorithm for Discovering the Maximum Frequent Set , 1998, EDBT.

[22]  Clarisse Dhaenens,et al.  Parallel Genetic Algorithms for multi-objective rule mining , 2005 .

[23]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[24]  C. K. Kwong,et al.  A multi-objective genetic algorithm approach to rule mining for affective product design , 2012, Expert Syst. Appl..

[25]  Carlos A. Coello Coello,et al.  Evolutionary Multi-Objective Optimization: Basic Concepts and Some Applications in Pattern Recognition , 2011, MCPR.

[26]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[27]  Shamkant B. Navathe,et al.  An Efficient Algorithm for Mining Association Rules in Large Databases , 1995, VLDB.

[28]  Rajib Mall,et al.  Application of elitist multi-objective genetic algorithm for classification rule generation , 2008, Appl. Soft Comput..

[29]  Chengqi Zhang,et al.  Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support , 2009, Expert Syst. Appl..

[30]  Peter J. Fleming,et al.  Genetic Algorithms for Multiobjective Optimization: FormulationDiscussion and Generalization , 1993, ICGA.

[31]  Behrouz Minaei-Bidgoli,et al.  Multi objective association rule mining with genetic algorithm without specifying minimum support and minimum confidence , 2011, Expert Syst. Appl..

[32]  Rajib Mall,et al.  Predictive and comprehensible rule discovery using a multi-objective genetic algorithm , 2006, Knowl. Based Syst..

[33]  B. Nath,et al.  A New Approach on Rare Association Rule Mining , 2012 .

[34]  Nicolás Marín,et al.  A Complexity Guided Algorithm for Association Rule Extraction on Fuzzy DataCubes , 2008, IEEE Transactions on Fuzzy Systems.

[35]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[36]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[37]  Siu Cheung Hui,et al.  Associative Classification With Artificial Immune System , 2009, IEEE Transactions on Evolutionary Computation.

[38]  Sujatha Srinivasan,et al.  Evolutionary multi objective optimization for rule mining: a review , 2011, Artificial Intelligence Review.

[39]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[40]  Behrouz Minaei-Bidgoli,et al.  Clustering Based Multi-Objective Rule Mining using Genetic Algorithm , 2010, J. Digit. Content Technol. its Appl..

[41]  S. Rahman Reliability Engineering and System Safety , 2011 .

[42]  Qingfu Zhang,et al.  Multiobjective evolutionary algorithms: A survey of the state of the art , 2011, Swarm Evol. Comput..

[43]  Yuping Wang,et al.  Uniform Design Based Hybrid Genetic Algorithm for Multiobjective Bilevel Convex Programming , 2011, 2011 Seventh International Conference on Computational Intelligence and Security.

[44]  Bhabesh Nath,et al.  Multi-objective rule mining using genetic algorithms , 2004, Inf. Sci..

[45]  Hisao Ishibuchi,et al.  Multiobjective Genetic Fuzzy Systems , 2015, Handbook of Computational Intelligence.

[46]  Yuping Wang,et al.  Multiobjective programming using uniform design and genetic algorithm , 2000, IEEE Trans. Syst. Man Cybern. Part C.

[47]  Mehmet Kaya Autonomous classifiers with understandable rule using multi-objective genetic algorithms , 2010, Expert Syst. Appl..

[48]  Bart Goethals,et al.  Memory issues in frequent itemset mining , 2004, SAC '04.

[49]  V. J. Rayward-Smith,et al.  Data mining rules using multi-objective evolutionary algorithms , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[50]  C. Fonseca,et al.  GENETIC ALGORITHMS FOR MULTI-OBJECTIVE OPTIMIZATION: FORMULATION, DISCUSSION, AND GENERALIZATION , 1993 .

[51]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.