EvoCMAR: A New Evolutionary Method to Directly Mine Association Rules for Classification

In this paper, we propose an evolutionary method with a three-layer structure to directly mine association rules for classification. The association rules have been demonstrated to be useful for classification, such as classification based on association rule (CBA) and classification method based on multiple association rule (CMAR), and they are found to be more accurate than some traditional methods, such as C4.5. Generally speaking, there are two phases in an associative classification method: (i) association rules mining; (ii) classification by association rules. However, the two phases are almost separated, viz, during the first phase, the mining of association rules does not focus on classification. Moreover, when building the classifier in the second phase, most of the association rues will be pruned. As a result, if we are able to directly mine the classification association rules, we can save time. Meanwhile, we can expect even better accuracy because the mining procedure itself considers the classification. In this paper, we build a novel evolutionary method, named evolutionary classification method based on multiple association rule (EvoCMAR), to tackle these problems, and the simulation results show that it performs well in both accuracy and speed. © 2010 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.

[1]  George Karypis,et al.  Frequent substructure-based approaches for classifying chemical compounds , 2003, IEEE Transactions on Knowledge and Data Engineering.

[2]  Andreas Stafylopatis,et al.  Efficient Evolution of Accurate Classification Rules Using a Combination of Gene Expression Programming and Clonal Selection , 2008, IEEE Transactions on Evolutionary Computation.

[3]  Jiawei Han,et al.  Discriminative Frequent Pattern Analysis for Effective Classification , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[4]  Jiuyong Li On optimal rule discovery , 2006 .

[5]  Ester Bernadó-Mansilla,et al.  Fuzzy-UCS: A Michigan-Style Learning Fuzzy-Classifier System for Supervised Learning , 2009, IEEE Transactions on Evolutionary Computation.

[6]  Monique Snoeck,et al.  Classification With Ant Colony Optimization , 2007, IEEE Transactions on Evolutionary Computation.

[7]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[8]  Sung-Bae Cho,et al.  An Evolutionary Algorithm Approach to Optimal Ensemble Classifiers for DNA Microarray Data Analysis , 2008, IEEE Transactions on Evolutionary Computation.

[9]  Jinyan Li,et al.  Efficient mining of emerging patterns: discovering trends and differences , 1999, KDD '99.

[10]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[11]  Jian Pei,et al.  CMAR: accurate and efficient classification based on multiple class-association rules , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[12]  Jianyong Wang,et al.  HARMONY: Efficiently Mining the Best Rules for Classification , 2005, SDM.

[13]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[14]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[15]  Weimin Xiao,et al.  Evolving accurate and compact classification rules with gene expression programming , 2003, IEEE Trans. Evol. Comput..

[16]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[17]  Alex Alves Freitas,et al.  Understanding the crucial differences between classification and discovery of association rules: a position paper , 2000, SKDD.

[18]  Lalit M. Patnaik,et al.  Application of genetic programming for multicategory pattern classification , 2000, IEEE Trans. Evol. Comput..