Accuracy improvement of genetic fuzzy rule selection with candidate rule addition and membership tuning

Data mining is a very active and rapidly growing research area in the field of computer science. Its goal is to obtain useful knowledge for users from a database. Association rule mining from a database is one of the most well-known data mining techniques. In general, a large number of if-then rules are extracted by specifying minimum support and confidence levels. They are, however, too complicated as knowledge for users to understand many rules at one time. Multiobjective genetic fuzzy rule selection from Pareto-optimal and near Pareto-optimal rules is a promising approach which can obtain an accurate and simple rule set by considering the accuracy maximization and the complexity minimization. In this paper, we propose two extensions of multiobjective genetic fuzzy rule selection for designing more accurate fuzzy rule-based classifiers. One extension is to add compatible rules with misclassified patterns into candidate rules for genetic fuzzy rule selection. The other is to tune membership functions after genetic fuzzy rule selection. We examine the effects of these extensions through computational experiments on imbalanced data sets.

[1]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[2]  Hisao Ishibuchi,et al.  Rule weight specification in fuzzy rule-based classification systems , 2005, IEEE Transactions on Fuzzy Systems.

[3]  Hisao Ishibuchi,et al.  Three-objective genetics-based machine learning for linguistic rule extraction , 2001, Inf. Sci..

[4]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[5]  Jesús Alcalá-Fdez,et al.  A Proposal for the Genetic Lateral Tuning of Linguistic Fuzzy Systems and Its Interaction With Rule Selection , 2007, IEEE Transactions on Fuzzy Systems.

[6]  Roberto J. Bayardo,et al.  Mining the most interesting rules , 1999, KDD '99.

[7]  Hisao Ishibuchi,et al.  Selecting fuzzy if-then rules for classification problems using genetic algorithms , 1995, IEEE Trans. Fuzzy Syst..

[8]  J. Shaffer Modified Sequentially Rejective Multiple Test Procedures , 1986 .

[9]  Hisao Ishibuchi,et al.  Obtaining accurate classifiers with Pareto-optimal and near Pareto-optimal rules , 2008, Artificial Life and Robotics.

[10]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[11]  Francisco Herrera,et al.  Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms , 2009, Fuzzy Sets Syst..

[12]  Hisao Ishibuchi,et al.  Single-objective and two-objective genetic algorithms for selecting linguistic rules for pattern classification problems , 1997, Fuzzy Sets Syst..

[13]  Hisao Ishibuchi,et al.  Fuzzy rule selection by multi-objective genetic local search algorithms and rule evaluation measures in data mining , 2004, Fuzzy Sets Syst..

[14]  Jiawei Han,et al.  CPAR: Classification based on Predictive Association Rules , 2003, SDM.

[15]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[16]  Hisao Ishibuchi,et al.  Effect of rule weights in fuzzy rule-based classification systems , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[17]  José M. Alonso,et al.  An Interpretability-Guided Modeling Process for Learning Comprehensible Fuzzy Rule-Based Classifiers , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[18]  S. Bridges,et al.  Genetic Algorithm Optimization of Membership Functions for Mining Fuzzy Association Rules , 2000 .

[19]  Reda Alhajj,et al.  Utilizing Genetic Algorithms to Optimize Membership Functions for Fuzzy Weighted Association Rules Mining , 2006, Applied Intelligence.

[20]  Hisao Ishibuchi,et al.  Prescreening of Candidate Rules Using Association Rule Mining and Pareto-optimality in Genetic Rule Selection , 2007, KES.

[21]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[22]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[23]  G. Hommel,et al.  Improvements of General Multiple Test Procedures for Redundant Systems of Hypotheses , 1988 .