Effects of Data Reduction on the Generalization Ability of Parallel Distributed Genetic Fuzzy Rule Selection

Genetic fuzzy rule selection has been successfully used to design accurate and interpretable fuzzy classifiers from numerical data. In our former study, we proposed its parallel distributed implementation which can drastically decrease the computational time by dividing both a population and a training data set into sub-groups. In this paper, we examine the effect of data reduction on the generalization ability of fuzzy rule-based classifiers designed by our parallel distributed approach. Through computational experiments, we show that data reduction can be realized without severe deterioration in the generalization ability of the designed fuzzy classifiers.

[1]  Maliha S. Nash,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 2001, Technometrics.

[2]  Francisco Herrera,et al.  On the combination of evolutionary algorithms and stratified strategies for training set selection in data mining , 2006, Appl. Soft Comput..

[3]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[4]  Huan Liu,et al.  Instance Selection and Construction for Data Mining , 2001 .

[5]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[6]  F. Gomide,et al.  Ten years of genetic fuzzy systems: current framework and new trends , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[7]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[8]  Hisao Ishibuchi,et al.  Multiobjective Genetic Fuzzy Systems: Review and Future Research Directions , 2007, 2007 IEEE International Fuzzy Systems Conference.

[9]  Hisao Ishibuchi,et al.  Parallel distributed genetic fuzzy rule selection , 2008, Soft Comput..

[10]  Hisao Ishibuchi,et al.  Classification and modeling with linguistic information granules - advanced approaches to linguistic data mining , 2004, Advanced information processing.

[11]  Francisco Herrera,et al.  Genetic fuzzy systems: taxonomy, current research trends and prospects , 2008, Evol. Intell..

[12]  Enrique Alba,et al.  Parallelism and evolutionary algorithms , 2002, IEEE Trans. Evol. Comput..

[13]  Yusuke Nojima,et al.  Computational Efficiency of Parallel Distributed Genetic Fuzzy Rule Selection for Large Data Sets , 2008 .

[14]  Hisao Ishibuchi,et al.  Data Set Subdivision for Parallel Distributed Implementation of Genetic Fuzzy Rule Selection , 2007, 2007 IEEE International Fuzzy Systems Conference.

[15]  Erick Cantú-Paz,et al.  A Survey of Parallel Genetic Algorithms , 2000 .

[16]  Hiroshi Motoda,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998, The Springer International Series in Engineering and Computer Science.

[17]  Hisao Ishibuchi,et al.  Selecting fuzzy if-then rules for classification problems using genetic algorithms , 1995, IEEE Trans. Fuzzy Syst..

[18]  Francisco Herrera,et al.  Stratification for scaling up evolutionary prototype selection , 2005, Pattern Recognit. Lett..