Appropriate granularity specification for fuzzy classifier design by data complexity measures

Tens of thousands of classifiers have been proposed so far. There is no best classifier among them. It is said that the performance of each classifier strongly depends on data sets used for comparison. In recent years, a number of data complexity measures have been proposed to characterize each data set. The aim of this study is to develop a framework for selecting an appropriate classifier and/or its appropriate parameter specification among candidate classifiers based on data complexity measures. It will be possible to clarify the domain of competence of classifiers. As a preliminary study, we propose an appropriate granularity specification method for fuzzy classifier design. First we examine a relation between the performance of classifiers with different granularities and the data complexity of artificial data sets. Next we extract if-then rule-based knowledge from the classification results on the artificial data sets.

[1]  Jesús Alcalá-Fdez,et al.  A Proposal for the Genetic Lateral Tuning of Linguistic Fuzzy Systems and Its Interaction With Rule Selection , 2007, IEEE Transactions on Fuzzy Systems.

[2]  José Martínez Sotoca,et al.  Data Characterization for Effective Prototype Selection , 2005, IbPRIA.

[3]  Francisco Herrera,et al.  Domains of competence of fuzzy rule based classification systems with data complexity measures: A case of study using a fuzzy hybrid genetic based machine learning method , 2010, Fuzzy Sets Syst..

[4]  Hisao Ishibuchi,et al.  Analysis of interpretability-accuracy tradeoff of fuzzy systems by multiobjective fuzzy genetics-based machine learning , 2007, Int. J. Approx. Reason..

[5]  José Martínez Sotoca,et al.  An analysis of how training data complexity affects the nearest neighbor classifiers , 2007, Pattern Analysis and Applications.

[6]  F. Herrera,et al.  A proposal on reasoning methods in fuzzy rule-based classification systems , 1999 .

[7]  Robert P. W. Duin,et al.  On the nonlinearity of pattern classifiers , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[8]  Hisao Ishibuchi,et al.  Fuzzy rule selection by multi-objective genetic local search algorithms and rule evaluation measures in data mining , 2004, Fuzzy Sets Syst..

[9]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[10]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[11]  Hong Yan,et al.  Fuzzy Algorithms: With Applications to Image Processing and Pattern Recognition , 1996, Advances in Fuzzy Systems - Applications and Theory.

[12]  María José del Jesús,et al.  KEEL: a software tool to assess evolutionary algorithms for data mining problems , 2008, Soft Comput..

[13]  Francisco Herrera,et al.  Generating single granularity-based fuzzy classification rules for multiobjective genetic fuzzy rule selection , 2009, 2009 IEEE International Conference on Fuzzy Systems.

[14]  Tin Kam Ho,et al.  Complexity Measures of Supervised Classification Problems , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Hisao Ishibuchi,et al.  Rule weight specification in fuzzy rule-based classification systems , 2005, IEEE Transactions on Fuzzy Systems.

[16]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[17]  Tin Kam Ho,et al.  Measures of Geometrical Complexity in Classification Problems , 2006 .