Feature Subset Selection for Fuzzy Classification Methods

The automatic generation of fuzzy systems have been widely investigated with several proposed approaches in the literature. Since for most methods the generation process complexity increases exponentially with the number of features, a previous feature selection can highly improve the process. Filters, wrappers and embedded methods are used for feature selection. For fuzzy systems it would be desirable to take the fuzzy granulation of the features domains into account for the feature selection process. In this paper a fuzzy wrapper, previously proposed by the authors, and a fuzzy C4.5 decision tree are used to select features. They are compared with three classic filters and the features selected by the original C4.5 decision tree algorithm, as an embedded method. Results using 10 datasets indicate that the use of the fuzzy granulation of features domains is an advantage to select features for the purpose of inducing fuzzy rule bases.

[1]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[2]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[3]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[4]  Masao Mukaidono,et al.  A fuzzy neural network for pattern classification and feature selection , 2002, Fuzzy Sets Syst..

[5]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[6]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[7]  Li-Xin Wang,et al.  The WM method completed: a flexible fuzzy system approach to data mining , 2003, IEEE Trans. Fuzzy Syst..

[8]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[9]  Louis Wehenkel,et al.  A complete fuzzy decision tree technique , 2003, Fuzzy Sets Syst..

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[11]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[12]  José M. Alonso,et al.  An Interpretability-Guided Modeling Process for Learning Comprehensible Fuzzy Rule-Based Classifiers , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[13]  Inés Couso,et al.  Some Results about Mutual Information-based Feature Selection and Fuzzy Discretization of Vague Data , 2007, 2007 IEEE International Fuzzy Systems Conference.

[14]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[15]  Ian Witten,et al.  Data Mining , 2000 .

[16]  Trevor P. Martin,et al.  Feature Subset Selection Using a Fuzzy Method , 2009, 2009 International Conference on Intelligent Human-Machine Systems and Cybernetics.

[17]  Larry A. Rendell,et al.  A Practical Approach to Feature Selection , 1992, ML.

[18]  Huan Liu,et al.  A Probabilistic Approach to Feature Selection - A Filter Solution , 1996, ICML.

[19]  Byung Ro Moon,et al.  Feature Selection in Genetic Fuzzy Discretization for the Pattern Classification Problems , 2007, IEICE Trans. Inf. Syst..

[20]  Heloisa A. Camargo,et al.  Optimising the Fuzzy Granulation of Attribute Domains , 2009, IFSA/EUSFLAT Conf..

[21]  Francisco Herrera,et al.  Ten years of genetic fuzzy systems: current framework and new trends , 2004, Fuzzy Sets Syst..

[22]  Heloisa A. Camargo,et al.  Fuzzy Feature Subset Selection Using the Wang & Mendel Method , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[23]  Inés Couso,et al.  Mutual information-based feature selection and partition design in fuzzy rule-based classifiers from vague data , 2008, Int. J. Approx. Reason..

[24]  Inés Couso,et al.  Genetic Feature Selection for Fuzzy Discretized Data , 2008 .

[25]  L. Jain,et al.  Fuzzy sets and their application to clustering and training , 2000 .

[26]  Sanmay Das,et al.  Filters, Wrappers and a Boosting-Based Hybrid for Feature Selection , 2001, ICML.