CART data analysis to attain interpretability in a Fuzzy Logic Classifier

A data driven methodology to automatically derive a Fuzzy Logic Classifier (FLC) only on the basis of the raw signals available, is proposed. The first step is a feature selection performed with the approach of Classification and Regression Trees (CART), to extract the variables in the database which are the most critical for the problem under study. Then a CART is produced using only the previously selected features and is provided to a fully automated algorithm which determines the membership functions and the most appropriate rules to reproduce the classification tree obtained with CART. The resulting FLC attains good performance in terms of generalization and classification, still providing a set of rules which can be easily interpreted in order to achieve a first, intuitive understanding of the phenomenon involved. To assess the potentiality of the approach, the method has been applied to a synthetic database provided for the NIPS 2003 feature selection competition and to a real classification problem.

[1]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Ralf Mikut,et al.  Interpretability issues in data-based learning of fuzzy systems , 2005, Fuzzy Sets Syst..

[3]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[4]  Xizhao Wang,et al.  On the optimization of fuzzy decision trees , 2000, Fuzzy Sets Syst..

[5]  Louis Wehenkel,et al.  A complete fuzzy decision tree technique , 2003, Fuzzy Sets Syst..

[6]  Ajith Abraham,et al.  Feature deduction and ensemble design of intrusion detection systems , 2005, Comput. Secur..

[7]  Stephen L. Chiu,et al.  Fuzzy Model Identification Based on Cluster Estimation , 1994, J. Intell. Fuzzy Syst..

[8]  Serge Guillaume,et al.  Designing fuzzy inference systems from data: An interpretability-oriented review , 2001, IEEE Trans. Fuzzy Syst..

[9]  Zuhair Bandar,et al.  On constructing a fuzzy inference framework using crisp decision trees , 2006, Fuzzy Sets Syst..

[10]  Ferenc Szeifert,et al.  Data-driven generation of compact, accurate, and linguistically sound fuzzy classifiers based on a decision-tree initialization , 2003, Int. J. Approx. Reason..

[11]  Dimitrios I. Fotiadis,et al.  A methodology for automated fuzzy model generation , 2008, Fuzzy Sets Syst..

[12]  John Q. Gan,et al.  Low-level interpretability and high-level interpretability: a unified view of data-driven interpretable fuzzy system modelling , 2008, Fuzzy Sets Syst..

[13]  Isabelle Guyon,et al.  Design of experiments for the NIPS 2003 variable selection benchmark , 2003 .

[14]  Hidetomo Ichihashi,et al.  Neuro-fuzzy ID3: a method of inducing fuzzy decision trees with linear programming for maximizing entropy and an algebraic method for incremental learning , 1996, Fuzzy Sets Syst..

[15]  E. H. Mamdani,et al.  An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Man Mach. Stud..

[16]  Lotfi A. Zadeh,et al.  Fuzzy Logic Toolbox User''''s Guide , 1995 .

[17]  E. Chong,et al.  Fuzzy modeling by induction and pruning of decision trees , 1994, Proceedings of 1994 9th IEEE International Symposium on Intelligent Control.

[18]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[19]  José F. Moreno,et al.  CART-based feature selection of hyperspectral images for crop cover classification , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[20]  Alexander E. Gegov,et al.  Complexity Management in Fuzzy Systems - A Rule Base Compression Approach , 2007, Studies in Fuzziness and Soft Computing.

[21]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[22]  Kim-Fung Man,et al.  Agent-based evolutionary approach for interpretable rule-based knowledge extraction , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[23]  José Valente de Oliveira,et al.  Semantic constraints for membership function optimization , 1999, IEEE Trans. Syst. Man Cybern. Part A.

[24]  John Q. Gan,et al.  An Unsupervised Kernel Based Fuzzy C-Means Clustering Algorithm With Kernel Normalisation , 2004, Int. J. Comput. Intell. Appl..

[25]  Steve R. Gunn,et al.  Result Analysis of the NIPS 2003 Feature Selection Challenge , 2004, NIPS.