Comparison of fuzzy clustering algorithms for classification

The identification of fuzzy models for classification is a very complex task. Often, real world databases have a large number of features and the most relevant ones must be chosen. Recently, a new automatic feature selection for classification problems was proposed to construct compact fuzzy classification models. This technique used the classical fuzzy c-means algorithm. However, other fuzzy clustering algorithms, such as possibilistic c-means, fuzzy possibilistic c-means or possibilistic fuzzy c-means can be used to cluster the data. An open topic of research is what clustering algorithms can be used to derive fuzzy models for classification. This paper addresses this topic, by comparing fuzzy clustering algorithms in terms of computational efficiency and accuracy in classification problems. The algorithms were tested in well-known data sets: iris plant, wine, hepatitis, breast cancer and in a difficult real-world problem: the prediction of bankruptcy

[1]  Derek A. Linkens,et al.  Input selection and partition validation for fuzzy modelling using neural network , 1999, Fuzzy Sets Syst..

[2]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[3]  Heiko Timm,et al.  Differentiated Treatment of Missing Values in Fuzzy Clustering , 2003, IFSA.

[4]  Robert Babuska,et al.  Input selection for nonlinear regression models , 2004, IEEE Transactions on Fuzzy Systems.

[5]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[6]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[7]  James M. Keller,et al.  The possibilistic C-means algorithm: insights and recommendations , 1996, IEEE Trans. Fuzzy Syst..

[8]  João Miguel da Costa Sousa,et al.  Modified Regularity Criterion in Dynamic Fuzzy Modeling Applied to Industrial Processes , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..

[9]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[10]  Thomas A. Runkler Ant colony optimization of clustering models , 2005, Int. J. Intell. Syst..

[11]  R. Nogueira,et al.  The prediction of bankruptcy using fuzzy classifiers , 2005, 2005 ICSC Congress on Computational Intelligence Methods and Applications.

[12]  T. Schneider Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values. , 2001 .

[13]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[14]  James M. Keller,et al.  Will the real iris data please stand up? , 1999, IEEE Trans. Fuzzy Syst..

[15]  Michio Sugeno,et al.  A fuzzy-logic-based approach to qualitative modeling , 1993, IEEE Trans. Fuzzy Syst..

[16]  Magne Setnes,et al.  GA-fuzzy modeling and classification: complexity and performance , 2000, IEEE Trans. Fuzzy Syst..

[17]  James C. Bezdek,et al.  Fuzzy c-means clustering of incomplete data , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[18]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[19]  James C. Bezdek,et al.  A mixed c-means clustering model , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[20]  Magne Setnes,et al.  Fuzzy modeling of client preference from large data sets: an application to target selection in direct marketing , 2001, IEEE Trans. Fuzzy Syst..

[21]  Shigeo Abe,et al.  Function approximation based on fuzzy rules extracted from partitioned numerical data , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[22]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[23]  Uzay Kaymak,et al.  Fuzzy Decision Making in Modeling and Control , 2002, World Scientific Series in Robotics and Intelligent Systems.

[24]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.