Enhancement of the classification and reconstruction performance of fuzzy C-means with refinements of prototypes

Abstract Owning to their abilities to reveal structural relationships in data, fuzzy clustering plays a pivotal role in fuzzy modeling, pattern recognition, and data analysis. As supporting an unsupervised mode of learning, fuzzy clustering, brings about unique opportunities to build a structural backbone of numerous constructs in the areas identified above. A follow-up phase is required when the structural findings developed in the form of fuzzy clusters need some refinements, usually when the clustering results are used afterwards in the model being developed in a supervised mode. Following this general line of thought, in this study we propose a novel approach to optimize the clustering and classification performance of the Fuzzy C-Means (FCM) algorithm. Proceeding with a collection of clusters (information granules) produced by the FCM, we carry out the refinements of the results (in order to improve the representation or classification capabilities of fuzzy clusters) by adjusting a location of the prototypes so that a certain performance index becomes optimized. At this phase, the optimization is carried out in a supervised mode with the aid of Differential Evolution (DE). We propose five different strategies to adjust a location of the prototypes. Experimental studies completed on synthetic data and publicly available real-world data quantify the improvement of the representation and classification abilities of the clustering method.

[1]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[2]  Magne Setnes,et al.  Fuzzy relational classifier trained by fuzzy clustering , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[3]  Witold Pedrycz,et al.  Anomaly detection in time series data using a fuzzy c-means clustering , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[4]  Ujjwal Maulik,et al.  A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification , 2005, Fuzzy Sets Syst..

[5]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[6]  P. Holland,et al.  Robust regression using iteratively reweighted least-squares , 1977 .

[7]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[8]  Witold Pedrycz,et al.  A Development of Fuzzy Encoding and Decoding Through Fuzzy Clustering , 2008, IEEE Transactions on Instrumentation and Measurement.

[9]  Andries P. Engelbrecht,et al.  Computational Intelligence: An Introduction , 2002 .

[10]  Piotr A. Kowalski,et al.  Complete Gradient Clustering Algorithm for Features Analysis of X-Ray Images , 2010 .

[11]  Ujjwal Maulik,et al.  Validity index for crisp and fuzzy clusters , 2004, Pattern Recognit..

[12]  Witold Pedrycz,et al.  Kernel-based fuzzy clustering and fuzzy clustering: A comparative experimental study , 2010, Fuzzy Sets Syst..

[13]  Enrico Zio,et al.  Evolutionary fuzzy clustering for the Classification of transients in nuclear components , 2005 .

[14]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[15]  Witold Pedrycz,et al.  Fuzzy Clustering With Partial Supervision in Organization and Classification of Digital Images , 2008, IEEE Transactions on Fuzzy Systems.

[16]  Andrzej Bargiela,et al.  Fuzzy clustering with semantically distinct families of variables: Descriptive and predictive aspects , 2010, Pattern Recognit. Lett..

[17]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[18]  Witold Pedrycz,et al.  Fuzzy relational structures: Learning alternatives for fuzzy modeling , 2013, 2013 Joint IFSA World Congress and NAFIPS Annual Meeting (IFSA/NAFIPS).

[19]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[20]  David G. Stork,et al.  Pattern Classification , 1973 .

[21]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[22]  Xing Xu,et al.  Comparison between Particle Swarm Optimization, Differential Evolution and Multi-Parents Crossover , 2007 .

[23]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[24]  Ujjwal Maulik,et al.  Modified differential evolution based fuzzy clustering for pixel classification in remote sensing imagery , 2009, Pattern Recognit..

[25]  Ujjwal Maulik,et al.  Fuzzy partitioning using a real-coded variable-length genetic algorithm for pixel classification , 2003, IEEE Trans. Geosci. Remote. Sens..

[26]  Raghu Krishnapuram,et al.  Fuzzy co-clustering of documents and keywords , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[27]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Carlos A. Reyes García,et al.  Analysis of an Infant Cry Recognizer for the Early Identification of Pathologies , 2004, Summer School on Neural Networks.

[29]  I. Codreanu A parallel between differential evolution and genetic algorithms with exemplification in a microfluidics optimization problem , 2005, CAS 2005 Proceedings. 2005 International Semiconductor Conference, 2005..

[30]  Jonathan M. Garibaldi,et al.  Improving semi-supervised fuzzy c-means classification of Breast Cancer data using feature selection , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[31]  L Podofillini,et al.  Identification and classification of dynamic event tree scenarios via possibilistic clustering: application to a steam generator tube rupture event. , 2009, Accident; analysis and prevention.

[32]  Witold Pedrycz,et al.  Anomaly Detection and Characterization in Spatial Time Series Data: A Cluster-Centric Approach , 2014, IEEE Transactions on Fuzzy Systems.

[33]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[34]  Witold Pedrycz,et al.  Clustering Spatiotemporal Data: An Augmented Fuzzy C-Means , 2013, IEEE Transactions on Fuzzy Systems.

[35]  Hidetomo Ichihashi,et al.  Postsupervised Hard c-Means Classifier , 2006, RSCTC.

[36]  Matteo Rosa Sentinella,et al.  Comparison and integrated use of differential evolution and genetic algorithms for space trajectory optimisation , 2007, 2007 IEEE Congress on Evolutionary Computation.

[37]  Paul Compton,et al.  Inductive knowledge acquisition: a case study , 1987 .

[38]  Witold Pedrycz,et al.  Fuzzy Systems Engineering - Toward Human-Centric Computing , 2007 .

[39]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[40]  Yishu Zhai,et al.  Multiscale edge detection based on fuzzy c-means clustering , 2006, 2006 1st International Symposium on Systems and Control in Aerospace and Astronautics.

[41]  Enrico Zio,et al.  Bagged Ensemble of Fuzzy C Means Classifiers for Nuclear Transient Identification , 2011 .

[42]  G. Klir,et al.  Evolutionary fuzzy c-means clustering algorithm , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[43]  Edward Tunstel,et al.  An Introduction to Evolutionary Computation , 2001 .

[44]  Carlos A. Reyes García,et al.  Infant Cry Classification to Identify Hypo Acoustics and Asphyxia Comparing an Evolutionary-Neural System with a Neural Network System , 2005, MICAI.

[45]  Hidetomo Ichihashi,et al.  Fuzzy c-Means Classifier for Incomplete Data Sets with Outliers and Missing Values , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[46]  E. Zio,et al.  Identification of nuclear transients via optimized fuzzy clustering , 2005 .