Automatic design of interpretable fuzzy predicate systems for clustering using self-organizing maps

Abstract In the area of pattern recognition, clustering algorithms are a family of unsupervised classifiers designed with the aim to discover unrevealed structures in the data. While this is a never ending research topic, many methods have been developed with good theoretical and practical properties. One of such methods is based on self organizing maps (SOM), which have been successfully used for data clustering, using a two levels clustering approach. Newer on the field, clustering systems based on fuzzy logic improve the performance of traditional approaches. In this paper we combine both approaches. Most of the previous works on fuzzy clustering are based on fuzzy inference systems, but we propose the design of a new clustering system in which we use predicate fuzzy logic to perform the clustering task, being automatically designed based on data. Given a datum, degrees of truth of fuzzy predicates associated with each cluster are computed using continuous membership functions defined over data features. The predicate with the maximum degree of truth determines the cluster to be assigned. Knowledge is discovered from data, obtained using the SOM generalization aptitude and taking advantage of the well-known SOM abilities to discover natural data grouping when compared with direct clustering. In addition, the proposed approach adds linguistic interpretability when membership functions are analyzed by a field expert. We also present how this approach can be used to deal with partitioned data. Results show that clustering accuracy obtained is high and it outperforms other methods in the majority of datasets tested.

[1]  Lotfi A. Zadeh,et al.  The concept of a linguistic variable and its application to approximate reasoning-III , 1975, Inf. Sci..

[2]  T. Kohonen Self-organized formation of topology correct feature maps , 1982 .

[3]  I. Burhan Türksen,et al.  Enhanced Fuzzy System Models With Improved Fuzzy Clustering Algorithm , 2008, IEEE Transactions on Fuzzy Systems.

[4]  Wei-Shen Tai,et al.  Apply extended self-organizing map to cluster and classify mixed-type data , 2011, Neurocomputing.

[5]  Lotfi A. Zadeh,et al.  The Concepts of a Linguistic Variable and its Application to Approximate Reasoning , 1975 .

[6]  Michalis Vazirgiannis,et al.  On Clustering Validation Techniques , 2001, Journal of Intelligent Information Systems.

[7]  Zhaohong Deng,et al.  Knowledge-Leverage-Based Fuzzy System and Its Modeling , 2013, IEEE Transactions on Fuzzy Systems.

[8]  Virginia L. Ballarin,et al.  A framework for tissue discrimination in Magnetic Resonance brain images based on predicates analysis and Compensatory Fuzzy Logic , 2008 .

[9]  Erkki Oja,et al.  Class distributions on SOM surfaces for feature extraction and object retrieval , 2004, Neural Networks.

[10]  Alan C. Evans,et al.  MRI Simulation Based Evaluation and Classifications Methods , 1999, IEEE Trans. Medical Imaging.

[11]  Okyay Kaynak,et al.  Fuzzy modeling based on generalized conjunction operations , 2002, IEEE Trans. Fuzzy Syst..

[12]  Gustavo J. Meschino,et al.  Using SOM as a Tool for Automated Design of Clustering Systems Based on Fuzzy Predicates , 2012, WSOM.

[13]  Fionn Murtagh,et al.  Interpreting the Kohonen self-organizing feature map using contiguity-constrained clustering , 1995, Pattern Recognit. Lett..

[14]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[15]  Shigeo Abe,et al.  Function approximation based on fuzzy rules extracted from partitioned numerical data , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[16]  M. Hanmandlu,et al.  The brain MR Image segmentation techniques and use of diagnostic packages. , 2010, Academic radiology.

[17]  Chia-Feng Juang,et al.  A Self-Organizing TS-Type Fuzzy Network With Support Vector Learning and its Application to Classification Problems , 2007, IEEE Transactions on Fuzzy Systems.

[18]  Jouko Lampinen,et al.  Clustering properties of hierarchical self-organizing maps , 1992, Journal of Mathematical Imaging and Vision.

[19]  Feng Zhao,et al.  Fuzzy clustering algorithms with self-tuning non-local spatial information for image segmentation , 2013, Neurocomputing.

[20]  Stéphane Canu,et al.  Kernel functions in Takagi-Sugeno-Kang fuzzy system with nonsingleton fuzzy input , 2013, 2013 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[21]  Jorge Marx Gómez,et al.  COMPENSATORY LOGIC: A FUZZY NORMATIVE MODEL FOR DECISION MAKING , 2003 .

[22]  Marie Cottrell,et al.  Missing values : processing with the Kohonen algorithm , 2007, ArXiv.

[23]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[24]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[25]  Witold Pedrycz,et al.  Fuzzy clustering with a knowledge-based guidance , 2004, Pattern Recognit. Lett..

[26]  Tomas Eklund,et al.  Fuzzy Clustering of the Self-Organizing Map: Some Applications on Financial Time Series , 2011, WSOM.

[27]  Tomas Eklund,et al.  Financial performance analysis of European banks using a fuzzified Self-Organizing Map , 2013, Int. J. Knowl. Based Intell. Eng. Syst..

[28]  Frédéric Alexandre,et al.  Self-organizing Map Initialization , 2005, ICANN.

[29]  Vladimir Estivill-Castro,et al.  Why so many clustering algorithms: a position paper , 2002, SKDD.

[30]  Didier Dubois Fuzzy sets and systems , 1980 .

[31]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[32]  Melody Y. Kiang,et al.  Extending the Kohonen self-organizing map networks for clustering analysis , 2002 .

[33]  A. Evans,et al.  MRI simulation-based evaluation of image-processing and classification methods , 1999, IEEE Transactions on Medical Imaging.

[34]  Robert LIN,et al.  NOTE ON FUZZY SETS , 2014 .

[35]  Virginia L. Ballarin,et al.  Arithmetic Mean Based Compensatory Fuzzy Logic , 2011, Int. J. Comput. Intell. Appl..

[36]  E. G. Mansoori,et al.  Fuzzy Rule Based Clustering for Gene Expression Data , 2013, 2013 4th International Conference on Intelligent Systems, Modelling and Simulation.

[37]  Georg Pölzlbauer Survey and Comparison of Quality Measures for Self-Organizing Maps , 2004 .

[38]  Eghbal G. Mansoori,et al.  FRBC: A Fuzzy Rule-Based Clustering Algorithm , 2011, IEEE Transactions on Fuzzy Systems.

[39]  N GhazaviSean,et al.  Medical data mining by fuzzy modeling with selected features , 2008 .

[40]  Peter Sarlin Exploiting the self-organizing financial stability map , 2013, Eng. Appl. Artif. Intell..

[41]  Ashfaqur Rahman,et al.  Cluster-Oriented Ensemble Classifier: Impact of Multicluster Characterization on Ensemble Classifier Learning , 2012, IEEE Transactions on Knowledge and Data Engineering.

[42]  Klaus Pawelzik,et al.  Quantifying the neighborhood preservation of self-organizing feature maps , 1992, IEEE Trans. Neural Networks.

[43]  Enrique H. Ruspini,et al.  A New Approach to Clustering , 1969, Inf. Control..

[44]  Juha Vesanto,et al.  Data exploration process based on the self-organizing map , 2002 .

[45]  Paulo Cortez,et al.  Modeling wine preferences by data mining from physicochemical properties , 2009, Decis. Support Syst..

[46]  Novruz Allahverdi,et al.  Rule extraction from trained adaptive neural networks using artificial immune systems , 2009, Expert Syst. Appl..