A Fuzzy Clustering Model for Fuzzy Data with Outliers

This paper proposes a fuzzy clustering model for fuzzy data with outliers. The model is based on Wasserstein distance between interval valued data, which is generalized to fuzzy data. In addition, Keller's approach is used to identify outliers and reduce their influences. The authors also define a transformation to change the distance to the Euclidean distance. With the help of this approach, the problem of fuzzy clustering of fuzzy data is reduced to fuzzy clustering of crisp data. In order to show the performance of the proposed clustering algorithm, two simulation experiments are discussed.

[1]  Haiyoung Lee A Cluster validity Index for Fuzzy Clustering , 1999 .

[2]  Yun Kyong Kim,et al.  Some properties of a new metric on the space of fuzzy numbers , 2004, Fuzzy Sets Syst..

[3]  Soon-H. Kwon Cluster validity index for fuzzy clustering , 1998 .

[4]  N. Nagaveni,et al.  An Ontology Based Model for Document Clustering , 2011, Int. J. Intell. Inf. Technol..

[5]  Mohsen Rostamy-Malkhalifeh,et al.  A Multi-Criteria Intuitionistic Fuzzy Group Decision Making Method for Supplier Selection with VIKOR Method , 2012, Int. J. Fuzzy Syst. Appl..

[6]  Carlo Bertoluzza,et al.  On a new class of distances between fuzzy numbers , 1995 .

[7]  Ravi Bhushan Mishra,et al.  Multi-Agent Negotiation in B2C E-Commerce Based on Data Mining Methods , 2010, Int. J. Intell. Inf. Technol..

[8]  Janusz Kacprzyk,et al.  Distances between intuitionistic fuzzy sets , 2000, Fuzzy Sets Syst..

[9]  Alison L Gibbs,et al.  On Choosing and Bounding Probability Metrics , 2002, math/0209021.

[10]  Rami Zwick,et al.  Measures of similarity among fuzzy concepts: A comparative analysis , 1987, Int. J. Approx. Reason..

[11]  Pierpaolo D'Urso,et al.  A robust fuzzy k-means clustering model for interval valued data , 2006, Comput. Stat..

[12]  Azriel Rosenfeld,et al.  Fuzzy Digital Topology , 1979, Inf. Control..

[13]  Jesse Hoey,et al.  Decision Theory Models for Applications in Artificial Intelligence: Concepts and Solutions , 2011 .

[14]  Miin-Shen Yang,et al.  Fuzzy clustering algorithms for mixed feature variables , 2004, Fuzzy Sets Syst..

[15]  Miin-Shen Yang,et al.  On a similarity measure between LR‐type fuzzy numbers and its application to database acquisition , 2005, Int. J. Intell. Syst..

[16]  Mohamed A. Ismail,et al.  Fuzzy clustering for symbolic data , 1998, IEEE Trans. Fuzzy Syst..

[17]  Peter M. A. Sloot,et al.  Complex Systems Modeling by Cellular Automata , 2009, Encyclopedia of Artificial Intelligence.

[18]  Chengyi Zhang,et al.  Similarity measures on three kinds of fuzzy sets , 2006, Pattern Recognit. Lett..

[19]  Antonio Irpino,et al.  Dynamic clustering of interval data using a Wasserstein-based distance , 2008, Pattern Recognit. Lett..

[20]  Isabelle Bloch,et al.  On fuzzy distances and their use in image processing under imprecision , 1999, Pattern Recognit..

[21]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[22]  Witold Pedrycz,et al.  Two nonparametric models for fusing heterogeneous fuzzy data , 1998, IEEE Trans. Fuzzy Syst..

[23]  V. Sugumaran The Inaugural Issue of the International Journal of Intelligent Information Technologies , 2005 .

[24]  K. Chidananda Gowda,et al.  Symbolic clustering using a new similarity measure , 1992, IEEE Trans. Syst. Man Cybern..

[25]  Lucien Duckstein,et al.  Comparison of fuzzy numbers using a fuzzy distance measure , 2002, Fuzzy Sets Syst..

[26]  Mike Bennett,et al.  Meaning Makers: User Generated Ambient Presence , 2009, Int. J. Ambient Comput. Intell..

[27]  Pierpaolo D'Urso,et al.  A weighted fuzzy c , 2006, Comput. Stat. Data Anal..

[28]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[29]  Edwin Diday,et al.  Symbolic clustering using a new dissimilarity measure , 1991, Pattern Recognit..

[30]  James M. Keller,et al.  Analysis and efficient implementation of a linguistic fuzzy c-means , 2002, IEEE Trans. Fuzzy Syst..

[31]  Miin-Shen Yang,et al.  On a class of fuzzy c-numbers clustering procedures for fuzzy data , 1996, Fuzzy Sets Syst..

[32]  A. Keller Fuzzy clustering with outliers , 2000, PeachFuzz 2000. 19th International Conference of the North American Fuzzy Information Processing Society - NAFIPS (Cat. No.00TH8500).

[33]  Nikolai Dahlem,et al.  OntoClippy: A User-Friendly Ontology Design and Creation Methodology , 2011, Int. J. Intell. Inf. Technol..

[34]  W. Näther On random fuzzy variables of second order and their application to linear statistical inference with fuzzy data , 2000 .

[35]  Kevin Curran,et al.  Pervasive and Ubiquitous Technology Innovations for Ambient Intelligence Environments , 2012 .

[36]  P. Kloeden,et al.  Metric Spaces Of Fuzzy Sets Theory And Applications , 1975 .

[37]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[38]  Witold Pedrycz,et al.  Advances in Fuzzy Clustering and its Applications , 2007 .

[39]  Vijayan Sugumaran Intelligent Information Technologies: Concepts, Methodologies, Tools and Applications , 2007 .

[40]  Hans-Jürgen Zimmermann,et al.  Fuzzy Set Theory - and Its Applications , 1985 .

[41]  C. Pappis,et al.  A comparative assessment of measures of similarity of fuzzy values , 1993 .

[42]  Ferenc Szeifert,et al.  Data-driven generation of compact, accurate, and linguistically sound fuzzy classifiers based on a decision-tree initialization , 2003, Int. J. Approx. Reason..

[43]  Przemyslaw Grzegorzewski,et al.  Distances between intuitionistic fuzzy sets and/or interval-valued fuzzy sets based on the Hausdorff metric , 2004, Fuzzy Sets Syst..

[44]  Qi Liu,et al.  A new similarity measure of generalized fuzzy numbers and its application to pattern recognition , 2004, Pattern Recognit. Lett..

[45]  Yi Li,et al.  A cluster validity index for fuzzy clustering , 2008, Inf. Sci..

[46]  Witold Pedrycz,et al.  A parametric model for fusing heterogeneous fuzzy data , 1996, IEEE Trans. Fuzzy Syst..

[47]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[48]  Miin-Shen Yang,et al.  Fuzzy clustering procedures for conical fuzzy vector data , 1999, Fuzzy Sets Syst..

[49]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Vijayan Sugumaran Organizational Efficiency through Intelligent Information Technologies , 2012 .

[51]  Marcelo H. Ang,et al.  Mobile Robots Navigation, Mapping, and Localization Part I , 2009, Encyclopedia of Artificial Intelligence.

[52]  Alejandro Pazos Sierra,et al.  Encyclopedia of Artificial Intelligence , 2008 .

[53]  M. Sato,et al.  Fuzzy clustering model for fuzzy data , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[54]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[55]  David L. Olson,et al.  Similarity measures between intuitionistic fuzzy (vague) sets: A comparative analysis , 2007, Pattern Recognit. Lett..

[56]  Epaminondas Panas,et al.  Functional Form, Elasticity and Lexical Richness: Estimates and Implications , 2012 .

[57]  Miin-Shen Yang,et al.  Fuzzy clustering on LR-type fuzzy numbers with an application in Taiwanese tea evaluation , 2005, Fuzzy Sets Syst..

[58]  Carlos A. Iglesias,et al.  The Agent-Oriented Methodology MAS-CommonKADS , 2005 .