Fuzzy Double Clustering: A Robust Proposal

In this paper a robust fuzzy methodology for simultaneously clustering objects and variables is proposed. Starting from Double k-Means, different fuzzy generalizations for categorical multivariate data have been proposed in literature which are not appropriate for heterogeneous two-mode datasets, especially if outliers occur. In practice, in these cases, the existing fuzzy procedures do not recognize them. In order to overcome that inconvenience and to take into account a certain amount of outlying observations a new fuzzy approach with noise clusters for the objects and variables is introduced and discussed.

[1]  Sadaaki Miyamoto,et al.  Fuzzy c-means as a regularization and maximum entropy approach , 1997 .

[2]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[3]  Raghu Krishnapuram,et al.  Fuzzy co-clustering of documents and keywords , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[4]  William-Chandra Tjhi,et al.  Fuzzy co-clustering of Web documents , 2005, 2005 International Conference on Cyberworlds (CW'05).

[5]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[6]  Rajesh N. Davé,et al.  Characterization and detection of noise in clustering , 1991, Pattern Recognit. Lett..

[7]  William-Chandra Tjhi,et al.  Dual Fuzzy-Possibilistic Coclustering for Categorization of Documents , 2009, IEEE Transactions on Fuzzy Systems.

[8]  Vichi Maurizio Double k-means Clustering for Simultaneous Classification of Objects and Variables , 2001 .

[9]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .