Interval-valued fuzzy set approach to fuzzy co-clustering for data classification

Data clustering is aimed at discovering a structure in data. The revealed structure is usually represented in terms of prototypes and partition matrices. In some cases, the prototypes are simultaneously formed using data and features by running a co-clustering (bi-clustering) algorithm. Interval valued fuzzy clustering exhibits advantages when handling uncertainty. This study introduces a novel clustering technique by combining fuzzy co-clustering approach and interval-valued fuzzy sets in which two values of the fuzzifier of the fuzzy clustering algorithm are used to form the footprint of uncertainty (FOU). The study demonstrates the performance of the proposed method through a series of experiments completed for various datasets (including color segmentation, multi-spectral image classification, and document categorization). The experiments quantify the quality of results with the aid of validity indices and visual inspection. Some comparative analysis is also covered.

[1]  Madasu Hanmandlu,et al.  A non-extensive entropy feature and its application to texture classification , 2013, Neurocomputing.

[2]  Yang Yan,et al.  Fuzzy semi-supervised co-clustering for text documents , 2013, Fuzzy Sets Syst..

[3]  Witold Pedrycz,et al.  Towards hybrid clustering approach to data classification: Multiple kernels based interval-valued Fuzzy C-Means algorithms , 2015, Fuzzy Sets Syst..

[4]  Witold Pedrycz,et al.  Interval Type-2 fuzzy C-Means approach to collaborative clustering , 2015, 2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[5]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[6]  Hamido Fujita,et al.  The synergistic combination of particle swarm optimization and fuzzy sets to design granular classifier , 2015, Knowl. Based Syst..

[7]  Hamido Fujita,et al.  Hierarchical cluster ensemble model based on knowledge granulation , 2016, Knowl. Based Syst..

[8]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[9]  William-Chandra Tjhi,et al.  Possibilistic fuzzy co-clustering of large document collections , 2007, Pattern Recognit..

[10]  Yejun Xu,et al.  Weak transitivity of interval-valued fuzzy relations , 2014, Knowl. Based Syst..

[11]  Pierpaolo D'Urso,et al.  Fuzzy c-ordered medoids clustering for interval-valued data , 2016, Pattern Recognit..

[12]  Shie-Jue Lee,et al.  Data-Based System Modeling Using a Type-2 Fuzzy Neural Network With a Hybrid Learning Algorithm , 2011, IEEE Transactions on Neural Networks.

[13]  Inderjit S. Dhillon,et al.  Information-theoretic co-clustering , 2003, KDD '03.

[14]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[15]  Witold Pedrycz,et al.  Semi-supervising Interval Type-2 Fuzzy C-Means clustering with spatial information for multi-spectral satellite image classification and change detection , 2015, Comput. Geosci..

[16]  William-Chandra Tjhi,et al.  A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data , 2008, Fuzzy Sets Syst..

[17]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  A T de CarvalhoFrancisco de,et al.  Fuzzy K-means clustering algorithms for interval-valued data based on adaptive quadratic distances , 2010 .

[19]  Furu Wei,et al.  Constrained co-clustering for textual documents , 2010, AAAI 2010.

[20]  William-Chandra Tjhi,et al.  A partitioning based algorithm to fuzzy co-cluster documents and words , 2006, Pattern Recognit. Lett..

[21]  Horng-Lin Shieh A Hybrid Fuzzy Clustering Method with a Robust Validity Index , 2014 .

[22]  O. Colot,et al.  Color Image Segmentation using Type-2 Fuzzy Sets , 2006, 2006 1ST IEEE International Conference on E-Learning in Industrial Electronics.

[23]  Jerry M. Mendel,et al.  Operations on type-2 fuzzy sets , 2001, Fuzzy Sets Syst..

[24]  Jianying Hu,et al.  Regularized Co-Clustering with Dual Supervision , 2008, NIPS.

[25]  K. K. Bhoyar,et al.  Color Image Segmentation using Fast Fuzzy C-Means Algorithm , 2010 .

[26]  Myriam Regattieri Delgado,et al.  Land Cover Classification Based on General Type-2 Fuzzy Classifiers , 2008 .

[27]  Yuchi Kanzawa Comparison of imputation strategies in FNM-based and RFCM-based fuzzy co-clustering , 2012, The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems.

[28]  Jerry M. Mendel,et al.  On clarifying some definitions and notations used for type-2 fuzzy sets as well as some recommended changes , 2016, Inf. Sci..

[29]  A. Bovik,et al.  A universal image quality index , 2002, IEEE Signal Processing Letters.

[30]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[31]  Raghu Krishnapuram,et al.  Fuzzy co-clustering of documents and keywords , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[32]  Long Thanh Ngo,et al.  An interval Type-2 Fuzzy Subtractive Clustering approach to obstacle detection of robot vision using RGB-D camera , 2014, Int. J. Hybrid Intell. Syst..

[33]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[34]  Huifang Ma,et al.  Orthogonal Nonnegative Matrix Tri-factorization for Semi-supervised Document Co-clustering , 2010, PAKDD.

[35]  Jerry M. Mendel,et al.  Advances in type-2 fuzzy sets and systems , 2007, Inf. Sci..

[36]  Oscar Castillo,et al.  A review on type-2 fuzzy logic applications in clustering, classification and pattern recognition , 2014, Appl. Soft Comput..

[37]  Mohammad Hossein Fazel Zarandi,et al.  A new cluster validity measure based on general type-2 fuzzy sets: Application in gene expression data clustering , 2014, Knowl. Based Syst..

[38]  Junzo Watada,et al.  A genetic type-2 fuzzy C-means clustering approach to M-FISH segmentation , 2014, J. Intell. Fuzzy Syst..

[39]  Francisco de A. T. de Carvalho,et al.  Fuzzy K-means clustering algorithms for interval-valued data based on adaptive quadratic distances , 2010, Fuzzy Sets Syst..