Basic Consideration of Co-Clustering Based on Rough Set Theory

In the field of clustering, rough clustering, which is clustering based on rough set theory, is a promising approach for dealing with the certainty, possibility, and uncertainty of belonging of object to clusters. Generalized rough C-means (GRCM), which is a rough set-based extension of hard C-means (HCM; k-means), can extract the overlapped cluster structure by assigning objects to the upper areas of their relatively near clusters. Co-clustering is a useful technique for summarizing co-occurrence information between objects and items such as the frequency of keywords in documents and the purchase history of users. Fuzzy co-clustering induced by multinomial mixture models (FCCMM) is a statistical model-based co-clustering method and introduces a mechanism for adjusting the fuzziness degrees of both objects and items. In this paper, we propose a novel rough co-clustering method, rough co-clustering induced by multinomial mixture models (RCCMM), with reference to GRCM and FCCMM. RCCMM aims to appropriately extract the overlapped co-cluster structure inherent in co-occurrence information by considering the certainty, possibility, and uncertainty. Through numerical experiments, we verified whether the proposed method can appropriately extract the overlapped co-cluster structure.

[1]  Pawan Lingras,et al.  Interval Set Clustering of Web Users with Rough K-Means , 2004, Journal of Intelligent Information Systems.

[2]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[3]  François Yvon,et al.  Inference and evaluation of the multinomial mixture model for text clustering , 2006, Inf. Process. Manag..

[4]  Seiki Ubukata,et al.  A unified approach for cluster-wise and general noise rejection approaches for k-means clustering , 2019, PeerJ Comput. Sci..

[5]  Georg Peters,et al.  Some refinements of rough k-means clustering , 2006, Pattern Recognit..

[6]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[7]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[8]  Katsuhiro Honda,et al.  Fuzzy Co-Clustering Induced by Multinomial Mixture Models , 2015, J. Adv. Comput. Intell. Intell. Informatics.