Fuzzy co-clustering considering site-wise confidence of vertically partitioned cooccurrence data

In many real world data analysis tasks, it is expected to get much more useful knowledge by utilizing multiple databases stored in different organizations, such as cooperation groups, state organs and allied countries. However, in many such organizations, they often hesitate to publish their databases because of privacy and security issues although they believe the advantages of collaborative analysis. To utilize such multiple databases without fear of information leaks, a privacy preserving procedure was introduced to fuzzy clustering for categorical multivariate data (FCCM). This method deals with vertically partitioned databases, and several experimental results demonstrated that the method can contribute to revealing global intrinsic co-cluster structure rather than individual site-wise analysis. In this paper, site-wise confidence is introduced to the method to weaken harmful effects of insignificant sites for estimating stable intrinsic co-cluster structures. Site-wise confidence is measured with a degree of coincidence between site-wise object memberships and the global ones. Several experimental results show that site-wise confidence is useful for estimating reliable co-cluster structures.

[1]  D. T. Lee,et al.  Multi-party k-Means Clustering with Privacy Consideration , 2010, International Symposium on Parallel and Distributed Processing with Applications.

[2]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[3]  Philip S. Yu,et al.  Privacy-Preserving Data Mining - Models and Algorithms , 2008, Advances in Database Systems.

[4]  Katsuhiro Honda,et al.  Fuzzy Co-Clustering Induced by Multinomial Mixture Models , 2015, J. Adv. Comput. Intell. Intell. Informatics.

[5]  Sadaaki Miyamoto,et al.  Fuzzy c-means as a regularization and maximum entropy approach , 1997 .

[6]  Chris Clifton,et al.  Tools for privacy preserving distributed data mining , 2002, SKDD.

[7]  Hidetomo Ichihashi,et al.  FCM-type Cluster Validation in Fuzzy Co-Clustering and Collaborative Filtering Applicability , 2013 .

[8]  Katsuhiro Honda,et al.  A Collaborative Framework for Privacy Preserving Fuzzy Co-Clustering of Vertically Distributed Cooccurrence Matrices , 2015, Adv. Fuzzy Syst..

[9]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[10]  Katsuhiro Honda,et al.  FCM-type fuzzy co-clustering by K-L information regularization , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).