A Greedy Fuzzy k-Member Co-clustering Algorithm and Collaborative Filtering Applicability

This chapter proposes a new algorithm for privacy-preserving collaborative filtering that extends the conventional crisp k-member co-clustering model into a fuzzy variant. The conventional method anonymizes a co-occurrence data matrix by clustering objects into crisp object clusters in conjunction with fuzzy item membership estimation, whereas the new algorithm constructs the anonymized data matrix from a fuzzy partition of objects. Because a fuzzy partition is expected to be robust against outliers and to extract homogeneous clusters, the proposed method can achieve k-anonymization with less information loss than the conventional crisp one. The applicability of the proposed algorithm to a collaborative filtering task is demonstrated in a numerical experiment with a real-world purchase history data set.
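To make the idea of k-anonymization by clustering concrete, the following is a minimal sketch of a crisp greedy k-member grouping on a small binary co-occurrence matrix. It is not the chapter's algorithm: the fuzzy object partition and the fuzzy item membership estimation are omitted, and the Hamming distance, the seed choice, the replacement of rows by cluster means, and the names `greedy_k_member` and `anonymize` are all assumptions made for illustration.

```python
def hamming(a, b):
    """Number of positions where two binary rows differ."""
    return sum(x != y for x, y in zip(a, b))


def greedy_k_member(rows, k):
    """Greedily partition row indices into clusters of at least k members.

    Assumption: the seed is simply the first unassigned row, and
    closeness is measured against the seed only.
    """
    if len(rows) < k:
        raise ValueError("need at least k rows")
    unassigned = list(range(len(rows)))
    clusters = []
    while len(unassigned) >= k:
        seed = unassigned.pop(0)
        cluster = [seed]
        while len(cluster) < k:
            # Add the unassigned row that is closest to the seed.
            nearest = min(unassigned, key=lambda i: hamming(rows[seed], rows[i]))
            unassigned.remove(nearest)
            cluster.append(nearest)
        clusters.append(cluster)
    # Fewer than k rows remain: attach each to the cluster with the closest seed.
    for i in unassigned:
        best = min(clusters, key=lambda c: hamming(rows[c[0]], rows[i]))
        best.append(i)
    return clusters


def anonymize(rows, clusters):
    """Replace every row by the mean row of its cluster (illustrative choice)."""
    n_items = len(rows[0])
    out = [None] * len(rows)
    for c in clusters:
        mean = [sum(rows[i][j] for i in c) / len(c) for j in range(n_items)]
        for i in c:
            out[i] = mean
    return out


# Toy co-occurrence matrix: 5 objects (rows) x 4 items (columns).
rows = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 1, 1, 1],
    [1, 0, 0, 0],
]
k = 2
clusters = greedy_k_member(rows, k)
anon = anonymize(rows, clusters)
```

After anonymization, every row is identical to at least k - 1 other rows, which is the k-anonymity condition. The gap between the original and anonymized rows is one way to picture the information loss that the fuzzy variant aims to reduce.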
