FCMF: Federated collective matrix factorization for heterogeneous collaborative filtering

Abstract Protecting users’ privacy has drawn tremendous attention from the community of recommender systems, i.e., both the original data and the learned model parameters should not be exposed. Federated learning is an emerging and promising paradigm, where a server collects gradients from multiple distributed parts and then updates the model parameters with the aggregated gradients. However, there are some security issues neglected in existing works. For example, the server may infer the users’ rating behaviors on the items via the received gradients. In this paper, we focus on heterogeneous collaborative filtering (HCF) by exploiting users’ different types of feedback such as 5-star numerical ratings and like/dislike binary ratings in a privacy-aware manner. Specifically, we design a novel and generic federated matrix factorization algorithm for HCF, i.e., federated collective matrix factorization (FCMF). The main goal of our FCMF is to leverage the heterogeneous feedback data to accurately estimate users’ preferences on the premise of protecting users’ private information. Therefore, we keep the original rating data and the users’ latent feature vectors locally, and choose the low sensitive items’ latent vectors as a bridge for joint training. Furthermore, we use homomorphic encryption and differential privacy to ensure the security of both participants in collective training. To study the effectiveness of our FCMF, we conduct extensive empirical studies on four real-world datasets and find that our FCMF is equivalent to the centralized method that aggregates the heterogeneous data in one single place. Moreover, the introduction of homomorphic encryption and differential privacy do not affect the recommendation accuracy much.

[1]  Fei Chen,et al.  Federated Meta-Learning with Fast Convergence and Efficient Communication , 2018 .

[2]  Kuan Eeik Tan,et al.  Federated Collaborative Filtering for Privacy-Preserving Personalized Recommendation System , 2019, ArXiv.

[3]  Alejandro Bellogín,et al.  Content-based recommendation in social tagging systems , 2010, RecSys '10.

[4]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[5]  Stefano Battiston,et al.  A model of a trust-based recommendation system on a social network , 2006, Autonomous Agents and Multi-Agent Systems.

[6]  Peter Richtárik,et al.  Federated Optimization: Distributed Machine Learning for On-Device Intelligence , 2016, ArXiv.

[7]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[8]  John Riedl,et al.  E-Commerce Recommendation Applications , 2004, Data Mining and Knowledge Discovery.

[9]  Roksana Boreli,et al.  Applying Differential Privacy to Matrix Factorization , 2015, RecSys.

[10]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.

[11]  Benjamin Schrauwen,et al.  Deep content-based music recommendation , 2013, NIPS.

[12]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[13]  Naixue Xiong,et al.  Cold-Start Recommendation Using Bi-Clustering and Fusion for Large-Scale Social Recommender Systems , 2014, IEEE Transactions on Emerging Topics in Computing.

[14]  Kunal Talwar,et al.  Mechanism Design via Differential Privacy , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[15]  Jiahui Liu,et al.  Personalized news recommendation based on click behavior , 2010, IUI '10.

[16]  Daniel A. Spielman,et al.  Spectral Graph Theory and its Applications , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[17]  Chang Hui Research on Privacy-Preserving Collaborative Filtering Recommendation Based on Distributed Data , 2006 .

[18]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[19]  Zhong Ming,et al.  Interaction-Rich Transfer Learning for Collaborative Filtering with Heterogeneous User Feedback , 2014, IEEE Intelligent Systems.

[20]  Cynthia Dwork,et al.  Differential Privacy: A Survey of Results , 2008, TAMC.

[21]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[22]  Jun Wang,et al.  Facilitating Privacy-preserving Recommendation-as-a-Service with Machine Learning , 2018, CCS.

[23]  Zhong Ming,et al.  Collaborative Recommendation with Multiclass Preference Context , 2017, IEEE Intelligent Systems.

[24]  Guangquan Zhang,et al.  A Cross-Domain Recommender System With Kernel-Induced Knowledge Transfer for Overlapping Entities , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Lior Rokach,et al.  Recommender Systems Handbook , 2010 .

[26]  Pim Tuyls,et al.  Efficient Binary Conversion for Paillier Encrypted Values , 2006, EUROCRYPT.

[27]  P Xiong,et al.  A Survey on Differential Privacy and Applications , 2014 .

[28]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[29]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[30]  Kai Chen,et al.  Secure Federated Matrix Factorization , 2019, IEEE Intelligent Systems.

[31]  Data Sharing via Differentially Private Coupled Matrix Factorization , 2020, ACM Trans. Knowl. Discov. Data.