How to Put Users in Control of their Data via Federated Pair-Wise Recommendation

Recommendation services are extensively adopted in several user-centered applications as a tool to alleviate the information overload problem and help users in orienteering in a vast space of possible choices. In such scenarios, privacy is a crucial concern since users may not be willing to share their sensitive preferences (e.g., visited locations, read books, bought items) with a central server. Unfortunately, data harvesting and collection is at the basis of modern, state-of-the-art approaches to recommendation. Decreased users' willingness to share personal information along with data minimization/protection policies (such as the European GDPR), can result in the "data scarcity" dilemma affecting data-intensive applications such as recommender systems (RS). We argue that scarcity of adequate data due to privacy concerns can severely impair the quality of learned models and, in the long term, result in a turnover and disloyal customers with direct consequences for lives, society, and businesses. To address these issues, we present FPL, an architecture in which users collaborate in training a central factorization model while controlling the amount of sensitive data leaving their devices. The proposed approach implements pair-wise learning to rank optimization by following the Federated Learning principles conceived originally to mitigate the privacy risks of traditional machine learning. We have conducted an extensive experimental evaluation on three Foursquare datasets and have verified the effectiveness of the proposed architecture concerning accuracy and beyond-accuracy objectives. We have analyzed the impact of communication cost with the central server on the system's performance, by varying the amount of local computation and training parallelism. Finally, we have carefully examined the impact of disclosed users' information on the quality of the final model and ...

[1]  Matthew D. Hoffman,et al.  Variational Autoencoders for Collaborative Filtering , 2018, WWW.

[2]  Simone Scardapane,et al.  Fully Decentralized Semi-supervised Learning via Privacy-preserving Matrix Completion , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[3]  Hubert Eichner,et al.  Towards Federated Learning at Scale: System Design , 2019, SysML.

[4]  Tommaso Di Noia,et al.  Local Popularity and Time in top-N Recommendation , 2018, ECIR.

[5]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[6]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[7]  Tsan-sheng Hsu,et al.  Privacy-Preserving Collaborative Recommender Systems , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[8]  Yehuda Koren,et al.  OrdRec: an ordinal model for predicting personalized item rating distributions , 2011, RecSys '11.

[9]  Hamed Zamani,et al.  Recommender Systems Fairness Evaluation via Generalized Cross Entropy , 2019, RMSE@RecSys.

[10]  Gediminas Adomavicius,et al.  Impact of data characteristics on recommender systems performance , 2012, TMIS.

[11]  Jakub Konecný,et al.  Federated Optimization: Distributed Optimization Beyond the Datacenter , 2015, ArXiv.

[12]  Pieter H. Hartel,et al.  Privacy in Recommender Systems , 2013, Social Media Retrieval.

[13]  Wenliang Du,et al.  Privacy-preserving collaborative filtering using randomized perturbation techniques , 2003, Third IEEE International Conference on Data Mining.

[14]  James Caverlee,et al.  Fairness-Aware Tensor-Based Recommendation , 2018, CIKM.

[15]  Qing Ling,et al.  Decentralized low-rank matrix completion , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[17]  Kuan Eeik Tan,et al.  Federated Collaborative Filtering for Privacy-Preserving Personalized Recommendation System , 2019, ArXiv.

[18]  Amar Saini,et al.  PrivateJobMatch: a privacy-oriented deferred multi-match recommender system for stable employment , 2019, RecSys.

[19]  Saul Vargas,et al.  Novelty and Diversity in Recommender Systems , 2015, Recommender Systems Handbook.

[20]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[21]  Robin Burke,et al.  Bias Disparity in Collaborative Recommendation: Algorithmic Evaluation and Comparison , 2019, RMSE@RecSys.

[22]  Rainer Gemulla,et al.  Distributed Matrix Completion , 2012, 2012 IEEE 12th International Conference on Data Mining.

[23]  Elias Z. Tragos,et al.  PDMFRec: a decentralised matrix factorisation with tunable user-centric privacy , 2019, RecSys.

[24]  Tommaso Di Noia,et al.  On the discriminative power of hyper-parameters in cross-validation and how to choose them , 2019, RecSys.

[25]  Tsvi Kuflik,et al.  Enhancing privacy and preserving accuracy of a distributed collaborative filtering , 2007, RecSys '07.

[26]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[27]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[28]  Debajyoti Mukhopadhyay,et al.  Role of Matrix Factorization Model in Collaborative Filtering Algorithm: A Survey , 2015, ArXiv.

[29]  Gert R. G. Lanckriet,et al.  Learning Content Similarity for Music Recommendation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Robin Burke,et al.  Balanced Neighborhoods for Fairness-Aware Collaborative Recommendation , 2017 .

[31]  Alan Hanjalic,et al.  List-wise learning to rank with matrix factorization for collaborative filtering , 2010, RecSys '10.

[32]  Martha Larson,et al.  BlurM(or)e: Revisiting Gender Obfuscation in the User-Item Matrix , 2019, RMSE@RecSys.

[33]  Agustí Verde Parera,et al.  General data protection regulation , 2018 .

[34]  Gediminas Adomavicius,et al.  Improving Aggregate Recommendation Diversity Using Ranking-Based Techniques , 2012, IEEE Transactions on Knowledge and Data Engineering.

[35]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[36]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[37]  Jiebo Luo,et al.  2016 Ieee International Conference on Big Data (big Data) Solving Cold-start Problem in Large-scale Recommendation Engines: a Deep Learning Approach , 2022 .

[38]  Peter Richtárik,et al.  Federated Optimization: Distributed Machine Learning for On-Device Intelligence , 2016, ArXiv.

[39]  Max Mühlhäuser,et al.  Efficient privacy-preserving recommendations based on social graphs , 2019, RecSys.

[40]  Richard Nock,et al.  Advances and Open Problems in Federated Learning , 2019, Found. Trends Mach. Learn..

[41]  Liang He,et al.  Evaluating recommender systems , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[42]  Daqing Zhang,et al.  Participatory Cultural Mapping Based on Collective Behavior Data in Location-Based Social Networks , 2016, ACM Trans. Intell. Syst. Technol..

[43]  Stefan Weiss The Need for a Paradigm Shift in Addressing Privacy Risks in Social Networking Applications , 2007, FIDIS.

[44]  Tommaso Di Noia,et al.  Towards Effective Device-Aware Federated Learning , 2019, AI*IA.