Privacy-Preserving Collaborative Filtering

Collaborative filtering (CF) techniques are becoming very popular on the Internet and are widely used in several domains to cope with information overload. E-commerce sites use filtering systems to recommend products to customers based on the preferences of like-minded customers, but their systems do not protect user privacy. Because users concerned about privacy may give false information, it is not easy to collect high-quality user data for collaborative filtering, and recommendation systems using poor data produce inaccurate recommendations. This means that privacy measures are key to the success of collecting high-quality data and providing accurate recommendations. This article discusses collaborative filtering with privacy based on both correlation and singular-value decomposition (SVD) and proposes the use of randomized perturbation techniques to protect user privacy while producing reasonably accurate recommendations. Such techniques add randomness to the original data, preventing the data collector (the server) from learning private user data, but this scheme can still provide accurate recommendations. Experiments were conducted with real datasets to evaluate the overall performance of the proposed scheme. The results were used for analysis of how different parameters affect accuracy. Collaborative filtering systems using randomized perturbation techniques were found to provide accurate recommendations while preserving user privacy.

[1]  Lorrie Faith Cranor,et al.  'I didn't buy it for myself' privacy and ecommerce personalization , 2003, WPES '03.

[2]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[3]  Bart Selman,et al.  Referral Web: combining social networks and collaborative filtering , 1997, CACM.

[4]  Jayant R. Haritsa,et al.  Maintaining Data Privacy in Association Rule Mining , 2002, VLDB.

[5]  Mark S. Ackerman,et al.  Beyond Concern: Understanding Net Users' Attitudes About Online Privacy , 1999, ArXiv.

[6]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[7]  Kenneth Y. Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.

[8]  John Riedl,et al.  Explaining collaborative filtering recommendations , 2000, CSCW '00.

[9]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[10]  Michael K. Reiter,et al.  Crowds: anonymity for Web transactions , 1998, TSEC.

[11]  Charu C. Aggarwal,et al.  On the design and quantification of privacy preserving data mining algorithms , 2001, PODS.

[12]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[13]  Daniel Lemire,et al.  Scale and Translation Invariant Collaborative Filtering Systems , 2004, Information Retrieval.

[14]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[15]  Oded Goldreich,et al.  A randomized protocol for signing contracts , 1985, CACM.

[16]  Paul F. Syverson,et al.  Anonymous connections and onion routing , 1998, IEEE J. Sel. Areas Commun..

[17]  John F. Canny,et al.  Collaborative filtering with privacy via factor analysis , 2002, SIGIR '02.

[18]  Eric Horvitz,et al.  Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach , 2000, UAI.

[19]  Loren Terveen,et al.  PHOAKS: a system for sharing recommendations , 1997, CACM.

[20]  Gilles Brassard,et al.  All-or-Nothing Disclosure of Secrets , 1986, CRYPTO.

[21]  John F. Canny,et al.  Collaborative filtering with privacy , 2002, Proceedings 2002 IEEE Symposium on Security and Privacy.

[22]  Alexandre V. Evfimievski,et al.  Randomization in privacy preserving data mining , 2002, SKDD.

[23]  Moni Naor,et al.  Oblivious transfer and polynomial evaluation , 1999, STOC '99.

[24]  Michael J. Pazzani,et al.  Improvement of Collaborative Filtering with the Simple Bayesian Classifier 1 , 2002 .

[25]  Alexandre V. Evfimievski,et al.  Privacy preserving mining of association rules , 2002, Inf. Syst..

[26]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[27]  Yoav Shoham,et al.  Content-Based, Collaborative Recommendation. , 1997 .

[28]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[29]  Yehuda Lindell,et al.  Privacy Preserving Data Mining , 2002, Journal of Cryptology.

[30]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[31]  Michael J. Pazzani,et al.  Collaborative Filtering with the Simple Bayesian Classifier , 2000, PRICAI.

[32]  Silvio Micali,et al.  Computationally Private Information Retrieval with Polylogarithmic Communication , 1999, EUROCRYPT.