An Improved Profile-Based CF Scheme with Privacy

Traditional collaborative filtering (CF) systems widely employing k-nearest neighbor (kNN) algorithms mostly attempt to alleviate the contemporary problem of information overload by generating personalized predictions for items that users might like. Unlike their popularity and extensive usage, they suffer from several problems. First, with increasing number of users and/or items, scalability becomes a challenge. Second, as the number of ratable items increases and number of ratings provided by each individual remains as a tiny fraction, CF systems suffer from sparsity problem. Third, many schemes fail to protect private data referred to as privacy problem. Due to such problems, accuracy and online performance become worse. In this paper, we propose two preprocessing schemes to overcome scalability and sparsity problems. First, we suggest using a novel content-based profiling of users to estimate similarities on a reduced data for better performance. Second, we propose pseudo-prediction protocol to help CF systems surmount sparsity. We finally propose to use randomization methods to preserve individual users' confidential data, where we show that our proposed preprocessing schemes can be applied to perturbed data. We analyze our schemes in terms of privacy. To investigate their effects on accuracy and performance, we perform real databased experiments. Empirical results demonstrate that our preprocessing schemes improve both performance and accuracy.

[1]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[2]  John F. Canny,et al.  Collaborative filtering with privacy via factor analysis , 2002, SIGIR '02.

[3]  Zhang Liang,et al.  A hybrid approach to collaborative filtering for overcoming data sparsity , 2008, 2008 9th International Conference on Signal Processing.

[4]  Tiejian Luo,et al.  A novel approach to solve the sparsity problem in collaborative filtering , 2010, 2010 International Conference on Networking, Sensing and Control (ICNSC).

[5]  Feng Zhang,et al.  A Collaborative Filtering Algorithm Employing Genetic Clustering to Ameliorate the Scalability Issue , 2006, 2006 IEEE International Conference on e-Business Engineering (ICEBE'06).

[6]  Liu Chao,et al.  Using Temporal Information to Improve Predictive Accuracy of Collaborative Filtering Algorithms , 2010, 2010 12th International Asia-Pacific Web Conference.

[7]  Xinyuan Liang Reasoning Algorithm of Multi-Value Fuzzy Causality Diagram Based on Unitizing Coefficient , 2007 .

[8]  Minyi Guo,et al.  An Efficient Collaborative Filtering Approach Using Smoothing and Fusing , 2009, 2009 International Conference on Parallel Processing.

[9]  Hiroo Sekiya,et al.  Collaborative filtering based on an iterative prediction method to alleviate the sparsity problem , 2009, iiWAS.

[10]  Huseyin Polat,et al.  Privacy-Preserving Eigentaste-Based Collaborative Filtering , 2007, IWSEC.

[11]  Ahmet Arslan,et al.  A collaborative filtering method based on artificial immune network , 2009, Expert Syst. Appl..

[12]  Douglas M. Blough,et al.  Privacy Preserving Collaborative Filtering Using Data Obfuscation , 2007, 2007 IEEE International Conference on Granular Computing (GRC 2007).

[13]  Gang Chen,et al.  Collaborative Filtering Using Orthogonal Nonnegative Matrix Tri-factorization , 2007, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007).

[14]  Ferda Nur Alpaslan,et al.  Using Social Networks to Solve Data Sparsity Problem in One-Class Collaborative Filtering , 2010, 2010 Seventh International Conference on Information Technology: New Generations.

[15]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[16]  Huseyin Polat,et al.  Providing Naïve Bayesian Classifier-Based Private Recommendations on Partitioned Data , 2007, PKDD.

[17]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[18]  Tsvi Kuflik,et al.  Enhancing privacy and preserving accuracy of a distributed collaborative filtering , 2007, RecSys '07.

[19]  Wenliang Du,et al.  Privacy-Preserving Collaborative Filtering , 2005, Int. J. Electron. Commer..

[20]  Kwang-Seok Hong,et al.  Improving Prediction Accuracy Using Entropy Weighting in Collaborative Filtering , 2009, 2009 Symposia and Workshops on Ubiquitous, Autonomic and Trusted Computing.