"Fulfilling the Needs of Gray-Sheep Users in Recommender Systems, A Clustering Solution"

Recommender systems apply data mining techniques to filter unseen information and can predict whether a user would like a given item. This paper focuses on the gray-sheep users problem, which is responsible for an increased error rate in collaborative-filtering-based recommender system algorithms. The main contribution of this paper lies in showing that (1) the presence of gray-sheep users can affect the performance (accuracy and coverage) of collaborative-filtering-based algorithms, depending on the data sparsity and distribution; (2) gray-sheep users can be identified offline using clustering algorithms, where the similarity threshold that isolates these users from the rest of the clusters can be found empirically; and (3) the content-based profiles of gray-sheep users can be used to make accurate recommendations. The effectiveness of the proposed algorithm is tested on the MovieLens dataset and on the community of movie fans on the FilmTrust website, using mean absolute error, receiver operating characteristic sensitivity, and coverage.
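The threshold-based isolation step in contribution (2) can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes that cluster centroids have already been produced offline by some clustering algorithm, that cosine similarity is the similarity measure, and that a user is flagged as gray-sheep when the similarity to every centroid falls below the threshold. The names `cosine`, `find_gray_sheep`, `profiles`, and `centroids`, the toy ratings, and the threshold value 0.6 are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length rating vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty profile is treated as similar to nothing
    return dot / (norm_u * norm_v)


def find_gray_sheep(profiles, centroids, threshold):
    """Return the ids of users whose best centroid similarity is below threshold."""
    gray = []
    for user_id, ratings in profiles.items():
        best = max(cosine(ratings, c) for c in centroids)
        if best < threshold:
            gray.append(user_id)
    return sorted(gray)


# Toy data (hypothetical): each vector holds ratings on four items, 0 = unrated.
profiles = {
    "u1": [5, 4, 0, 0],
    "u2": [4, 5, 0, 0],
    "u3": [0, 0, 5, 4],
    "u4": [5, 0, 0, 5],  # agrees only partly with either cluster
}
# Assumed output of an offline clustering run with two clusters.
centroids = [[4.5, 4.5, 0, 0], [0, 0, 5, 4]]

gray_sheep = find_gray_sheep(profiles, centroids, threshold=0.6)
print(gray_sheep)
```

In this sketch the threshold is a free parameter; the abstract states that it is found empirically, so in practice it would be tuned on held-out data rather than fixed in advance.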
