Probabilistic Relevance Models for Collaborative Filtering

Implicit acquisition of user preferences from user logs makes log-based collaborative filtering favorable in practice for recommendations. In this paper 1 , we follow a formal approach in text retrieval to reformulate the problem. Based on the classic probability ranking principle, we propose a probabilistic user-item relevance model. Under this formal model, we show that user-based and item-based approaches are only two different factorizations with different independence assumptions. Moreover, we show that smoothing is an important aspect to estimate the parameters of the models due to data sparsity. By adding linear interpolation smoothing, the proposed model gives a probabilistic justification of using TF£IDFlike item ranking in collaborative filtering. Besides giving the insight understanding of the problem of collaborative filtering, we also show experiments in which the proposed method provides a better recommendation performance on a music play-list data set.

[1]  Jun Wang,et al.  Distributed collaborative filtering for peer-to-peer file sharing systems , 2006, SAC.

[2]  David A. Hull Using statistical testing in the evaluation of retrieval experiments , 1993, SIGIR.

[3]  ChengXiang Zhai,et al.  Probabilistic Relevance Models Based on Document and Query Generation , 2003 .

[4]  Eric Horvitz,et al.  Collaborative Filtering by Personality Diagnosis: A Hybrid Memory and Model-Based Approach , 2000, UAI.

[5]  Benjamin M. Marlin,et al.  Collaborative Filtering: A Machine Learning Perspective , 2004 .

[6]  George Karypis,et al.  Evaluation of Item-Based Top-N Recommendation Algorithms , 2001, CIKM '01.

[7]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[8]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[9]  Qiang Yang,et al.  Scalable collaborative filtering using cluster-based smoothing , 2005, SIGIR '05.

[10]  Djoerd Hiemstra,et al.  Term-specific smoothing for the language modeling approach to information retrieval: the importance of a query term , 2002, SIGIR '02.

[11]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[12]  David Madigan,et al.  On the Naive Bayes Model for Text Categorization , 2003, AISTATS.

[13]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[14]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[15]  John F. Canny,et al.  Collaborative filtering with privacy via factor analysis , 2002, SIGIR '02.

[16]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[17]  Mark Claypool,et al.  Implicit interest indicators , 2001, IUI '01.