Using mixture models for collaborative filtering

A collaborative filtering system at an e-commerce site or similar service uses data about aggregate user behavior to make recommendations tailored to specific user interests. We develop recommendation algorithms with provable performance guarantees in a probabilistic mixture model for collaborative filtering proposed by Hofmann and Puzicha. We identify certain novel parameters of mixture models that are closely connected with the best achievable performance of a recommendation algorithm; we show that for any system in which these parameters are bounded, it is possible to give recommendations whose quality converges to optimal as the amount of data grows. All our bounds depend on a new measure of independence that can be viewed as an L"1-analogue of the smallest singular value of a matrix. Using this, we introduce a technique based on generalized pseudoinverse matrices and linear programming for handling sets of high-dimensional vectors. We also show that standard approaches based on L"2 spectral methods are not strong enough to yield comparable results, thereby suggesting some inherent limitations of spectral analysis.

[1]  Alan M. Frieze,et al.  Existence and Construction of Edge-Disjoint Paths on Expander Graphs , 1994, SIAM J. Comput..

[2]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[3]  Mark Sandler,et al.  On the use of linear programming for unsupervised text classification , 2005, KDD '05.

[4]  Paul Resnick,et al.  Recommender systems , 1997, CACM.

[5]  Ravi Kumar,et al.  Recommendation systems: a probabilistic analysis , 1998, Proceedings 39th Annual Symposium on Foundations of Computer Science (Cat. No.98CB36280).

[6]  Prabhakar Raghavan,et al.  Competitive recommendation systems , 2002, STOC '02.

[7]  Gene H. Golub,et al.  Matrix computations , 1983 .

[8]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[9]  Anna R. Karlin,et al.  Spectral analysis of data , 2001, STOC '01.

[10]  Jon M. Kleinberg,et al.  Convergent algorithms for collaborative filtering , 2003, EC '03.

[11]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[12]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[13]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[14]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[15]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[16]  John Riedl,et al.  PolyLens: A recommender system for groups of user , 2001, ECSCW.