A collaborative filtering system at an e-commerce site or similar service uses data about aggregate user behavior to make recommendations tailored to specific user interests. We develop recommendation algorithms with provable performance guarantees in a probabilistic mixture model for collaborative filtering proposed by Hofmann and Puzicha. We identify certain novel parameters of mixture models that are closely connected with the best achievable performance of a recommendation algorithm; we show that for any system in which these parameters are bounded, it is possible to give recommendations whose quality converges to optimal as the amount of data grows. All our bounds depend on a new measure of independence that can be viewed as an L1-analogue of the smallest singular value of a matrix. Using this, we introduce a technique based on generalized pseudoinverse matrices and linear programming for handling sets of high-dimensional vectors. We also show that standard approaches based on L2-spectral methods are not strong enough to yield comparable results, thereby suggesting some inherent limitations of spectral analysis.
[1] Paul Resnick et al. Recommender systems. CACM, 1997.
[2] David Heckerman et al. Empirical Analysis of Predictive Algorithms for Collaborative Filtering. UAI, 1998.
[3] Andrew McCallum et al. Distributional clustering of words for text classification. SIGIR '98, 1998.
[4] Thomas Hofmann et al. Latent Class Models for Collaborative Filtering. IJCAI, 1999.
[5] Geoffrey J. McLachlan et al. Finite Mixture Models. Annual Review of Statistics and Its Application, 2019.
[6] Anna R. Karlin et al. Spectral analysis of data. STOC '01, 2001.
[7] Ravi Kumar et al. Recommendation Systems. 2001.
[8] Prabhakar Raghavan et al. Competitive recommendation systems. STOC '02, 2002.
[9] Ron Bekkerman et al. Distributional clustering of words for text categorization. 2003.
[10] Jon M. Kleinberg et al. Convergent algorithms for collaborative filtering. EC '03, 2003.
[11] Greg Linden et al. Amazon.com Recommendations: Item-to-Item Collaborative Filtering. 2001.