A recommender system uses information from a user's past behavior to present items of interest to him. A fundamental problem in recommender systems is approximating a full user-item matrix where most of the entries are missing. The rows of the matrix represent the users and the columns represent the items. The entries indicate the plausibility that the user will enjoy the item. In this thesis the items are movies and the entries ratings. In this thesis I compare three statistical models that how a user will rate a movie. The first two are Bernoulli models that predict whether a rating is greater than three out of five. The first Bernoulli model uses logistic regression. The second Bernoulli model is a latent factor model. The third model extends the latent factor model to use a five class multinomial. A five class multinomial is chosen to predict a rating on a scale of one to five. The results show that latent factor model that uses a Bernoulli distribution has a better accuracy than a model trained by logistic regression. The latent factor model is extended to use a multinomial. The accuracy of variants of the multinomial model are evaluated. A technique to initialize the multinomial model is shown to improve the accuracy. However the accuracy is lower than other models used in the Netflix competition. The Bernoulli and multinomial latent factor models are compared against each other. The Bernoulli model is more accurate
[1]
Alan Agresti,et al.
Categorical Data Analysis
,
2003
.
[2]
J. Gill,et al.
Generalized Linear Models: A Unified Approach
,
2000
.
[3]
Inderjit S. Dhillon,et al.
Clustering with Bregman Divergences
,
2005,
J. Mach. Learn. Res..
[4]
Stephen P. Boyd,et al.
Convex Optimization
,
2004,
Algorithms and Theory of Computation Handbook.
[5]
Inderjit S. Dhillon,et al.
A generalized maximum entropy approach to bregman co-clustering and matrix approximation
,
2004,
J. Mach. Learn. Res..
[6]
Deepak Agarwal,et al.
Predictive discrete latent factor models for large scale dyadic data
,
2007,
KDD '07.
[7]
Eric R. Ziegel,et al.
Generalized Linear Models
,
2002,
Technometrics.
[8]
Inderjit S. Dhillon,et al.
Information-theoretic co-clustering
,
2003,
KDD '03.