Two-Way Latent Grouping Model for User Preference Prediction

We introduce a novel latent grouping model for predicting the relevance of a new document to a user. The model assumes a latent group structure for both users and documents. We compare the model against a state-of-the-art method, the User Rating Profile model, in which only users have a latent group structure. We estimate both models by Gibbs sampling. The new method predicts relevance more accurately for new documents that have few known ratings. The reason is that generalization over documents then becomes necessary, and hence the two-way grouping is profitable.
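To make the two-way idea concrete, the following is a minimal sketch and not the estimation procedure used in the paper. It assumes that group membership probabilities for a user and a document, and a table of per-group-pair relevance probabilities, are already available (for example, as averages over Gibbs samples). The names predict_relevance, user_group_probs, doc_group_probs, and theta are hypothetical and chosen only for illustration.

```python
def predict_relevance(user_group_probs, doc_group_probs, theta):
    """Marginalize relevance over latent user and document groups.

    Assumed (hypothetical) inputs:
      user_group_probs[k] = P(user belongs to user group k)
      doc_group_probs[l]  = P(document belongs to document group l)
      theta[k][l]         = P(relevant | user group k, document group l)
    Returns sum_k sum_l user_group_probs[k] * doc_group_probs[l] * theta[k][l].
    """
    total = 0.0
    for k, p_user in enumerate(user_group_probs):
        for l, p_doc in enumerate(doc_group_probs):
            total += p_user * p_doc * theta[k][l]
    return total


# Illustrative values only: two user groups and two document groups.
theta = [[0.9, 0.2],
         [0.1, 0.7]]

# A user firmly in group 0 and a new document whose group is still uncertain
# because it has few known ratings.
p_new_doc = predict_relevance([1.0, 0.0], [0.5, 0.5], theta)
```

In this sketch, a document with few ratings still receives a usable prediction because it borrows the relevance table of the document groups it may belong to; a model that groups only users has no comparable way to pool information across documents.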
