Bayesian Probabilistic Multi-Topic Matrix Factorization for Rating Prediction

Recently, Local Matrix Factorization (LMF) [Lee et al., 2013] has been shown to be more effective than traditional matrix factorization for rating prediction. The core idea for LMF is to first partition the original matrix into several smaller submatrices, further exploit local structures of submatrices for better low-rank approximation. Various clustering-based methods with heuristic extensions have been proposed for LMF in the literature. To develop a more principled solution for LMF, this paper presents a Bayesian Probabilistic Multi-Topic Matrix Factorization model. We treat the set of the rated items by a user as a document, and employ latent topic models to cluster items as topics. Subsequently, a user has a distribution over the set of topics. We further set topic-specific latent vectors for both users and items. The final prediction is obtained by an ensemble of the results from the corresponding topic-specific latent vectors in each topic. Using a multi-topic latent representation, our model is more powerful to reflect the complex characteristics for users and items in rating prediction, and enhance the model interpretability. Extensive experiments on large real-world datasets demonstrate the effectiveness of the proposed model.

[1]  Alfred Kobsa,et al.  Proceedings of the 8th ACM Conference on Recommender systems , 2014, RecSys 2014.

[2]  Zoubin Ghahramani,et al.  Proceedings of the 24th international conference on Machine learning , 2007, ICML 2007.

[3]  J. Meigs,et al.  WHO Technical Report , 1954, The Yale Journal of Biology and Medicine.

[4]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[5]  Samy Bengio,et al.  Local collaborative ranking , 2014, WWW.

[6]  Guokun Lai,et al.  Explicit factor models for explainable recommendation based on phrase-level sentiment analysis , 2014, SIGIR.

[7]  Yoram Singer,et al.  Local Low-Rank Matrix Approximation , 2013, ICML.

[8]  Inderjit S. Dhillon,et al.  Information-theoretic co-clustering , 2003, KDD '03.

[9]  Arindam Banerjee,et al.  Bayesian Co-clustering , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[10]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[11]  原田 秀逸 私の computer 環境 , 1998 .

[12]  Andrei Broder,et al.  Proceedings of the 23rd International Conference on World Wide Web , 2014, WWW 2014.

[13]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[14]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[15]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[16]  Ameet Talwalkar,et al.  Divide-and-Conquer Matrix Factorization , 2011, NIPS.

[17]  Arkadiusz Paterek,et al.  Improving regularized singular value decomposition for collaborative filtering , 2007 .

[18]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[19]  Li Shang,et al.  WEMAREC: Accurate and Scalable Recommendation through Weighted and Ensemble Matrix Approximation , 2015, SIGIR.

[20]  Alexander J. Smola,et al.  ACCAMS: Additive Co-Clustering to Approximate Matrices Succinctly , 2014, WWW.

[21]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[22]  Yong Liu,et al.  Your neighbors affect your ratings: on geographical neighborhood influence to rating prediction , 2014, SIGIR.

[23]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[24]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[25]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[27]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[28]  Charles X. Ling,et al.  Clustering-based factorized collaborative filtering , 2013, RecSys.

[29]  Neil Yorke-Smith,et al.  LibRec: A Java Library for Recommender Systems , 2015, UMAP Workshops.