Integrating Topic Models and Latent Factors for Recommendation

The research of personalized recommendation techniques today has mostly parted into two mainstream directions, i.e., the factorization-based approaches and topic models. Practically, they aim to benefit from the numerical ratings and textual reviews, correspondingly, which compose two major information sources in various real-world systems. However, although the two approaches are supposed to be correlated for their same goal of accurate recommendation, there still lacks a clear theoretical understanding of how their objective functions can be mathematically bridged to leverage the numerical ratings and textual reviews collectively, and why such a bridge is intuitively reasonable to match up their learning procedures for the rating prediction and top-N recommendation tasks, respectively. In this work, we exposit with mathematical analysis that, the vector-level randomization functions to coordinate the optimization objectives of factorizational and topic models unfortunately do not exist at all, although they are usually pre-assumed and intuitively designed in the literature. Fortunately, we also point out that one can avoid the seeking of such a randomization function by optimizing a Joint Factorizational Topic (JFT) model directly. We apply our JFT model to restaurant recommendation, and study its performance in both normal and cross-city recommendation scenarios, where the latter is an extremely difficult task for its inherent cold-start nature. Experimental results on real-world datasets verified the appealing performance of our approach against previous methods, on both rating prediction and top-N recommendation tasks.

[1]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[2]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[3]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[4]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[5]  Deepak Agarwal,et al.  fLDA: matrix factorization through latent dirichlet allocation , 2010, WSDM '10.

[6]  Domonkos Tikk,et al.  Investigation of Various Matrix Factorization Methods for Large Recommender Systems , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[7]  Iryna Gurevych,et al.  Beyond the stars: exploiting free-text user reviews to improve the accuracy of movie recommendations , 2009, TSA@CIKM.

[8]  Han Zhao,et al.  SoF: Soft-Cluster Matrix Factorization for Probabilistic Clustering , 2015, AAAI.

[9]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[10]  Geoffrey J. Gordon,et al.  A Unified View of Matrix Factorization Models , 2008, ECML/PKDD.

[11]  D. Bartholomew Latent Variable Models And Factor Analysis , 1987 .

[12]  Yongfeng Zhang,et al.  Browser-oriented universal cross-site recommendation and explanation based on user browsing logs , 2014, RecSys '14.

[13]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[14]  Junehwa Song,et al.  MovieCommenter: Aspect-based collaborative filtering by utilizing user comments , 2011, 7th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom).

[15]  John K. Debenham,et al.  Informed Recommender: Basing Recommendations on Consumer Product Reviews , 2007, IEEE Intelligent Systems.

[16]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[17]  Xu Chen,et al.  HLBPR: A Hybrid Local Bayesian Personal Ranking Method , 2016, WWW.

[18]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[19]  Yu He,et al.  The YouTube video recommendation system , 2010, RecSys '10.

[20]  Guokun Lai,et al.  Explicit factor models for explainable recommendation based on phrase-level sentiment analysis , 2014, SIGIR.

[21]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[22]  Stathes Hadjiefthymiades,et al.  Facing the cold start problem in recommender systems , 2014, Expert Syst. Appl..

[23]  Yiqun Liu,et al.  Boost Phrase-level Polarity Labelling with Review-level Sentiment Classification , 2015, ArXiv.

[24]  Yan Liu,et al.  Collaborative Topic Regression with Social Matrix Factorization for Recommendation Systems , 2012, ICML.

[25]  Abhinandan Das,et al.  Google news personalization: scalable online collaborative filtering , 2007, WWW '07.

[26]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[27]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[28]  Nathan Srebro,et al.  Fast maximum margin matrix factorization for collaborative prediction , 2005, ICML.

[29]  Michael R. Lyu,et al.  Ratings meet reviews, a combined approach to recommend , 2014, RecSys '14.

[30]  Yiqun Liu,et al.  Do users rate or review?: boost phrase-level sentiment labeling with review-level sentiment classification , 2014, SIGIR.

[31]  Anindya Datta,et al.  Using Adjective Features from User Reviews to Generate Higher Quality and Explainable Recommendations , 2012, Shaping the Future of ICT Research.

[32]  Shi Feng,et al.  Localized matrix factorization for recommendation based on matrix block diagonal forms , 2013, WWW.

[33]  Yiqun Liu,et al.  Task-based Recommendation on a Web-Scale , 2015 .

[34]  Tommi S. Jaakkola,et al.  Maximum-Margin Matrix Factorization , 2004, NIPS.

[35]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[36]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[37]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[38]  John Riedl,et al.  Recommender systems: from algorithms to user experience , 2012, User Modeling and User-Adapted Interaction.

[39]  Ram Dantu,et al.  Group Recommendation System for Facebook , 2008, OTM Workshops.

[40]  Guokun Lai,et al.  Daily-Aware Personalized Recommendation based on Feature-Level Time Series Analysis , 2015, WWW.

[41]  Xu Chen,et al.  Learning to Rank Features for Recommendation over Multiple Categories , 2016, SIGIR.

[42]  Lior Rokach,et al.  Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[43]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[44]  Yiqun Liu,et al.  Understanding the Sparsity: Augmented Matrix Factorization with Sampled Constraints on Unobservables , 2014, CIKM.

[45]  Yiqun Liu,et al.  Improve collaborative filtering through bordered block diagonal form matrices , 2013, SIGIR.

[46]  Jon Whittle,et al.  Free Text In User Reviews: Their Role In Recommender Systems , 2011 .

[47]  Yongfeng Zhang,et al.  Incorporating Phrase-level Sentiment Analysis on Textual Reviews for Personalized Recommendation , 2015, WSDM.

[48]  Yiqun Liu,et al.  A Unified Framework for Emotional Elements Extraction Based on Finite State Matching Machine , 2013, NLPCC.

[49]  Stan Lipovetsky,et al.  Latent Variable Models and Factor Analysis , 2001, Technometrics.

[50]  Yiqun Liu,et al.  Catch the Black Sheep: Unified Framework for Shilling Attack Detection Based on Fraudulent Action Propagation , 2015, IJCAI.