Hybrid recommendations by content-aligned Bayesian personalized ranking

ABSTRACT In many application domains of recommender systems, content-based (CB) information are available for users, objects or both. CB information plays an important role in the process of recommendation, especially in cold-start scenarios, where the volume of feedback data is low. However, CB information may come from several, possibly external, sources varying in reliability, coverage or relevance to the recommending task. Therefore, each content source or attribute possess a different level of informativeness, which should be taken into consideration during the process of recommendation. In this paper, we propose a Content-Aligned Bayesian Personalized Ranking Matrix Factorization method (CABPR), extending Bayesian Personalized Ranking Matrix Factorization (BPR) by incorporating multiple sources of content information into the BPR’s optimization procedure. The working principle of CABPR is to calculate user-to-user and object-to-object similarity matrices based on the content information and penalize differences in latent factors of closely related users’ or objects’. CABPR further estimates relevance of similarity matrices as a part of the optimization procedure. CABPR method is a significant extension of a previously published BPR_MCA method, featuring additional variants of optimization criterion and improved optimization procedure. Four variants of CABPR were evaluated on two publicly available datasets: MovieLens 1M dataset, extended by data from IMDB, DBTropes and ZIP code statistics and LOD-RecSys dataset extended by the information available from DBPedia. Experiments shown that CABPR significantly improves over standard BPR as well as BPR_MCA method w.r.t. several cold-start scenarios.

[1]  Mu Zhu,et al.  Content‐boosted matrix factorization techniques for recommender systems , 2012, Stat. Anal. Data Min..

[2]  Tommaso Di Noia,et al.  Linked Open Data-Enabled Recommender Systems: ESWC 2014 Challenge on Book Recommendation , 2014, SemWebEval@ESWC.

[3]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[4]  Peter Vojtás,et al.  Recommending for Disloyal Customers with Low Consumption Rate , 2014, SOFSEM.

[5]  Domonkos Tikk,et al.  Recommending new movies: even a few ratings are more valuable than metadata , 2009, RecSys '09.

[6]  Alexander J. Smola,et al.  Improving maximum margin matrix factorization , 2008, Machine Learning.

[7]  Luis M. de Campos,et al.  Combining content-based and collaborative recommendations: A hybrid approach based on Bayesian networks , 2010, Int. J. Approx. Reason..

[8]  Peter Vojtás,et al.  Using Implicit Preference Relations to Improve Recommender Systems , 2017, Journal on Data Semantics.

[9]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[10]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[11]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[12]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[13]  Wu-Jun Li,et al.  TagiCoFi: tag informed collaborative filtering , 2009, RecSys '09.

[14]  Pasquale Lops,et al.  Content-based Recommender Systems: State of the Art and Trends , 2011, Recommender Systems Handbook.

[15]  Shou-De Lin,et al.  A Content-Based Matrix Factorization Model for Recipe Recommendation , 2014, PAKDD.

[16]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18]  Marcin Andrychowicz,et al.  Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[19]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[20]  Yann LeCun,et al.  Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Malte Kiesel,et al.  DBTropes - a Linked Data Wrapper Approach Incorporating Community Feedback , 2010, EKAW.

[22]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[23]  Jakub Lokoc,et al.  Product Exploration based on Latent Visual Attributes , 2017, CIKM.

[24]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[25]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[26]  Derek Bridge,et al.  Product-Seeded and Basket-Seeded Recommendations for Small-Scale Retailers , 2016, Journal on Data Semantics.

[27]  Ning Qian,et al.  On the momentum term in gradient descent learning algorithms , 1999, Neural Networks.

[28]  Alistair A. Young,et al.  Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2017, MICCAI 2017.

[29]  Dietmar Jannach,et al.  Using graded implicit feedback for bayesian personalized ranking , 2014, RecSys '14.

[30]  A KonstanJoseph,et al.  The MovieLens Datasets , 2015 .

[31]  Heri Ramampiaro,et al.  Content-Based Social Recommendation with Poisson Matrix Factorization , 2017, ECML/PKDD.

[32]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[33]  Yngve Lamo,et al.  Extended Content-boosted Matrix Factorization Algorithm for Recommender Systems , 2014, KES.

[34]  Christopher C. Johnson Logistic Matrix Factorization for Implicit Feedback Data , 2014 .

[35]  Martin J. Wainwright,et al.  Randomized smoothing for (parallel) stochastic optimization , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[36]  Mu Zhu,et al.  Content-boosted matrix factorization for recommender systems: experiments with recipe recommendation , 2011, RecSys '11.

[37]  Joseph A. Konstan,et al.  Evaluating recommender behavior for new users , 2014, RecSys '14.

[38]  Ladislav Peska Linking Content Information with Bayesian Personalized Ranking via Multiple Content Alignments , 2017, HT.

[39]  Zheng Wang,et al.  Bayesian network based business information retrieval model , 2008, Knowledge and Information Systems.

[40]  Suju Rajan,et al.  Beyond clicks: dwell time for personalization , 2014, RecSys '14.

[41]  SmolaAlex,et al.  Improving maximum margin matrix factorization , 2008 .

[42]  Saul Vargas,et al.  Novelty and Diversity in Recommender Systems , 2015, Recommender Systems Handbook.

[43]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.