Deep hybrid recommender systems via exploiting document context and statistics of items

The sparsity of user-to-item rating data is one of the major obstacles to achieving high rating prediction accuracy of model-based collaborative filtering (CF) recommender systems. To overcome the obstacle, researchers proposed hybrid methods for recommender systems that exploit auxiliary information together with rating data. In particular, document modeling-based hybrid methods were recently proposed that additionally utilize description documents of items such as reviews, abstracts, or synopses in order to improve the rating prediction accuracy. However, they still have two following limitations on further improvements: (1) They ignore contextual information such as word order or surrounding words of a word because their document modeling methods use bag-of-words model. (2) They do not explicitly consider Gaussian noise differently in modeling latent factors of items based on description documents together with ratings although Gaussian noise depend on statistics of items.In this paper, we propose a robust document context-aware hybrid method, which integrates convolutional neural network (CNN) into probabilistic matrix factorization (PMF) with the statistics of items to both capture contextual information and consider Gaussian noise differently. Our extensive evaluations on three real-world dataset show that our variant recommendation models based on our proposed method significantly outperform the state-of-the-art recommendation models.

[1]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[2]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[3]  Scott Sanner,et al.  AutoRec: Autoencoders Meet Collaborative Filtering , 2015, WWW.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Michael R. Lyu,et al.  Ratings meet reviews, a combined approach to recommend , 2014, RecSys '14.

[6]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[7]  Martin Ester,et al.  Collaborative Denoising Auto-Encoders for Top-N Recommender Systems , 2016, WSDM.

[8]  Andrew McCallum,et al.  Ask the GRU: Multi-task Learning for Deep Text Recommendations , 2016, RecSys.

[9]  Zhen Lin,et al.  Topic tensor factorization for recommender system , 2016, Inf. Sci..

[10]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[11]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[12]  Enrique Herrera-Viedma,et al.  A model to represent users trust in recommender systems using ontologies and fuzzy linguistic modeling , 2015, Inf. Sci..

[13]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[14]  Hwanjo Yu,et al.  Improving top-K recommendation with truster and trustee relationship in user trust network , 2016, Inf. Sci..

[15]  Sheng Li,et al.  Deep Collaborative Filtering via Marginalized Denoising Auto-encoder , 2015, CIKM.

[16]  John Riedl,et al.  An Algorithmic Framework for Performing Collaborative Filtering , 1999, SIGIR Forum.

[17]  Yan Liu,et al.  Collaborative Topic Regression with Social Matrix Factorization for Recommendation Systems , 2012, ICML.

[18]  Ruslan Salakhutdinov,et al.  Bayesian probabilistic matrix factorization using Markov chain Monte Carlo , 2008, ICML '08.

[19]  Julian J. McAuley,et al.  VBPR: Visual Bayesian Personalized Ranking from Implicit Feedback , 2015, AAAI.

[20]  Dit-Yan Yeung,et al.  Collaborative Deep Learning for Recommender Systems , 2014, KDD.

[21]  Donghyun Kim,et al.  Convolutional Matrix Factorization for Document Context-Aware Recommendation , 2016, RecSys.

[22]  Benjamin Schrauwen,et al.  Deep content-based music recommendation , 2013, NIPS.

[23]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[24]  Enrique Herrera-Viedma,et al.  A quality based recommender system to disseminate information in a university digital library , 2014, Inf. Sci..

[25]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[26]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[27]  Enrique Herrera-Viedma,et al.  REFORE: A recommender system for researchers based on bibliometrics , 2015, Appl. Soft Comput..