论文信息 - When Collaborative Filtering Meets Reinforcement Learning

When Collaborative Filtering Meets Reinforcement Learning

In this paper, we study a multi-step interactive recommendation problem, where the item recommended at current step may affect the quality of future recommendations. To address the problem, we develop a novel and effective approach, named CFRL, which seamlessly integrates the ideas of both collaborative filtering (CF) and reinforcement learning (RL). More specifically, we first model the recommender-user interactive recommendation problem as an agent-environment RL task, which is mathematically described by a Markov decision process (MDP). Further, to achieve collaborative recommendations for the entire user community, we propose a novel CF-based MDP by encoding the states of all users into a shared latent vector space. Finally, we propose an effective Q-network learning method to learn the agent's optimal policy based on the CF-based MDP. The capability of CFRL is demonstrated by comparing its performance against a variety of existing methods on real-world datasets.

Yu Lei | Wenjie Li | Wenjie Li | Yu Lei

[1] Jun Wang,et al. Interactive collaborative filtering , 2013, CIKM.

[2] Liang Zhang,et al. Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning , 2018, KDD.

[3] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[4] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5] Tat-Seng Chua,et al. Fast Matrix Factorization for Online Recommendation with Implicit Feedback , 2016, SIGIR.

[6] Jian Liu,et al. Playlist Recommendation Based on Reinforcement Learning , 2017, IFIP TC12 ICIS.

[7] Guy Shani,et al. An MDP-Based Recommender System , 2002, J. Mach. Learn. Res..

[8] Roberto Turrin,et al. Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[9] Sean M. McNee,et al. Getting to know you: learning new user preferences in recommender systems , 2002, IUI '02.

[10] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[11] Nicholas Jing Yuan,et al. DRN: A Deep Reinforcement Learning Framework for News Recommendation , 2018, WWW.

[12] Yehuda Koren,et al. Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[13] Lior Rokach,et al. Introduction to Recommender Systems Handbook , 2011, Recommender Systems Handbook.

[14] Liang Zhang,et al. Deep reinforcement learning for page-wise recommendations , 2018, RecSys.

[15] Carlos Eduardo R. de Mello,et al. Active learning driven by rating impact analysis , 2010, RecSys '10.

[16] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.