PyRecGym: a reinforcement learning gym for recommender systems
暂无分享,去创建一个
Elias Z. Tragos | Neil J. Hurley | Aonghus Lawlor | Makbule Gulcin Ozsoy | Bichen Shi | James Geraci | Barry Smyth | N. Hurley | B. Smyth | E. Tragos | Bichen Shi | A. Lawlor | Makbule Gülçin Özsoy | James Geraci
[1] Jianhui Chen,et al. Efficient Ordered Combinatorial Semi-Bandits for Whole-Page Recommendation , 2017, AAAI.
[2] Jung-Woo Ha,et al. Reinforcement Learning based Recommender System using Biclustering Technique , 2018, ArXiv.
[3] David Cortes,et al. Adapting multi-armed bandits policies to contextual bandits scenarios , 2018, ArXiv.
[4] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[5] Jiliang Tang,et al. Reinforcement Learning for Online Information Seeking , 2018, ArXiv.
[6] Jun Tan,et al. Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation , 2018, KDD.
[7] Nicholas Jing Yuan,et al. DRN: A Deep Reinforcement Learning Framework for News Recommendation , 2018, WWW.
[8] Qingyun Wu,et al. Learning Contextual Bandits in a Non-stationary Environment , 2018, SIGIR.
[9] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.
[10] Jiliang Tang,et al. Model-Based Reinforcement Learning for Whole-Chain Recommendations , 2019, ArXiv.
[11] Alexandros Karatzoglou,et al. RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising , 2018, ArXiv.