Debiased Off-Policy Evaluation for Recommendation Systems
暂无分享,去创建一个
[1] Masatoshi Uehara,et al. Minimax Weight and Q-Function Learning for Off-Policy Evaluation , 2019, ICML.
[2] Yao Liu,et al. Representation Balancing MDPs for Off-Policy Policy Evaluation , 2018, NeurIPS.
[3] Mehrdad Farajtabar,et al. More Robust Doubly Robust Off-policy Evaluation , 2018, ICML.
[4] Lihong Li,et al. Learning from Logged Implicit Exploration Data , 2010, NIPS.
[5] Nan Jiang,et al. Doubly Robust Off-policy Value Evaluation for Reinforcement Learning , 2015, ICML.
[6] Tie-Yan Liu,et al. LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.
[7] Shota Yasui,et al. Efficient Counterfactual Learning from Bandit Feedback , 2019, AAAI.
[8] Wei Chu,et al. A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.
[9] Masatoshi Uehara,et al. Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes , 2019, J. Mach. Learn. Res..
[10] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[11] Miroslav Dudík,et al. Optimal and Adaptive Off-policy Evaluation in Contextual Bandits , 2016, ICML.
[12] Philip S. Thomas,et al. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning , 2016, ICML.
[13] Thomas Nedelec,et al. Offline A/B Testing for Recommender Systems , 2018, WSDM.
[14] J. Robins,et al. Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .
[15] J. Robins,et al. Semiparametric regression estimation in the presence of dependent censoring , 1995 .
[16] Sergey Levine,et al. Off-Policy Evaluation via Off-Policy Classification , 2019, NeurIPS.
[17] J. Robins,et al. Locally Robust Semiparametric Estimation , 2016, Econometrica.
[18] Wei Chu,et al. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms , 2010, WSDM '11.
[19] Whitney K. Newey,et al. Cross-fitting and fast remainder rates for semiparametric estimation , 2017, 1801.09138.
[20] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.
[21] John Langford,et al. Off-policy evaluation for slate recommendation , 2016, NIPS.
[22] John Langford,et al. Doubly Robust Policy Evaluation and Optimization , 2014, ArXiv.