暂无分享,去创建一个
Peter Sunehag | Gabriel Dulac-Arnold | Richard Evans | Ben Coppin | Gabriel Dulac-Arnold | Peter Sunehag | Ben Coppin | Richard Evans | P. Sunehag
[1] Jason Pazis,et al. Generalized Value Functions for Large Action Sets , 2011, ICML.
[2] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[3] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[5] Michail G. Lagoudakis,et al. Reinforcement Learning as Classification: Leveraging Modern Classifiers , 2003, ICML.
[6] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[7] Martin A. Riedmiller,et al. Reinforcement learning in feedback control , 2011, Machine Learning.
[8] Marco Wiering,et al. Using continuous action spaces to solve discrete problems , 2009, 2009 International Joint Conference on Neural Networks.
[9] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[10] Peter Sunehag,et al. Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions , 2015, ArXiv.
[11] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[12] Patrick Gallinari,et al. Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization , 2012, ECML/PKDD.
[13] Jianfeng Gao,et al. Deep Reinforcement Learning with an Unbounded Action Space , 2015, ArXiv.
[14] David G. Lowe,et al. Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[15] Thomas G. Dietterich,et al. Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.