Deterministic Policy Gradient Algorithms
暂无分享,去创建一个
Guy Lever | Martin A. Riedmiller | David Silver | Daan Wierstra | Nicolas Heess | Thomas Degris | D. Silver | N. Heess | Daan Wierstra | T. Degris | Guy Lever | David Silver
[1] Richard S. Sutton,et al. A Menu of Designs for Reinforcement Learning Over Time , 1995 .
[2] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[3] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[4] Sham M. Kakade,et al. A Natural Policy Gradient , 2001, NIPS.
[5] Richard S. Sutton,et al. Comparing Policy-Gradient Algorithms , 2001 .
[6] Jeff G. Schneider,et al. Covariant policy search , 2003, IJCAI 2003.
[7] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[8] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[9] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[10] Peter Szabó,et al. Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods , 2005, NIPS.
[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[12] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[13] Shalabh Bhatnagar,et al. Incremental Natural Actor-Critic Algorithms , 2007, NIPS.
[14] Shalabh Bhatnagar,et al. Fast gradient-descent methods for temporal-difference learning with linear function approximation , 2009, ICML '09.
[15] Shalabh Bhatnagar,et al. Toward Off-Policy Learning Control with Function Approximation , 2010, ICML.
[16] Jan Peters,et al. Policy Gradient Methods , 2010, Encyclopedia of Machine Learning.
[17] Martin A. Riedmiller,et al. Reinforcement learning in feedback control , 2011, Machine Learning.
[18] Gang Niu,et al. Analysis and Improvement of Policy Gradient Estimation , 2011, NIPS.
[19] Martha White,et al. Linear Off-Policy Actor-Critic , 2012, ICML.
[20] Patrick M. Pilarski,et al. Model-Free reinforcement learning with continuous action in practice , 2012, 2012 American Control Conference (ACC).
[21] Yee Whye Teh,et al. Actor-Critic Reinforcement Learning with Energy-Based Policies , 2012, EWRL.