暂无分享,去创建一个
Yuval Tassa | Martin A. Riedmiller | David Silver | Ziyu Wang | Nicolas Heess | Srinivasan Sriram | Tom Erez | Greg Wayne | S. M. Ali Eslami | TB Dhruva | Josh Merel | Jay Lemmon | Ziyun Wang | D. Silver | N. Heess | Greg Wayne | T. Erez | Yuval Tassa | J. Merel | S. Eslami | TB Dhruva | S. Sriram | Jay Lemmon | David Silver | Tom Erez
[1] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[2] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[3] Aaron Hertzmann,et al. Robust physics-based locomotion using low-dimensional planning , 2010, ACM Trans. Graph..
[4] Yasemin Altun,et al. Relative Entropy Policy Search , 2010 .
[5] Z. Popovic,et al. Terrain-adaptive bipedal locomotion control , 2010, ACM Trans. Graph..
[6] Michiel van de Panne,et al. Curriculum Learning for Motor Skills , 2012, Canadian Conference on AI.
[7] Yuval Tassa,et al. Synthesis and stabilization of complex behaviors through online trajectory optimization , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[8] Baining Guo,et al. Terrain runner , 2012, ACM Trans. Graph..
[9] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[10] Zoran Popovic,et al. Discovery of complex behaviors through contact-invariant optimization , 2012, ACM Trans. Graph..
[11] Nicolas Pronost,et al. Interactive Character Animation Using Simulated Physics: A State‐of‐the‐Art Review , 2012, Comput. Graph. Forum.
[12] Sergey Levine,et al. Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics , 2014, NIPS.
[13] Yuval Tassa,et al. Learning Continuous Control Policies by Stochastic Value Gradients , 2015, NIPS.
[14] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[15] Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
[16] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[17] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[18] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[19] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[20] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[21] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[22] Glen Berseth,et al. Terrain-adaptive locomotion skills using deep reinforcement learning , 2016, ACM Trans. Graph..
[23] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[24] Yuval Tassa,et al. Learning and Transfer of Modulated Locomotor Controllers , 2016, ArXiv.
[25] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.
[26] Glen Berseth,et al. DeepLoco , 2017, ACM Trans. Graph..
[27] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.
[28] Sergey Levine,et al. Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[29] Yuval Tassa,et al. Data-efficient Deep Reinforcement Learning for Dexterous Manipulation , 2017, ArXiv.