暂无分享,去创建一个
Wojciech Zaremba | Jie Tang | John Schulman | Jonas Schneider | Greg Brockman | Vicki Cheung | Ludwig Pettersson | J. Schulman | Wojciech Zaremba | Greg Brockman | Vicki Cheung | Ludwig Pettersson | Jonas Schneider | Jie Tang
[1] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] Brian Tanner,et al. RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments , 2009, J. Mach. Learn. Res..
[4] Petr Baudis,et al. PACHI: State of the Art Open Source Go Program , 2011, ACG.
[5] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[6] Christos Dimitrakakis,et al. The reinforcement learning competition , 2014 .
[7] Christos Dimitrakakis,et al. The Reinforcement Learning Competition 2014 , 2014, AI Mag..
[8] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[9] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[10] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[11] Alborz Geramifard,et al. RLPy: a value-function-based reinforcement learning framework for education and research , 2015, J. Mach. Learn. Res..
[12] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[13] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[14] Wojciech Jaskowski,et al. ViZDoom: A Doom-based AI research platform for visual reinforcement learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).