Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games?
暂无分享,去创建一个
Jacob Andreas | Jon M. Kleinberg | Quoc V. Le | Robert D. Kleinberg | Alex Irpan | Maithra Raghu | A. Irpan | M. Raghu | J. Kleinberg | Jacob Andreas
[1] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[2] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[3] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[4] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[5] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[6] Joel H. Spencer,et al. Randomization, Derandomization and Antirandomization: Three Games , 1994, Theor. Comput. Sci..
[7] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[8] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[9] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[10] Balaraman Ravindran,et al. EPOpt: Learning Robust Neural Network Policies Using Model Ensembles , 2016, ICLR.
[11] Bruno Bouzy,et al. Multi-agent Learning Experiments on Repeated Matrix Games , 2010, ICML.
[12] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[13] Paul Erdös,et al. On a Combinatorial Game , 1973, J. Comb. Theory A.
[14] Michael H. Bowling,et al. The lemonade stand game competition: solving unsolvable games , 2011, SECO.
[15] Neil Burch,et al. Heads-up limit hold’em poker is solved , 2015, Science.
[16] Benjamin Recht,et al. Simple random search provides a competitive approach to reinforcement learning , 2018, ArXiv.