暂无分享,去创建一个
[1] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[2] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[3] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[4] M.A. Wiering,et al. Reinforcement Learning in Continuous Action Spaces , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[5] Marco Wiering,et al. Opponent Modelling in the Game of Tron using Reinforcement Learning , 2018, ICAART.
[6] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[7] Guillaume Lample,et al. Playing FPS Games with Deep Reinforcement Learning , 2016, AAAI.
[8] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[9] Marco Wiering,et al. Connectionist reinforcement learning for intelligent unit micro management in StarCraft , 2011, The 2011 International Joint Conference on Neural Networks.
[10] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[11] Hado van Hasselt,et al. Double Q-learning , 2010, NIPS.
[12] Marco Wiering. QV(λ)-learning: A New On-policy Reinforcement Learning Algorithm , 2005 .
[13] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[14] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[15] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[18] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[19] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[20] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[21] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..