Exploring Deep Reinforcement Learning with Multi Q-Learning
[1] Demis Hassabis, et al. Mastering the game of Go with deep neural networks and tree search, 2016, Nature.
[2] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[3] David Silver, et al. Deep Reinforcement Learning with Double Q-Learning, 2015, AAAI.
[4] Hado van Hasselt, et al. Double Q-learning, 2010, NIPS.
[5] Alex Graves, et al. Playing Atari with Deep Reinforcement Learning, 2013, ArXiv.
[6] John N. Tsitsiklis, et al. Analysis of temporal-difference learning with function approximation, 1996, NIPS.
[7] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[8] Guigang Zhang, et al. Deep Learning, 2016, Int. J. Semantic Comput.
[9] Leemon C. Baird, et al. Residual Algorithms: Reinforcement Learning with Function Approximation, 1995, ICML.
[10] Martha White, et al. An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning, 2015, J. Mach. Learn. Res.
[11] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[12] Jason Weston, et al. A unified architecture for natural language processing: deep neural networks with multitask learning, 2008, ICML.
[13] Gerald Tesauro, et al. Temporal difference learning and TD-Gammon, 1995, CACM.
[14] Yuval Tassa, et al. Continuous control with deep reinforcement learning, 2015, ICLR.
[15] Xiaogang Wang, et al. Deep Learning Face Representation from Predicting 10,000 Classes, 2014, IEEE Conference on Computer Vision and Pattern Recognition.
[16] Andrew Zisserman, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014, ICLR.
[17] R. Sutton, et al. Gradient temporal-difference learning algorithms, 2011.