暂无分享,去创建一个
Damien Ernst | Raphaël Fonteneau | Vincent François-Lavet | D. Ernst | R. Fonteneau | Vincent François-Lavet
[1] Bart De Schutter,et al. Reinforcement Learning and Dynamic Programming Using Function Approximators , 2010 .
[2] Yishay Mansour,et al. A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes , 1999, Machine Learning.
[3] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.
[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[5] P. B. Coaker,et al. Applied Dynamic Programming , 1964 .
[6] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[7] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[8] Nan Jiang,et al. The Dependence of Effective Planning Horizon on Model Accuracy , 2015, AAMAS.
[9] Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
[10] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[11] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[12] Stuart E. Dreyfus,et al. Applied Dynamic Programming , 1965 .
[13] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[14] Peter Stone,et al. Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.
[15] Martin A. Riedmiller. Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method , 2005, ECML.
[16] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.
[17] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[18] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[19] E B Ebbesen,et al. Cognitive and attentional mechanisms in delay of gratification. , 1972, Journal of personality and social psychology.
[20] Geoffrey J. Gordon,et al. Approximate solutions to markov decision processes , 1999 .