暂无分享,去创建一个
Rémi Munos | Olivier Pietquin | David Budden | Matteo Hessel | Todd Hester | Bilal Piot | Mohammad Gheshlaghi Azar | Hado van Hasselt | Tobias Pohlen | Gabriel Barth-Maron | Dan Horgan | John Quan | Mel Vecerík | Dan Horgan | Tobias Pohlen | D. Budden | Gabriel Barth-Maron | R. Munos | Matteo Hessel | H. V. Hasselt | Bilal Piot | M. G. Azar | John Quan | Todd Hester | O. Pietquin | Mel Vecerík
[1] Frederick R. Forst,et al. On robust estimation of the location parameter , 1980 .
[2] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[3] Stefan Schaal,et al. Robot Learning From Demonstration , 1997, ICML.
[4] Joelle Pineau,et al. Learning from Limited Demonstrations , 2013, NIPS.
[5] Matthieu Geist,et al. Boosted Bellman Residual Minimization Handling Expert Demonstrations , 2014, ECML/PKDD.
[6] Alessandro Lazaric,et al. Direct Policy Iteration with Demonstrations , 2015, IJCAI.
[7] Nan Jiang,et al. The Dependence of Effective Planning Horizon on Model Accuracy , 2015, AAMAS.
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[10] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[11] David Silver,et al. Learning values across many orders of magnitude , 2016, NIPS.
[12] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[13] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[14] Martin A. Riedmiller,et al. Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards , 2017, ArXiv.
[15] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[16] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.
[17] Peter Stone,et al. TD Learning with Constrained Gradients , 2018 .
[18] Jianfeng Gao,et al. BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems , 2016, AAAI.
[19] Marcin Andrychowicz,et al. Overcoming Exploration in Reinforcement Learning with Demonstrations , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).
[20] Tom Schaul,et al. Deep Q-learning From Demonstrations , 2017, AAAI.
[21] Marc G. Bellemare,et al. The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning , 2017, ICLR.
[22] Xi Chen,et al. Learning From Demonstration in the Wild , 2018, 2019 International Conference on Robotics and Automation (ICRA).