暂无分享,去创建一个
David Jensen | John Foley | Emma Tosch | Kaleigh Clary | David D. Jensen | Emma Tosch | Kaleigh Clary | John Foley
[1] Elliot Meyerson,et al. Frame Skip Is a Powerful Parameter for Learning to Play Atari , 2015, AAAI Workshop: Learning for General Competency in Video Games.
[2] Romain Laroche,et al. Hybrid Reward Architecture for Reinforcement Learning , 2017, NIPS.
[3] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.
[4] Helge J. Ritter,et al. Modularization of End-to-End Learning: Case Study in Arcade Games , 2019, ArXiv.
[5] Jonathan Dodge,et al. Visualizing and Understanding Atari Agents , 2017, ICML.
[6] Razvan Pascanu,et al. Overcoming catastrophic forgetting in neural networks , 2016, Proceedings of the National Academy of Sciences.
[7] Silvio Savarese,et al. Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[10] Marlos C. Machado,et al. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents , 2017, J. Artif. Intell. Res..
[11] Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
[12] Alexei A. Efros,et al. Investigating Human Priors for Playing Video Games , 2018, ICML.
[13] Marlos C. Machado,et al. Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents (Extended Abstract) , 2018, IJCAI.
[14] Nan Jiang,et al. The Dependence of Effective Planning Horizon on Model Accuracy , 2015, AAMAS.
[15] Tim Salimans,et al. Learning Montezuma's Revenge from a Single Demonstration , 2018, ArXiv.
[16] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[17] John Schulman,et al. Gotta Learn Fast: A New Benchmark for Generalization in RL , 2018, ArXiv.
[18] Andre Cohen,et al. An object-oriented representation for efficient reinforcement learning , 2008, ICML '08.
[19] Patrick Gallinari,et al. Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization , 2012, ECML/PKDD.
[20] Scott M. Jordan. Using Cumulative Distribution Based Performance Analysis to Benchmark Models , 2018 .
[21] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[22] Sandy H. Huang,et al. Adversarial Attacks on Neural Network Policies , 2017, ICLR.
[23] Emma Brunskill,et al. Strategic Object Oriented Reinforcement Learning , 2018, ArXiv.
[24] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[25] Dileep George,et al. Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive Physics , 2017, ICML.
[26] Carlos Guestrin,et al. Generalizing plans to new environments in relational MDPs , 2003, IJCAI 2003.
[27] John Foley,et al. Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments , 2019, ArXiv.
[28] Elman Mansimov,et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.
[29] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[30] Shimon Whiteson,et al. TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning , 2017, ICLR.
[31] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.
[32] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.
[33] Andrew Zisserman,et al. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.
[34] S. Levine,et al. Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games ? , 2018 .
[35] Thore Graepel,et al. Re-evaluating evaluation , 2018, NeurIPS.
[36] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[37] Marc G. Bellemare,et al. Investigating Contingency Awareness Using Atari 2600 Games , 2012, AAAI.
[38] Rui Wang,et al. Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions , 2019, ArXiv.
[39] Richard Evans,et al. Deep Reinforcement Learning in Large Discrete Action Spaces , 2015, 1512.07679.
[40] Andreas Krause,et al. Learning programs from noisy data , 2016, POPL.
[41] Joelle Pineau,et al. Natural Environment Benchmarks for Reinforcement Learning , 2018, ArXiv.
[42] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[43] Amos J. Storkey,et al. Exploration by Random Network Distillation , 2018, ICLR.
[44] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[45] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.