Non-local Self-attention Structure for Function Approximation in Deep Reinforcement Learning
暂无分享,去创建一个
Qing Li | Shu-Tao Xia | Yao Yao | Xi Xiao | Guangwu Hu | Zhixiang Wang | Dianyan Zhang | Zhendong Peng
[1] Elman Mansimov,et al. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation , 2017, NIPS.
[2] Jakob Uszkoreit,et al. A Decomposable Attention Model for Natural Language Inference , 2016, EMNLP.
[3] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[4] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[5] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.
[6] Wojciech Jaskowski,et al. ViZDoom: A Doom-based AI research platform for visual reinforcement learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).
[7] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[8] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[9] Christos Dimitrakakis,et al. TORCS, The Open Racing Car Simulator , 2005 .
[10] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[11] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[12] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[13] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[14] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[15] Takeru Miyato,et al. cGANs with Projection Discriminator , 2018, ICLR.
[16] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.
[17] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[18] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.