暂无分享,去创建一个
Bo Li | Qing Wang | Tong Zhang | Yang Zheng | Lei Han | Ji Liu | Peng Sun | Yongsheng Liu | Han Liu | Jiechao Xiong | Xinghai Sun | Ji Liu | T. Zhang | Peng Sun | Xinghai Sun | Lei Han | J. Xiong | Qing Wang | Bo Li | Yang Zheng | Yongsheng Liu | Han Liu | Jiechao Xiong
[1] Peter Norvig,et al. Artificial Intelligence: A Modern Approach , 1995 .
[2] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[3] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[4] Ian Millington,et al. Artificial Intelligence for Games , 2006, The Morgan Kaufmann series in interactive 3D technology.
[5] Anthony Brabazon,et al. Evolving Behaviour Trees for the Mario AI Competition Using Grammatical Evolution , 2011, EvoApplications.
[6] Santiago Ontañón,et al. A Survey of Real-Time Strategy Game AI Research and Competition in StarCraft , 2013, IEEE Transactions on Computational Intelligence and AI in Games.
[7] Petter Ögren,et al. Towards a unified behavior trees framework for robot control , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[8] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[9] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[10] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[11] Alex Graves,et al. Strategic Attentive Writer for Learning Macro-Actions , 2016, NIPS.
[12] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[13] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[14] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[15] Tom Schaul,et al. Unifying Count-Based Exploration and Intrinsic Motivation , 2016, NIPS.
[16] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[17] Wojciech Jaskowski,et al. ViZDoom: A Doom-based AI research platform for visual reinforcement learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).
[18] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[19] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[20] Nicolas Usunier,et al. Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks , 2016, ArXiv.
[21] David Churchill. Heuristic Search Techniques for Real-Time Strategy Games , 2016 .
[22] Tom Schaul,et al. FeUdal Networks for Hierarchical Reinforcement Learning , 2017, ICML.
[23] Doina Precup,et al. The Option-Critic Architecture , 2016, AAAI.
[24] Matthew E. Taylor,et al. Autonomous Extracting a Hierarchical Structure of Tasks in Reinforcement Learning and Multi-task Reinforcement Learning , 2017, ArXiv.
[25] Romain Laroche,et al. Hybrid Reward Architecture for Reinforcement Learning , 2017, NIPS.
[26] Yuandong Tian,et al. ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games , 2017, NIPS.
[27] Tom Schaul,et al. StarCraft II: A New Challenge for Reinforcement Learning , 2017, ArXiv.
[28] Peng Peng,et al. Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games , 2017, 1703.10069.
[29] Ali Farhadi,et al. Target-driven visual navigation in indoor scenes using deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[30] Yuandong Tian,et al. Training Agent for First-Person Shooter Game with Actor-Critic Curriculum Learning , 2016, ICLR.
[31] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[32] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[33] Sergey Levine,et al. (CAD)$^2$RL: Real Single-Image Flight without a Single Real Image , 2016, Robotics: Science and Systems.
[34] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[35] Pieter Abbeel,et al. Meta Learning Shared Hierarchies , 2017, ICLR.
[36] Razvan Pascanu,et al. Relational Deep Reinforcement Learning , 2018, ArXiv.