LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
暂无分享,去创建一个
Lei Han | Dacheng Tao | Ji Liu | Yali Du | Tianhong Dai | Meng Fang | Ji Liu | D. Tao | Meng Fang | Yali Du | Lei Han | Tianhong Dai
[1] Jun Wang,et al. Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games , 2017, ArXiv.
[2] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[3] Richard L. Lewis,et al. Optimal Rewards for Cooperative Agents , 2014, IEEE Transactions on Autonomous Mental Development.
[4] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[5] Guy Lever,et al. Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward , 2018, AAMAS.
[6] Wojciech Jaskowski,et al. ViZDoom: A Doom-based AI research platform for visual reinforcement learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).
[7] Richard L. Lewis,et al. Reward Design via Online Gradient Ascent , 2010, NIPS.
[8] David Silver,et al. Meta-Gradient Reinforcement Learning , 2018, NeurIPS.
[9] Etienne Perot,et al. Deep Reinforcement Learning framework for Autonomous Driving , 2017, Autonomous Vehicles and Machines.
[10] Tom Schaul,et al. StarCraft II: A New Challenge for Reinforcement Learning , 2017, ArXiv.
[11] Sam Devlin,et al. Reward shaping for knowledge-based multi-objective multi-agent reinforcement learning , 2018, The Knowledge Engineering Review.
[12] Murray Shanahan,et al. Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning , 2017, IEEE Transactions on Neural Networks and Learning Systems.
[13] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[14] D. Barrios-Aranibar,et al. LEARNING FROM DELAYED REWARDS USING INFLUENCE VALUES APPLIED TO COORDINATION IN MULTI-AGENT SYSTEMS , 2007 .
[15] Marek Grzes,et al. Reward Shaping in Episodic Reinforcement Learning , 2017, AAMAS.
[16] Shimon Whiteson,et al. The StarCraft Multi-Agent Challenge , 2019, AAMAS.
[17] Marcin Andrychowicz,et al. Learning to learn by gradient descent by gradient descent , 2016, NIPS.
[18] Satinder Singh,et al. On Learning Intrinsic Rewards for Policy Gradient Methods , 2018, NeurIPS.
[19] Shimon Whiteson,et al. QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning , 2018, ICML.
[20] Qing Wang,et al. Exponentially Weighted Imitation Learning for Batched Historical Data , 2018, NeurIPS.
[21] Joshua B. Tenenbaum,et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.
[22] Bo Li,et al. TStarBots: Defeating the Cheating Level Builtin AI in StarCraft II in the Full Game , 2018, ArXiv.
[23] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[24] Tong Zhang,et al. Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI , 2019, ICML.
[25] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[26] Bartunov Sergey,et al. Meta-Learning with Memory-Augmented Neural Networks , 2016 .
[27] Marco Wiering,et al. Multi-Agent Reinforcement Learning for Traffic Light control , 2000 .
[28] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[29] Joshua Achiam,et al. On First-Order Meta-Learning Algorithms , 2018, ArXiv.
[30] Honglak Lee,et al. Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games , 2016, IJCAI.
[31] Zongqing Lu,et al. Learning Attentional Communication for Multi-Agent Cooperation , 2018, NeurIPS.
[32] Dorian Kodelja,et al. Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.
[33] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[34] Srikanth Kandula,et al. Resource Management with Deep Reinforcement Learning , 2016, HotNets.
[35] Alexei A. Efros,et al. Curiosity-Driven Exploration by Self-Supervised Prediction , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[36] Richard L. Lewis,et al. Intrinsically Motivated Reinforcement Learning: An Evolutionary Perspective , 2010, IEEE Transactions on Autonomous Mental Development.