Adaptive Learning: A New Decentralized Reinforcement Learning Approach for Cooperative Multiagent Systems
Jing Chen | Shaofei Chen | Meng-Lin Li