Simultaneous Policy and Discrete Communication Learning for Multi-Agent Cooperation
暂无分享,去创建一个
[1] Craig Boutilier,et al. Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.
[2] Robert Babuska,et al. A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[3] Pieter Abbeel,et al. Emergence of Grounded Compositional Language in Multi-Agent Populations , 2017, AAAI.
[4] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[5] Bart De Schutter,et al. Multi-agent Reinforcement Learning: An Overview , 2010 .
[6] Guy Lever,et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning , 2018, Science.
[7] Shimon Whiteson,et al. Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.
[8] Shimon Whiteson,et al. Counterfactual Multi-Agent Policy Gradients , 2017, AAAI.
[9] Yi Wu,et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.
[10] Manuela M. Veloso,et al. Heuristic Planning for Decentralized MDPs with Sparse Interactions , 2010, DARS.
[11] Howie Choset,et al. PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning , 2018, IEEE Robotics and Automation Letters.
[12] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[13] Howie Choset,et al. Distributed Learning of Decentralized Control Policies for Articulated Mobile Robots , 2019, IEEE Transactions on Robotics.
[14] Alexandre M. Bayen,et al. Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines , 2018, ICLR.
[15] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[16] Howie Choset,et al. Distributed Reinforcement Learning for Multi-robot Decentralized Collective Construction , 2018, DARS.