Coordinating multi-agent reinforcement learning with limited communication
暂无分享,去创建一个
[1] Andrew McCallum,et al. Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State , 1995, ICML.
[2] Makoto Yokoo,et al. Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs , 2005, IJCAI.
[3] Claudia V. Goldman,et al. Transition-independent decentralized markov decision processes , 2003, AAMAS '03.
[4] Nikos A. Vlassis,et al. Collaborative Multiagent Reinforcement Learning by Payoff Propagation , 2006, J. Mach. Learn. Res..
[5] D. Koller,et al. Planning under uncertainty in complex structured environments , 2003 .
[6] Victor R. Lesser,et al. Integrating organizational control into multi-agent learning , 2009, AAMAS.
[7] Victor R. Lesser,et al. Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs , 2011, AAAI.
[8] Anita Raja,et al. Coordinating decentralized learning and conflict resolution across agent boundaries , 2012 .
[9] Carlos Guestrin,et al. Multiagent Planning with Factored MDPs , 2001, NIPS.
[10] Nicholas R. Jennings,et al. Decentralised Coordination of Mobile Sensors Using the Max-Sum Algorithm , 2009, IJCAI.
[11] Michail G. Lagoudakis,et al. Coordinated Reinforcement Learning , 2002, ICML.
[12] Marc Toussaint,et al. Scalable Multiagent Planning Using Probabilistic Inference , 2011, IJCAI.
[13] Victor R. Lesser,et al. Self-organization for coordinating decentralized reinforcement learning , 2010, AAMAS.
[14] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.
[15] Edmund H. Durfee,et al. Influence-Based Policy Abstraction for Weakly-Coupled Dec-POMDPs , 2010, ICAPS.
[16] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.