Reward shaping for valuing communications during multi-agent coordination
Nicholas R. Jennings | Enrico Gerding | Simon Andrew Williamson
[1] John N. Tsitsiklis, et al. The Complexity of Markov Decision Processes, 1987, Math. Oper. Res.
[2] Nicholas R. Jennings, et al. A principled information valuation for communications during multi-agent coordination, 2008.
[3] Victor R. Lesser, et al. Multi-agent policies: from centralized ones to decentralized ones, 2002, AAMAS '02.
[4] Brahim Chaib-draa, et al. An online POMDP algorithm for complex multiagent environments, 2005, AAMAS '05.
[5] Neil Immerman, et al. The Complexity of Decentralized Control of Markov Decision Processes, 2000, UAI.
[6] Makoto Yokoo, et al. Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings, 2003, IJCAI.
[7] R. A. Leibler, et al. On Information and Sufficiency, 1951.
[8] Claudia V. Goldman, et al. Optimizing information exchange in cooperative multi-agent systems, 2003, AAMAS '03.
[9] Manuela M. Veloso, et al. Reasoning about joint beliefs for execution-time communication decisions, 2005, AAMAS '05.
[10] P. Cohen, et al. Rational Communication in Multi-Agent Environments, 2000.
[11] Weixiong Zhang, et al. Towards Flexible Teamwork in Persistent Teams: Extended Report, 2000, Autonomous Agents and Multi-Agent Systems.
[12] Shlomo Zilberstein, et al. Value-based observation compression for DEC-POMDPs, 2008, AAMAS.
[13] Hiroaki Kitano, et al. RoboCup Rescue: a grand challenge for multi-agent systems, 2000, Proceedings Fourth International Conference on MultiAgent Systems.
[14] Weixiong Zhang, et al. Towards flexible teamwork in persistent teams, 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).