A Game Theoretic Framework for Communication in Fully Observable Multiagent Systems

Communication is an important element of multiagent systems (MAS). In fully decentralized systems it is needed to allow the agents to coordinate their actions to achieve certain goals. When the agents have no means to coordinate their actions, they generally choose actions that minimize their chance of losses. If the agents were allowed to coordinate, on the other hand, they can choose actions that allow them to get higher rewards instead. In game theory, these concepts are known as risk dominance and payoff dominance. In this paper, we model communication between agents in stochastic games as an extensive form game in which the agents can numerically evaluate the benefit of communicating a piece of information to the other agents. This allows the agent to answer two important questions about the communication process, namely when should an agent communicate, and what should an agent communicate. We also present a more stable way to select between Nash equilibria in multiagent reinforcement learning using NashQ which is used in the calculation of the values for the communication game.

[1]  Geoffrey J. Gordon,et al.  Multi-Robot Negotiation: Approximating the Set of Subgame Perfect Equilibria in General-Sum Stochastic Games , 2006, NIPS.

[2]  Michael P. Wellman,et al.  Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..

[3]  Brahim Chaib-draa,et al.  Distributed Planning in Stochastic Games with Communication , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[4]  Eric Maskin,et al.  Markov Perfect Equilibrium: I. Observable Actions , 2001, J. Econ. Theory.

[5]  John C. Harsanyi,et al.  Общая теория выбора равновесия в играх / A General Theory of Equilibrium Selection in Games , 1989 .

[6]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[7]  Yoav Shoham,et al.  Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[8]  Manuela Veloso,et al.  What to Communicate? Execution-Time Decision in Multi-agent POMDPs , 2006, DARS.

[9]  J. Harsanyi A new theory of equilibrium selection for games with complete information , 1995 .

[10]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Abdel-Illah Mouaddib,et al.  A Rich Communication Model in Opportunistic Decentralized Decision Making , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.