Nash Equilibrium or Nash Bargaining? Choosing a Solution Concept for Multi-Agent Learning

Learning in many multi-agent settings is inherently repeated play. This calls into question the naive application of Nash equilibria in multi-agent learning and suggests, instead, the application of give-and-take principles of bargaining. We present an M-action, N-player social dilemma that encodes the key elements of the Prisoner's Dilemma and thereby highlights the importance of cooperation in multi-agent systems. The game is instructive because it characterizes social dilemmas with more than two agents and more than two choices. We show how several multi-agent learning algorithms behave in this social dilemma, including a satisficing algorithm based on [16] that is compatible with the bargaining perspective. This algorithm is a form of relaxation search that converges to a satisficing equilibrium without knowledge of other agents' actions and payoffs. Finally, we present theoretical results that characterize the behavior of the algorithm.
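The satisficing relaxation search described above can be illustrated with a minimal sketch of aspiration-based play: each agent keeps an aspiration level, repeats its action when the received payoff meets the aspiration, switches to a random action otherwise, and relaxes the aspiration toward recent payoffs. The class, payoff matrix, and parameter values below are illustrative assumptions, not the paper's exact algorithm; note that the agent uses only its own payoff, never the other agents' actions or payoffs.

```python
import random

class SatisficingAgent:
    """Aspiration-based satisficing learner (illustrative sketch)."""

    def __init__(self, n_actions, aspiration, rate=0.99, rng=None):
        self.n_actions = n_actions
        self.aspiration = aspiration  # current aspiration level
        self.rate = rate              # relaxation rate (lambda)
        self.rng = rng or random.Random()
        self.action = self.rng.randrange(n_actions)

    def act(self):
        return self.action

    def update(self, payoff):
        # Satisfied: keep the current action. Dissatisfied: switch at random.
        if payoff < self.aspiration:
            self.action = self.rng.randrange(self.n_actions)
        # Relax the aspiration toward the received payoff.
        self.aspiration = self.rate * self.aspiration + (1 - self.rate) * payoff

# Two-player Prisoner's Dilemma as the simplest instance (0 = cooperate, 1 = defect).
PAYOFFS = {(0, 0): (3, 3), (0, 1): (0, 5), (1, 0): (5, 0), (1, 1): (1, 1)}

a = SatisficingAgent(2, aspiration=5.0, rng=random.Random(1))
b = SatisficingAgent(2, aspiration=5.0, rng=random.Random(2))
for _ in range(2000):
    pa, pb = PAYOFFS[(a.act(), b.act())]
    a.update(pa)
    b.update(pb)
```

Starting aspirations above the mutual-cooperation payoff make unilateral defection unsatisfying in the long run, which is the mechanism by which this style of learner can settle on cooperative, bargaining-like outcomes rather than the one-shot Nash equilibrium.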

[1] Manuela Veloso, et al. An Analysis of Stochastic Game Theory for Multiagent Reinforcement Learning, 2000.

[2] D. Fudenberg, et al. The Theory of Learning in Games, 1998.

[3] Leslie Pack Kaelbling, et al. Playing is believing: The role of beliefs in multi-agent learning, 2001, NIPS.

[4] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.

[5] Peter Stone, et al. Leading Best-Response Strategies in Repeated Games, 2001, IJCAI.

[6] Nick Feltovich, et al. Reinforcement-based vs. Belief-based Learning Models in Experimental Asymmetric-information Games, 2000.

[7] W. Hamilton, et al. The evolution of cooperation, 1984, Science.

[8] Jeffrey S. Rosenschein, et al. Time and the Prisoner's Dilemma, 2007, ICMAS.

[9] E. Kalai, et al. Rational Learning Leads to Nash Equilibrium, 1993.

[10] Norman Frohlich, et al. When Is Universal Contribution Best for the Group?, 1996.

[11] Debraj Ray, et al. Evolving Aspirations and Cooperation, 1998.

[12] Michael A. Goodrich, et al. Satisficing and Learning Cooperation in the Prisoner's Dilemma, 2001, IJCAI.

[13] M. Goodrich, et al. Neglect Tolerant Teaming: Issues and Dilemmas, 2003.

[14] H. Kuhn. Classics in Game Theory, 1997.

[15] Dilip Mookherjee, et al. Institutional Structure and the Logic of Ongoing Collective Action, 1987, American Political Science Review.

[16] Michael P. Wellman, et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm, 1998, ICML.

[17] M. Veloso, et al. Rational Learning of Mixed Equilibria in Stochastic Games, 2000.

[18] Henry Hamburger, et al. N-person Prisoner's Dilemma, 1973.

[19] Michael P. Wellman, et al. Experimental Results on Q-Learning for General-Sum Stochastic Games, 2000, ICML.

[20] Herbert A. Simon, et al. The Sciences of the Artificial, 1970.

[21] Peter Stone, et al. A Polynomial-Time Nash Equilibrium Algorithm for Repeated Games, 2003, EC '03.

[22] Craig Boutilier, et al. Sequential Optimality and Coordination in Multiagent Systems, 1999, IJCAI.

[23] Sandip Sen, et al. Evaluating concurrent reinforcement learners, 2000, ICMAS.