Just add Pepper: extending learning algorithms for repeated matrix games to repeated Markov games

Learning in multi-agent settings has recently garnered much interest, leading to the development of somewhat effective multi-agent learning (MAL) algorithms for repeated normal-form games. However, general-purpose MAL algorithms for richer environments, such as general-sum repeated stochastic (Markov) games (RSGs), are less advanced. Indeed, existing MAL algorithms for RSGs typically succeed only when the behavior of associates meets specific game-theoretic assumptions and when the game belongs to a particular class (such as zero-sum games). In this paper, we present a new algorithm, called Pepper, that extends MAL algorithms designed for repeated normal-form games to RSGs. We demonstrate that Pepper creates a family of new algorithms, each of whose asymptotic performance in an RSG is reminiscent of its asymptotic performance in related repeated normal-form games. We also show that some algorithms formed with Pepper outperform existing algorithms in an interesting RSG.
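
To make the abstract's idea concrete, the following is a minimal sketch of how a repeated matrix-game learner might be lifted to a stochastic game in the spirit described above: one learner per state, with the payoff fed to each state's learner combining the immediate reward and a discounted estimate of the successor state's value, so that matrix-game learning propagates through the state space. All names here (MatrixGameLearner, PepperStyleWrapper, the env_step callback) are hypothetical illustrations; the toy epsilon-greedy learner merely stands in for any MAL algorithm designed for repeated normal-form games, and this sketch is not the paper's actual construction.

    # A minimal sketch, not the paper's implementation. Names are hypothetical.
    import random
    from collections import defaultdict

    class MatrixGameLearner:
        """Toy repeated matrix-game learner: epsilon-greedy action selection
        with running payoff estimates. A stand-in for any MAL algorithm
        designed for repeated normal-form games."""
        def __init__(self, n_actions, epsilon=0.1):
            self.n_actions = n_actions
            self.epsilon = epsilon
            self.value = [0.0] * n_actions   # running mean payoff per action
            self.count = [0] * n_actions

        def select_action(self):
            if random.random() < self.epsilon:
                return random.randrange(self.n_actions)
            return max(range(self.n_actions), key=lambda a: self.value[a])

        def update(self, action, payoff):
            # Incremental mean of the payoffs observed for this action.
            self.count[action] += 1
            self.value[action] += (payoff - self.value[action]) / self.count[action]

    class PepperStyleWrapper:
        """Runs one matrix-game learner per state of a stochastic game.
        Each state's learner is trained on immediate reward plus the
        discounted estimated value of the successor state, so learning in
        each state's 'matrix game' accounts for future play."""
        def __init__(self, n_actions, gamma=0.95):
            self.gamma = gamma
            self.learners = defaultdict(lambda: MatrixGameLearner(n_actions))

        def state_value(self, state):
            # Optimistic continuation estimate: best action value so far.
            return max(self.learners[state].value)

        def step(self, state, env_step):
            """env_step(state, action) -> (reward, next_state) is assumed to
            encapsulate both the environment and the associates' behavior."""
            learner = self.learners[state]
            action = learner.select_action()
            reward, next_state = env_step(state, action)
            learner.update(action, reward + self.gamma * self.state_value(next_state))
            return next_state

Using the maximum of a state's action-value estimates as its continuation value is an optimistic, off-policy choice made here for simplicity; an on-policy variant would instead track the returns actually obtained from each state under the agents' joint behavior.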
