论文信息 - On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor

On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor

Modeling learning agents in the context of Multi-agent Systems requires an adequate understanding of their dynamic behaviour. Usually, these agents are modeled similar to the different players in a standard game theoretical model. Unfortunately traditional Game Theory is static and limited in its usefelness. Evolutionary Game Theory improves on this by providing a dynamics which describes how strategies evolve over time. In this paper, we discuss three learning models whose dynamics are related to the Replicator Dynamics(RD). We show how a classical Reinforcement Learning(RL) technique, i.e. Q-learning relates to the RD. This allows to better understand the learning process and it allows to determine how complex a RL model should be. More precisely, Occam's Razor applies in the framework of games, i.e. the simplest model (Cross) suffices for learning equilibria. An experimental verification in all three models is presented.

Karl Tuyls | Katja Verbeeck | Sam Maes

[1] Kumpati S. Narendra,et al. Learning automata - an introduction , 1989 .

[2] T. D. Schneider,et al. Evolution of biological information. , 2000, Nucleic acids research.

[3] D. Serra,et al. Game theory and economics , 2003 .

[4] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5] Tom Lenaerts,et al. A selection-mutation model for q-learning in multi-agent systems , 2003, AAMAS '03.

[6] Fernando Redondo. Game Theory and Economics , 2001 .

[7] D. Stauffer. Life, Love and Death: Models of Biological Reproduction and Aging , 1999 .

[8] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[9] Tom Lenaerts,et al. Towards a relation between learning agents and evolutionary dynamics , 2002 .

[10] Josef Hofbauer,et al. Evolutionary Games and Population Dynamics , 1998 .

[11] Jörgen W. Weibull,et al. Evolutionary Game Theory , 1996 .

[12] Tilman Börgers,et al. Learning Through Reinforcement and Replicator Dynamics , 1997 .