Paper information - Max-Planck-Institut für Mathematik in den Naturwissenschaften Leipzig - Reinforcement Learning in Complementarity Game and Population Dynamics

We systematically test and compare different reinforcement learning schemes in a complementarity game [J. Jost and W. Li, Physica A 345, 245 (2005)] played between members of two populations. More precisely, we study the Roth-Erev, Bush-Mosteller, and SoftMax reinforcement learning schemes. A modified version of Roth-Erev with a power exponent of 1.5, as opposed to 1 in the standard version, performs best. We also compare these reinforcement learning strategies with evolutionary schemes. The comparison gives insight into issues such as quick adaptation versus systematic exploration and the role of learning rates.
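The abstract does not spell out the update rules, so the following is only a minimal sketch under stated assumptions. It assumes the basic Roth-Erev rule, in which each action carries a propensity that grows by the payoff received, and it assumes that the "power exponent" enters by raising each propensity to that power before normalizing into choice probabilities. The function names `choice_probabilities` and `roth_erev_update` are hypothetical and are not taken from the paper.

```python
def choice_probabilities(propensities, exponent=1.0):
    """Return action probabilities proportional to propensity ** exponent.

    Assumption: exponent=1.0 corresponds to the standard scheme and
    exponent=1.5 to the modified scheme mentioned in the abstract.
    """
    weights = [q ** exponent for q in propensities]
    total = sum(weights)
    return [w / total for w in weights]


def roth_erev_update(propensities, action, payoff):
    """Basic Roth-Erev step: add the payoff to the chosen action's propensity."""
    updated = list(propensities)
    updated[action] += payoff
    return updated


# Illustrative values only: two actions with propensities 1 and 3.
propensities = [1.0, 3.0]
standard = choice_probabilities(propensities, exponent=1.0)
modified = choice_probabilities(propensities, exponent=1.5)
after_reward = roth_erev_update(propensities, action=0, payoff=2.0)
```

Under these assumptions, an exponent above 1 shifts probability toward the action with the larger propensity, which would make the learner exploit accumulated experience more sharply than the standard scheme does.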

Jürgen Jost | Wei Li
