论文信息 - Asymmetric multiagent reinforcement learning

Asymmetric multiagent reinforcement learning

A novel method for asymmetric multiagent reinforcement learning is introduced in this paper. The method addresses the problem where the information states of the agents involved in the learning task are not equal; some agents (leaders) have information on how their opponents (followers) will select their actions and based on this information leaders encourage followers to select actions that lead to improved payoffs for the leaders. This kind of configuration arises, e.g. in semi-centralized multiagent systems with an external global utility associated to the system. We present a brief literature survey of multiagent reinforcement learning based on Markov games and then construct an asymmetric learning method that utilizes the theory of Markov games. Additionally, we test the proposed method with a simple example application.

V. Kononen | V. Könönen

[1] T. Başar,et al. Dynamic Noncooperative Game Theory , 1982 .

[2] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[3] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[4] J. Filar,et al. Competitive Markov Decision Processes , 1996 .

[5] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[6] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[7] Michael L. Littman,et al. Friend-or-Foe Q-learning in General-Sum Games , 2001, ICML.

[8] Daniel Kudenko,et al. Reinforcement learning of coordination in cooperative multi-agent systems , 2002, AAAI/IAAI.

[9] Xiaofeng Wang,et al. Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games , 2002, NIPS.

[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.