论文信息 - Function approximation based multi-agent reinforcement learning

Function approximation based multi-agent reinforcement learning

The paper presents two new multi-agent based domain independent coordination mechanisms for reinforcement learning. The first mechanism allows agents to learn coordination information from state transitions and the second one from the observed reward distribution. In this way, the latter mechanism tends to increase region-wide joint rewards. The selected experimented domain is Adversarial Food-Collecting World (AFCW), which can be configured both as single and multi-agent environments. Experimental results show the effectiveness of these mechanisms.

[1] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[2] Faruk Polat,et al. A Conflict Resolution-Based Decentralized Multi-Agent Problem Solving Model , 1992, MAAMAW.

[3] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[4] Nils J. Nilsson,et al. Reacting, Planning, and Learning in an Autonomous Agent , 1996, Machine Intelligence 14.

[5] Gerhard Weiss,et al. Learning to Coordinate Actions in Multi-Agent-Systems , 1993, IJCAI.

[6] Moshe Tennenholtz,et al. Adaptive Load Balancing: A Study in Multi-Agent Learning , 1994, J. Artif. Intell. Res..

[7] Shashi Shekhar,et al. A Negotiation Platform for Cooperating Multi-agent Systems , 1993 .

[8] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.