Multi Agent Reinforcement Learning for Gridworld Soccer Leadingpass

Soccer robotics is an emerging field that combines artificial intelligence and mobile robotics with the popular sport of soccer. Robotic soccer agents need to cooperate to complete tasks or subtasks, one way is by learning to coordinate their action. Leadingpass is considered as a task that had to be performed successfully by the team, or opponent could intercept the ball that leads the team to lose the game. This paper describes how Reinforcement Learning (RL) methods are applied to the learning scenario, that the learning agents cooperatively complete the leadingpass task in the Gridworld soccer environment. Not only RL algorithms for single agent case, but also for multi agent case.

[1]  Arthur Carvalho,et al.  Reinforcement learning for the soccer dribbling task , 2011, 2011 IEEE Conference on Computational Intelligence and Games (CIG'11).

[2]  Manuela M. Veloso,et al.  Multiagent learning using a variable learning rate , 2002, Artif. Intell..

[3]  Craig Boutilier,et al.  Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.

[4]  Reinaldo A. C. Bianchi,et al.  Reinforcement Learning with Case-Based Heuristics for RoboCup Soccer Keepaway , 2012, 2012 Brazilian Robotics Symposium and Latin American Robotics Symposium.

[5]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[6]  Martin A. Riedmiller,et al.  Effective Methods for Reinforcement Learning in Large Multi-Agent Domains (Leistungsfähige Verfahren für das Reinforcement Lernen in komplexen Multi-Agenten-Umgebungen) , 2005, it Inf. Technol..

[7]  Chun-Gui Li,et al.  A Multi-agent Reinforcement Learning using Actor-Critic methods , 2008, 2008 International Conference on Machine Learning and Cybernetics.

[8]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  Bart De Schutter,et al.  Multiagent Reinforcement Learning with Adaptive State Focus , 2005, BNAIC.

[10]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[11]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[12]  E. Schuitema Reinforcement Learning on autonomous humanoid robots , 2012 .

[13]  Peter Stone,et al.  Reinforcement Learning for RoboCup Soccer Keepaway , 2005, Adapt. Behav..

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.