论文信息 - Keepaway Soccer: A Machine Learning Testbed

Keepaway Soccer: A Machine Learning Testbed

RoboCup simulated soccer presents many challenges to machine learning (ML) methods, including a large state space, hidden and uncertain state, multiple agents, and long and variable delays in the effects of actions. While there have been many successful ML applications to portions of the robotic soccer task, it appears to be still beyond the capabilities of modern machine learning techniques to enable a team of 11 agents to successfully learn the full robotic soccer task from sensors to actuators. Because the successful applications to portions of the task have been embedded in different teams and have often addressed different sub-tasks, they have been difficult to compare. We put forth keepaway soccer as a domain suitable for directly comparing different machine learning approaches to robotic soccer. It is complex enough that it can't be solved trivially, yet simple enough that complete machine learning approaches are feasible. In keepaway, one team, "the keepers," tries to keep control of the ball for as long as possible despite the efforts of "the takers." The keepers learn individually when to hold the ball and when to pass to a teammate, while the takers learn when to charge the ball-holder and when to cover possible passing lanes. We fully specify the domain and summarize some initial, successful learning results.

Peter Stone | Richard S. Sutton | R. Sutton | P. Stone

[1] Aiko M. Hormann,et al. Programs for Machine Learning. Part I , 1962, Inf. Control..

[2] J. Ross Quinlan,et al. C4.5: Programs for Machine Learning , 1992 .

[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[4] Michael O. Duff,et al. Reinforcement Learning Methods for Continuous-Time Markov Decision Problems , 1994, NIPS.

[5] Hiroaki Kitano,et al. The RoboCup Synthetic Agent Challenge 97 , 1997, IJCAI.

[6] James A. Hendler,et al. Co-evolving Soccer Softbot Team Coordination with Genetic Programming , 1997, RoboCup.

[7] Tomohito Andou,et al. Refinement of Soccer Agents' Positions Using Reinforcement Learning , 1997, RoboCup.

[8] Ian Frank,et al. Soccer Server: A Tool for Research on Multiagent Systems , 1998, Appl. Artif. Intell..

[9] Astro Teller,et al. Evolving Team Darwin United , 1998, RoboCup.

[10] Peter Stone,et al. Anticipation as a key for collaboration in a team of agents: a case study in robotic soccer , 1999, Optics East.

[11] Manuela M. Veloso,et al. Team-partitioned, opaque-transition reinforcement learning , 1999, AGENTS '99.

[12] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[13] Eiji Uchibe,et al. Cooperative Behavior Acquisition by Learning and Evolution in a Multi-Agent Environment for Mobile Robots , 1999 .

[14] 浅田稔,et al. RoboCup-98 : Robot Soccer World Cup II , 1999 .

[15] Peter Stone,et al. Layered learning in multiagent systems - a winning approach to robotic soccer , 2000, Intelligent robotics and autonomous agents.

[16] Martin A. Riedmiller,et al. Karlsruhe Brainstormers - A Reinforcement Learning Approach to Robotic Soccer , 2000, RoboCup.

[17] Peter Stone,et al. An architecture for action selection in robotic soccer , 2001, AGENTS '01.

[18] Peter Stone,et al. Scaling Reinforcement Learning toward RoboCup Soccer , 2001, ICML.