Coaching Advice and Adaptation

Our research on coaching concerns one autonomous agent providing advice to another autonomous agent about how to act. In past work, we dealt with advice-receiving agents that had fixed strategies; we now consider agents that are learning. Further, we consider agents that have various limitations, with the hypothesis that if the coach adapts its advice to those limitations, more effective learning will result. In this work, we systematically explore the effect of such limitations on the effectiveness of the coach's advice. We state the two learning problems faced by the coach and the coached agent, and study these problems empirically in a predator-prey environment. The coach has access to optimal policies for the environment and advises the predator on which actions to take. We experiment with limitations on the predator agent's actions, on the bandwidth between the coach and the agent, and on the agent's memory size. Our analysis of the results shows that coaching can improve agent performance in the face of all of these limitations.
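
To make the setup concrete, here is a minimal toy sketch, not the paper's implementation: a Q-learning predator chases a randomly moving prey on a small toroidal grid, while a coach suggests actions. Every specific below is an assumption for illustration only: the grid size, the reward values, a greedy pursuit rule standing in for the coach's access to optimal policies, and a per-episode "advice budget" standing in for the limited coach-agent bandwidth.

```python
# Toy sketch of bandwidth-limited coaching for a Q-learning predator.
# All specifics (grid size, rewards, greedy coach rule, advice budget)
# are illustrative assumptions, not the paper's actual environment.
import random
from collections import defaultdict

SIZE = 5                                      # toroidal SIZE x SIZE grid
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # N, S, E, W

def step(pos, move):
    """Move one cell, wrapping around the grid edges."""
    return ((pos[0] + move[0]) % SIZE, (pos[1] + move[1]) % SIZE)

def torus_dist(a, b):
    """Manhattan distance on the torus."""
    dx = min(abs(a[0] - b[0]), SIZE - abs(a[0] - b[0]))
    dy = min(abs(a[1] - b[1]), SIZE - abs(a[1] - b[1]))
    return dx + dy

def coach_advice(pred, prey):
    """Hypothetical coach policy: the action that most shrinks the distance."""
    return min(range(len(ACTIONS)),
               key=lambda a: torus_dist(step(pred, ACTIONS[a]), prey))

def run(episodes=2000, advice_budget=5, alpha=0.3, gamma=0.9, eps=0.1):
    Q = defaultdict(float)                    # Q[(state, action)] -> value
    captures = 0
    for _ in range(episodes):
        pred, prey = (0, 0), (SIZE // 2, SIZE // 2)
        budget = advice_budget                # advice messages left this episode
        for _ in range(50):                   # cap on episode length
            s = (pred, prey)
            if budget > 0:                    # follow the coach while bandwidth lasts
                a, budget = coach_advice(pred, prey), budget - 1
            elif random.random() < eps:       # otherwise act epsilon-greedily on Q
                a = random.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda x: Q[(s, x)])
            pred = step(pred, ACTIONS[a])
            caught = pred == prey
            if not caught:
                prey = step(prey, random.choice(ACTIONS))
            r = 1.0 if caught else -0.01
            s2 = (pred, prey)
            best = 0.0 if caught else max(Q[(s2, x)] for x in range(len(ACTIONS)))
            Q[(s, a)] += alpha * (r + gamma * best - Q[(s, a)])  # Q-learning update
            if caught:
                captures += 1
                break
    return captures

if __name__ == "__main__":
    random.seed(0)
    print("captures with 5 advice messages/episode:", run(advice_budget=5))
    print("captures with no advice:                ", run(advice_budget=0))
```

Raising or lowering advice_budget models looser or tighter bandwidth between coach and agent; the paper's action and memory limitations could be sketched analogously, for example by restricting ACTIONS or by coarsening the state the agent stores.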
