Paper information - Reinforcement Learning of Agent with a Staged View in Distance and Direction for the Pursuit Problem

Autonomous agents have so far been given a ranged view in the absolute coordinate system, in which the agent receives accurate information within a fixed range but nothing outside it. This is a considerably artificial situation. In this paper, we propose a staged view in distance and direction in the relative coordinate system, in which an agent receives accurate information in its neighborhood but only rough information in the short-distance and middle-distance areas. This reflects human vision: an object is easy to see when it is nearby and harder to see as the distance grows, and it is easy to see in the center direction and harder to see farther to the right or left. Using a numerical experiment on the pursuit problem, a benchmark problem for multi-agent systems, we show that an agent with the staged view learns effectively using Q-learning.
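To make the idea concrete, here is a minimal sketch of how a staged view could turn a target's relative position into a coarse state for tabular Q-learning. It is an illustration under stated assumptions, not the authors' implementation: the stage boundaries (`NEAR`, `SHORT`, `MIDDLE`), the direction sector widths, the function names `staged_view` and `q_update`, and the action set are all hypothetical, since the abstract gives none of these details. Only the Q-learning update rule itself is the standard one.

```python
import math
from collections import defaultdict

# Hypothetical stage boundaries in grid cells (not taken from the paper).
NEAR, SHORT, MIDDLE = 1.5, 3.5, 6.5
# Hypothetical action set for a grid-world pursuer.
ACTIONS = ("up", "down", "left", "right", "stay")


def staged_view(dx, dy, heading_deg=0.0):
    """Map a target offset (dx, dy) from the agent to a staged observation.

    Returns None when the target lies beyond the outermost stage.
    """
    dist = math.hypot(dx, dy)
    if dist > MIDDLE:
        return None  # nothing is perceived outside the view

    # Bearing relative to the agent's heading, normalized to [-180, 180).
    bearing = (math.degrees(math.atan2(dy, dx)) - heading_deg + 180.0) % 360.0 - 180.0

    if dist <= NEAR:
        # Neighborhood: accurate information (exact distance and bearing).
        return ("near", dist, bearing)

    # Farther away: only a rough distance stage and a rough direction.
    stage = "short" if dist <= SHORT else "middle"
    side = "left" if bearing > 0 else "right"
    a = abs(bearing)
    if a <= 15.0:
        direction = "center"
    elif a <= 60.0:
        direction = side
    else:
        direction = "far-" + side
    return (stage, direction)


def q_update(Q, s, a, r, s_next, actions=ACTIONS, alpha=0.1, gamma=0.9):
    """Standard one-step Q-learning update on a dict-backed table."""
    best_next = max(Q[(s_next, b)] for b in actions)
    Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])


# Usage: observe, act, observe again, then update the table.
Q = defaultdict(float)
s = staged_view(3, 0)    # target three cells straight ahead
s_next = staged_view(1, 0)  # target now adjacent
q_update(Q, s, "right", 1.0, s_next)
```

Because distant targets collapse into a handful of `(stage, direction)` pairs, the Q-table stays small while nearby targets keep their precise position, which is the trade-off the staged view is meant to capture.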

Kazuhisa Seta | Motohide Umano | Tadayoshi Yamamura
