Reinforcement Learning of Agent with a Staged View in Distance and Direction for the Pursuit Problem
暂无分享,去创建一个
[1] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[2] K. Fu,et al. A heuristic approach to reinforcement learning control systems , 1965 .
[3] Kenji Fukumoto,et al. Multi-agent Reinforcement Learning: A Modular Approach , 1996 .
[4] Jiming Liu. Autonomous agents and multi-agent systems : explorations in learning, self-organization and adaptive computation , 2001 .
[5] M. Benda,et al. On Optimal Cooperation of Knowledge Sources , 1985 .
[6] Richard S. Sutton,et al. Generalization in ReinforcementLearning : Successful Examples UsingSparse Coarse , 1996 .
[7] Akira Ito,et al. Speeding up Multi-Agent Reinforcement Learning by Coarse-Graining of Perception ——Hunter Game as an Example—— , 2001 .
[8] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[9] Andreas Birk,et al. RoboCup 2001: Robot Soccer World Cup V , 2002, Lecture Notes in Computer Science.
[10] John J. Grefenstette,et al. Credit assignment in rule discovery systems based on genetic algorithms , 1988, Machine Learning.