Cooperative Q-Leaming Based on Maturity ofthe Policy
暂无分享,去创建一个
Mao Yang | Yantao Tian | Xiaomei Liu | Mao Yang | Yantao Tian | Xiaomei Liu
[1] Y. Kuroe,et al. Swarm reinforcement learning algorithms -exchange of information among multiple agents- , 2007, SICE Annual Conference 2007.
[2] Maja J. Mataric,et al. Broadcast of local eligibility: behavior-based control for strongly cooperative robot teams , 2000, International Conference on Autonomous Agents.
[3] Maja J. Mataric,et al. Broadcast of Local Elibility for Multi-Target Observation , 2000, DARS.
[4] Tucker R. Balch,et al. Communication, Diversity and Learning: Cornerstones of Swarm Behavior , 2004, Swarm Robotics.
[5] Yang Zhilian. Overview of particle swarm optimization , 2003 .
[6] Maja J. Mataric,et al. Murdoch: publish/subscribe task allocation for heterogeneous agents , 2000, AGENTS '00.
[7] Yantao Tian,et al. Cooperative Q Learning Based on Blackboard Architecture , 2007, 2007 International Conference on Computational Intelligence and Security Workshops (CISW 2007).
[8] Lynne E. Parker,et al. ALLIANCE: an architecture for fault tolerant multirobot cooperation , 1998, IEEE Trans. Robotics Autom..