论文信息 - Reward and Diversity in Multirobot Foraging

Reward and Diversity in Multirobot Foraging

This research seeks to quantify the impact of the choice of reward function on behavioral diversity in learning robot teams The methodology developed for this work has been applied to multirobot forag ing soccer and cooperative movement This paper focuses speci cally on results in multirobot forag ing In these experiments three types of reward are used with Q learning to train a multirobot team to forage a local performance based reward a global performance based reward and a heuristic strategy referred to as shaped reinforcement Local strate gies provide each agent a speci c reward according to its own behavior while global rewards provide all the agents on the team the same reward simul taneously Shaped reinforcement provides a heuris tic reward for an agent s action given its situation The experiments indicate that local performance based rewards and shaped reinforcement generate statistically similar results they both provide the best performance and the least diversity Finally learned policies are demonstrated on a team of No madic Technologies Nomad robots

Tucker Balch | T. Balch

[1] Ronald C. Arkin,et al. Cooperation without communication: Multiagent schema-based robot navigation , 1992, J. Field Robotics.

[2] Tucker R. Balch,et al. Communication of behavorial state in multi-agent retrieval tasks , 1993, [1993] Proceedings IEEE International Conference on Robotics and Automation.

[3] Tucker R. Balch,et al. Communication in reactive multiagent robotic systems , 1995, Auton. Robots.

[4] Lynne E. Parker,et al. Heterogeneous multi-robot cooperation , 1994 .

[5] Ronald C. Arkin,et al. Temporal coordination of perceptual algorithms for mobile robot navigation , 1994, IEEE Trans. Robotics Autom..

[6] Pattie Maes,et al. A Study of Territoriality: The Role of Critical Mass in Adaptive Task Division , 1996 .

[7] Hiroaki Kitano,et al. RoboCup: The Robot World Cup Initiative , 1997, AGENTS '97.

[8] Tucker Balch,et al. Learning Roles: Behavioral Diversity in Robot Teams , 1997 .

[9] Maja J. Mataric,et al. Interference as a Tool for Designing and Evaluating Multi-Robot Controllers , 1997, AAAI/IAAI.

[10] Maja J. Mataric,et al. Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[11] R. Arkin,et al. Behavioral diversity in learning robot teams , 1998 .

[12] Tucker R. Balch. The impact of diversity on performance in multi-robot foraging , 1999, AGENTS '99.