论文信息 - Scheduling with Group Dynamics: A Multi-Robot Task-Allocation Algorithm based on Vacancy Chains

Scheduling with Group Dynamics: A Multi-Robot Task-Allocation Algorithm based on Vacancy Chains

Abstract : Existing task allocation and scheduling algorithms, including task- allocation algorithms for multi-robot systems, generally assume that tasks are independent. This assumption is often violated in groups of cooperative mobile robots, where the group dynamics can have a critical impact on performance. We present a multi-robot task allocation algorithm that is sensitive to group dynamics. Our algorithm is based on vacancy chains, a resource distribution process common in human and animal societies. We study the problem of cooperative transportation in simulation. We demonstrate through experiments in simulation that if robots keep local task utility estimates, and follow a greedy task selection policy, the interactions in the group cause the collection of learned policies to converge toward an optimal allocation pattern as defined by the vacancy chain framework. As the robots are continuously updating their individual utility estimates, the vacancy chain algorithm has the additional property of adapting automatically to changes in the environment, e.g., robot breakdowns or changes in task values. Our experiments show that in the case of such changes, the vacancy chain algorithm consistently outperforms random and static task allocation algorithms. Finally, the vacancy chain algorithm uses no communication or unique roles, and as a result it is more likely to scale to large groups and will degrade gracefully in response to individual breakdowns.

Gaurav S. Sukhatme | Maja J. Matarić | Torbjørn S. Dahl

[1] Maja J. Matari,et al. Behavior-based Control: Examples from Navigation, Learning, and Group Behavior , 1997 .

[2] Wilfried Brauer,et al. Multi-machine scheduling-a multi-agent learning approach , 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).

[3] Manuela Veloso,et al. Automated Robot Behavior Recognition Applied to Robotic Soccer , 1999 .

[4] Lynne E. Parker,et al. L-ALLIANCE: Task-oriented multi-robot learning in behavior-based systems , 1996, Adv. Robotics.

[5] Maja J. Mataric,et al. Learning Multiple Models for Reward Maximization , 2000, ICML.

[6] Manuela M. Veloso,et al. Automatically tracking and analyzing the behavior of live insect colonies , 2001, AGENTS '01.

[7] Maja J. Mataric,et al. Behaviour-based control: examples from navigation, learning, and group behaviour , 1997, J. Exp. Theor. Artif. Intell..

[8] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[9] Gérard Lachiver,et al. RL-Cyclist: A Self-Teaching Agent Driving a Bicycle , 2000, Artificial Intelligence and Soft Computing.

[10] Gaurav S. Sukhatme,et al. Emergent bucket brigading: a simple mechanisms for improving performance in multi-robot constrained-space foraging tasks , 2001, AGENTS '01.

[11] Aloysius K. Mok,et al. Multiprocessor On-Line Scheduling of Hard-Real-Time Tasks , 1989, IEEE Trans. Software Eng..

[12] Thomas G. Dietterich. Adaptive computation and machine learning , 1998 .

[13] P. Brucker,et al. Scheduling Algorithms, Second Revised and Enlarged Edition. , 1999 .

[14] Maja J. Mataric,et al. Broadcast of Local Elibility for Multi-Target Observation , 2000, DARS.

[15] Tucker R. Balch. The impact of diversity on performance in multi-robot foraging , 1999, AGENTS '99.

[16] Chris Melhuish,et al. Stigmergy, Self-Organization, and Sorting in Collective Robotics , 1999, Artificial Life.

[17] Rachid Alami,et al. M+: a scheme for multi-robot cooperation through negotiated task allocation and achievement , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[18] Dani Goldberg. Design and Evaluation of Robust Behavior-Based Controllers for Distributed Multi-Robot Collection Tasks , 2001 .

[19] Gaurav S. Sukhatme,et al. Most valuable player: a robot device server for distributed control , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[20] Victor R. Lesser,et al. The Use of Meta-Level Control for Coordination in a Distributed Problem Solving Network , 1983, IJCAI.

[21] Maja J. Mataric,et al. Interaction and intelligent behavior , 1994 .

[22] Tamio Arai,et al. Distributed Autonomous Robotic Systems 3 , 1998 .

[23] Michael Sampels,et al. Ant colony optimization for FOP shop scheduling: a case study on different pheromone representations , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[24] 李幼升,et al. Ph , 1989 .

[25] Albert Y. Zomaya,et al. A Framework for Reinforcement-Based Scheduling in Parallel Processor Systems , 1998, IEEE Trans. Parallel Distributed Syst..

[26] Alex M. Andrew,et al. ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).

[27] Wei Zhang,et al. A Reinforcement Learning Approach to job-shop Scheduling , 1995, IJCAI.

[28] Gaurav S. Sukhatme,et al. Adaptive spatio-temporal organization in groups of robots , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29] A. Ijspeert,et al. A Macroscopic Analytical Model of Collaboration in Distributed Robotic Systems , 2002, Artificial Life.

[30] Maja J. Mataric,et al. General spatial features for analysis of multi-robot and human activities from raw position data , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31] Pradeep K. Khosla,et al. Learning-Based Task Allocation in Decentralized Multirobot System , 2000, DARS.

[32] Anil K. Seth,et al. Modeling Group Foraging: Individual Suboptimality, Interference, and a Kind of Matching , 2001, Adapt. Behav..

[33] Maja J. Mataric,et al. Sold!: auction methods for multirobot coordination , 2002, IEEE Trans. Robotics Autom..

[34] I. Chase,et al. The vacancy chain process: a new mechanism of resource distribution in animals with application to hermit crabs , 1988, Animal Behaviour.