Distributed Multi-Robot Task Allocation through Vacancy Chains

Existing multi-robot task allocation algorithms (MRTA) generally do not consider the effects of interaction, such as interference, but instead assume that tasks are independent. That assumption, however, is often violated in groups of cooperative mobile robots, where interaction effects can have a critical impact on performance. Modeling the effects of interaction, or group dynamics, is problematic due to their complexity and volatility, which also makes it difficult to hand-code optimal solutions to MRTA problems. We present a distributed MRTA algorithm that is sensitive to interaction dynamics. The algorithm uses distributed reinforcement learning to make interference-sensitive estimates of task utilities and relies on stigmergy to let optimal allocations emerge. The algorithm is inspired by vacancy chains, a resource distribution process common in human and animal societies. We present a formal model of task allocation through vacancy chains that defines optimal allocations in MRTA problems in terms of interference-sensitive measures of individual robot contributions. We validate our model by demonstrating, in simulation, how the predicted allocations are produced by our algorithm. As the robots continuously update their individual utility estimates, the vacancy chain algorithm has the additional property of adapting automatically to changes in the environment, e.g., robot breakdowns or changes in task values. We study the problem of cooperative transportation and demonstrate that our algorithm improves on the performance of hand-coded solutions. Finally, as the vacancy chain algorithm uses no communication or unique roles and is more robust and scalable than alternative algorithms with significant communication overheads.

[1]  Victor R. Lesser,et al.  The Use of Meta-Level Control for Coordination in a Distributed Problem Solving Network , 1983, IJCAI.

[2]  Daniel D. Corkill,et al.  Coordination in a Distributed Problem Solving Network , 1984 .

[3]  I. Chase,et al.  The vacancy chain process: a new mechanism of resource distribution in animals with application to hermit crabs , 1988, Animal Behaviour.

[4]  Roger B. Myerson,et al.  Game theory - Analysis of Conflict , 1991 .

[5]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[6]  Maja J. Mataric,et al.  Interaction and intelligent behavior , 1994 .

[7]  Wei Zhang,et al.  A Reinforcement Learning Approach to job-shop Scheduling , 1995, IJCAI.

[8]  Lynne E. Parker,et al.  L-ALLIANCE: Task-oriented multi-robot learning in behavior-based systems , 1996, Adv. Robotics.

[9]  Maja J. Mataric,et al.  Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[10]  Wilfried Brauer,et al.  Multi-machine scheduling-a multi-agent learning approach , 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).

[11]  Albert Y. Zomaya,et al.  A Framework for Reinforcement-Based Scheduling in Parallel Processor Systems , 1998, IEEE Trans. Parallel Distributed Syst..

[12]  Rachid Alami,et al.  M+: a scheme for multi-robot cooperation through negotiated task allocation and achievement , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[13]  Tucker R. Balch The impact of diversity on performance in multi-robot foraging , 1999, AGENTS '99.

[14]  Chris Melhuish,et al.  Stigmergy, Self-Organization, and Sorting in Collective Robotics , 1999, Artificial Life.

[15]  Manuela Veloso,et al.  Automated Robot Behavior Recognition Applied to Robotic Soccer , 1999 .

[16]  P. Khosla,et al.  Learning-Based Task Allocation in Decentralized Multirobot System , 2000, DARS.

[17]  Maja J. Mataric,et al.  Learning Multiple Models for Reward Maximization , 2000, ICML.

[18]  Maja J. Mataric,et al.  Broadcast of Local Elibility for Multi-Target Observation , 2000, DARS.

[19]  D. Atkin OR scheduling algorithms. , 2000, Anesthesiology.

[20]  Dani Goldberg Design and Evaluation of Robust Behavior-Based Controllers for Distributed Multi-Robot Collection Tasks , 2001 .

[21]  Manuela M. Veloso,et al.  Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.

[22]  Gaurav S. Sukhatme,et al.  Emergent bucket brigading: a simple mechanisms for improving performance in multi-robot constrained-space foraging tasks , 2001, AGENTS '01.

[23]  Manuela M. Veloso,et al.  Automatically tracking and analyzing the behavior of live insect colonies , 2001, AGENTS '01.

[24]  Gaurav S. Sukhatme,et al.  Most valuable player: a robot device server for distributed control , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[25]  A. Ijspeert,et al.  A Macroscopic Analytical Model of Collaboration in Distributed Robotic Systems , 2002, Artificial Life.

[26]  Anil K. Seth,et al.  Modeling Group Foraging: Individual Suboptimality, Interference, and a Kind of Matching , 2001, Adapt. Behav..

[27]  Gaurav S. Sukhatme,et al.  Tracking Targets Using Multiple Robots: The Effect of Environment Occlusion , 2002, Auton. Robots.

[28]  Maja J. Mataric,et al.  Sold!: auction methods for multirobot coordination , 2002, IEEE Trans. Robotics Autom..

[29]  Maja J. Mataric,et al.  General spatial features for analysis of multi-robot and human activities from raw position data , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[30]  Gaurav S. Sukhatme,et al.  Adaptive spatio-temporal organization in groups of robots , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  Michael Sampels,et al.  Ant colony optimization for FOP shop scheduling: a case study on different pheromone representations , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[32]  Maja J. Matarić,et al.  A formal framework for the study of task allocation in multi-robot systems , 2003 .

[33]  Gaurav S. Sukhatme,et al.  Multi-robot task-allocation through vacancy chains , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[34]  Gaurav S. Sukhatme,et al.  Multi-Robot Task Allocation in Uncertain Environments , 2003, Auton. Robots.

[35]  Maja J. Matarić,et al.  A Framework for Studying Multi-Robot Task Allocation , 2003 .

[36]  Brian P. Gerkey,et al.  A Formal Analysis and Taxonomy of Task Allocation in Multi-Robot Systems , 2004, Int. J. Robotics Res..

[37]  Kian Hsiang Low,et al.  Task Allocation via Self-Organizing Swarm Coalitions in Distributed Mobile Sensor Network , 2004, AAAI.

[38]  Marco Dorigo,et al.  Self-Organised Task Allocation in a Group of Robots , 2004, DARS.

[39]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.