Decentralized Decision-Making Under Uncertainty for Multi-Robot Teams

Automatically generating solutions to general multi-robot coordination problems with communication limitations is challenging, but crucial in many domains. As one way to address this problem, we describe a probabilistic framework for synthesizing control policies for general multi-robot systems based on decentralized partially observable Markov decision processes with macro-actions (MacDec-POMDPs). MacDecPOMDPs are a general model of decision-making where a team of robots cooperates to optimize a shared objective in the presence of uncertainty. MacDec-POMDPs also consider communication limitations, so execution is decentralized. We describe how, in contrast to most existing methods that are specialized to a particular problem class, we can synthesize control policies that exploit whatever opportunities for coordination are present in the problem, while balancing uncertainty, sensor information, and information about other robots.

[1]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[2]  Tucker R. Balch,et al.  Behavior-based formation control for multirobot teams , 1998, IEEE Trans. Robotics Autom..

[3]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[4]  Eric Bonabeau,et al.  Cooperative transport by ants and robots , 2000, Robotics Auton. Syst..

[5]  Brian P. Gerkey,et al.  A Formal Analysis and Taxonomy of Task Allocation in Multi-Robot Systems , 2004, Int. J. Robotics Res..

[6]  Howie Choset,et al.  Limited communication, multi-robot team based coverage , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[7]  Frans A. Oliehoek,et al.  Decentralized POMDPs , 2012, Reinforcement Learning.

[8]  Decentralized control of partially observable Markov decision processes , 2015, 52nd IEEE Conference on Decision and Control.

[9]  G. Konidaris,et al.  Planning with Macro-Actions in Decentralized POMDPs Citation , 2014 .

[10]  Jonathan P. How,et al.  Planning for decentralized control of multiple robots under uncertainty , 2014, 2015 IEEE International Conference on Robotics and Automation (ICRA).