Monte Carlo Tree Search with Branch and Bound for Multi-Robot Task Allocation

Multi-robot teams are effective in a variety of task allocation domains such as warehouse automation and surveillance. Robots in such domains have to perform tasks at given locations and specific times. Tasks have to be allocated to optimize given team objectives, such as minimizing the total distance traveled. We propose an efficient, satisficing and centralized Monte Carlo Tree Search (MCTS) based algorithm which exploits the branch and bound paradigm with a novel search parallelization method to solve the multi-robot task allocation problem with spatial, temporal and other side constraints. Unlike previous heuristics proposed for this problem, our approach maintains asymptotic convergence guarantees of MCTS and it has efficient anytime behavior. It finds near-optimal solutions for non-trivial problems in the Solomon data sets in an hour.

[1]  L. Goddard,et al.  Operations Research (OR) , 2007 .

[2]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[3]  Marius M. Solomon,et al.  Algorithms for the Vehicle Routing and Scheduling Problems with Time Window Constraints , 1987, Oper. Res..

[4]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[5]  S. Crawford,et al.  Volume 1 , 2012, Journal of Diabetes Investigation.

[6]  Jonathan P. How,et al.  Coordination and control of multiple UAVs with timing constraints and loitering , 2003, Proceedings of the 2003 American Control Conference, 2003..

[7]  Editors , 2003 .

[8]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[9]  Michel Gendreau,et al.  Vehicle Routing Problem with Time Windows, Part I: Route Construction and Local Search Algorithms , 2005, Transp. Sci..

[10]  Michel Gendreau,et al.  Vehicle Routing Problem with Time Windows, Part II: Metaheuristics , 2005, Transp. Sci..

[11]  Zhun Fan,et al.  Service robots for hospitals: A case study of transportation tasks in a hospital , 2009, 2009 IEEE International Conference on Automation and Logistics.

[12]  Martin Müller,et al.  Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search , 2010, IEEE Transactions on Computational Intelligence and AI in Games.

[13]  Anthony Stentz,et al.  Optimal Vehicle Routing and Scheduling with Precedence Constraints and Location Choice , 2010 .

[14]  Han-Lim Choi,et al.  Decentralized planning for complex missions with dynamic communication constraints , 2010, Proceedings of the 2010 American Control Conference.

[15]  Alan Fern,et al.  Ensemble Monte-Carlo Planning: An Empirical Study , 2011, ICAPS.

[16]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[17]  Simon M. Lucas,et al.  A Survey of Monte Carlo Tree Search Methods , 2012, IEEE Transactions on Computational Intelligence and AI in Games.

[18]  Anthony Stentz,et al.  A comprehensive taxonomy for multi-robot task allocation , 2013, Int. J. Robotics Res..

[19]  Julie A. Shah,et al.  Fast Scheduling of Multi-Robot Teams with Temporospatial Constraints , 2013, Robotics: Science and Systems.

[20]  Stephen J. Guy,et al.  User-driven narrative variation in large story domains using monte carlo tree search , 2014, AAMAS.

[21]  Stefan Edelkamp,et al.  Solving Single Vehicle Pickup and Delivery Problems with Time Windows and Capacity Constraints using Nested Monte-Carlo Search , 2014, ICAART.

[22]  Stephen J. Guy,et al.  Stochastic Tree Search with Useful Cycles for patrolling problems , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Changjoo Nam,et al.  Assignment Algorithms for Modeling Resource Contention in Multirobot Task Allocation , 2015, IEEE Transactions on Automation Science and Engineering.

[24]  George Samaras,et al.  SocialRobot: An interactive mobile robot for elderly home care , 2015, 2015 IEEE/SICE International Symposium on System Integration (SII).

[25]  Stephen J. Guy,et al.  Generating Sokoban Puzzle Game Levels with Monte Carlo Tree Search , 2016 .

[26]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.