Decentralized and Nondeterministic Multi-Robot Area Patrolling in Adversarial Environments

Multi-robot patrolling is the problem of repeatedly visiting a group of regions of interest in an environment with a group of robots to prevent intrusion. Early works have proposed deterministic patrolling algorithms which could be learned by an adversary observing them over time. More recent works provide non-deterministic patrolling schemes which only work for perimeter patrolling and require coordination and synchronization. In this paper, we investigate the problem of finding robust and scalable strategies for multi-robot patrolling under an adversarial environment. So, we present algorithms to find different decentralized strategies for a patroller in the form of Markov chains which use convex optimization to minimize the average commute time for an environment, a subset of the environment, or a specific region of an environment when we use uniform distribution over all regions for both patroller and adversary. Additionally, we use these strategies in a game theoretical setup to form a payoff matrix to obtain an optimal mixed strategy for patroller. We also propose an algorithm to find a decentralized strategy for patroller in the form of Markov chain which converges very fast as it is the optimal response from patroller against the adversary when we use non-uniform distribution over all the regions for both patroller and adversary. Our results show the scalability and applicability of our approach in different types of environments. Despite the lack of synchronization and coordination among patrollers, our approach performs competitively compared to existing methods.

[1]  Nicola Basilico,et al.  Patrolling security games: Definition and algorithms for solving large instances with single patroller and single intruder , 2012, Artif. Intell..

[2]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[3]  Yann Chevaleyre,et al.  A theoretical analysis of multi-agent patrolling strategies , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[4]  Noa Agmon On events in multi-robot patrol in adversarial environments , 2010, AAMAS.

[5]  Stephen P. Boyd,et al.  Minimizing Effective Resistance of a Graph , 2008, SIAM Rev..

[6]  David Portugal,et al.  MSP algorithm: multi-robot patrolling based on territory allocation using balanced graph partitioning , 2010, SAC '10.

[7]  J. Y. Yen,et al.  Finding the K Shortest Loopless Paths in a Network , 2007 .

[8]  Leonardo Bobadilla,et al.  Distributed Multi-Robot Area Patrolling in Adversarial Environments , 2015 .

[9]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[10]  Nicola Basilico,et al.  Asynchronous Multi-Robot Patrolling against Intrusions in Arbitrary Topologies , 2010, AAAI.

[11]  Sarit Kraus,et al.  Multi-robot perimeter patrol in adversarial settings , 2008, 2008 IEEE International Conference on Robotics and Automation.

[12]  Nicola Basilico,et al.  Leader-follower strategies for robotic patrolling in environments with arbitrary topologies , 2009, AAMAS.

[13]  Peter Stone,et al.  A multi-robot system for continuous area sweeping tasks , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[14]  Sarit Kraus,et al.  Multi-Robot Adversarial Patrolling: Facing a Full-Knowledge Opponent , 2011, J. Artif. Intell. Res..

[15]  Sarit Kraus,et al.  An efficient heuristic approach for security against multiple adversaries , 2007, AAMAS '07.

[16]  Bo An,et al.  Computing Solutions in Infinite-Horizon Discounted Adversarial Patrolling Games , 2014, ICAPS.

[17]  David Portugal,et al.  A Survey on Multi-robot Patrolling Algorithms , 2011, DoCEIS.

[18]  Stephen P. Boyd,et al.  Fastest Mixing Markov Chain on a Graph , 2004, SIAM Rev..

[19]  Stefano Carpin,et al.  Online patrolling using hierarchical spatial representations , 2012, 2012 IEEE International Conference on Robotics and Automation.

[20]  Jacques Wainer,et al.  Probabilistic Multiagent Patrolling , 2008, SBIA.

[21]  Stephen P. Boyd Convex optimization of graph Laplacian eigenvalues , 2006 .

[22]  Sarit Kraus,et al.  Playing games for security: an efficient exact algorithm for solving Bayesian Stackelberg games , 2008, AAMAS.

[23]  Noa Agmon,et al.  Multi-robot area patrol under frequency constraints , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.