Tree Search Techniques for Minimizing Detectability and Maximizing Visibility

We introduce and study the problem of planning a trajectory for an agent to carry out a reconnaissance mission while avoiding being detected by an adversarial guard. This introduces a multi-objective version of classical visibility-based target search and pursuit-evasion problem. In our formulation, the agent receives a positive reward for increasing its visibility (by exploring new regions) and a negative penalty every time it is detected by the guard. The objective is to find a finite-horizon path for the agent that balances the trade off between maximizing visibility and minimizing detectability.We model this problem as a discrete, sequential, two-player, zero-sum game. We use two types of game tree search algorithms to solve this problem: minimax search tree and Monte-Carlo search tree. Both search trees can yield the optimal policy but may require possibly exponential computational time and space. We propose several pruning techniques to reduce the computational cost while still preserving optimality guarantees. Simulation results show that the proposed strategy prunes approximately three orders of magnitude nodes as compared to the brute-force strategy. We also find that the Monte-Carlo search tree saves approximately one order of computational time as compared to the minimax search tree.

[1]  J.P. Hespanha,et al.  Sensing limitations in the Lion and Man problem , 2007, 2007 American Control Conference.

[2]  Emilio Frazzoli,et al.  Incremental Sampling-Based Algorithms for a Class of Pursuit-Evasion Games , 2010, WAFR.

[3]  Svante Carlsson,et al.  Computing Vision Points in Polygons , 1999, Algorithmica.

[4]  Sylvain Gelly,et al.  Exploration exploitation in Go: UCT for Monte-Carlo Go , 2006, NIPS 2006.

[5]  Geoffrey A. Hollinger,et al.  Efficient Multi-robot Search for a Moving Target , 2009, Int. J. Robotics Res..

[6]  Pratap Tokekar,et al.  Non-myopic target tracking strategies for non-linear systems , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[7]  Mark H. M. Winands,et al.  Monte-Carlo Tree Search and minimax hybrids , 2013, 2013 IEEE Conference on Computational Inteligence in Games (CIG).

[8]  Pratap Tokekar,et al.  A Minimax Tree Based Approach for Minimizing Detectability and Maximizing Visibility , 2018, ArXiv.

[9]  Nancy A. Lynch,et al.  Environment Characterization for Non-recontaminating Frontier-Based Robotic Exploration , 2011, PRIMA.

[10]  Jason M. O'Kane,et al.  Complete and optimal visibility-based pursuit-evasion , 2017, Int. J. Robotics Res..

[11]  Rafael Murrieta-Cid,et al.  Image feedback based optimal control and the value of information in a differential game , 2018, Autom..

[12]  Lydia E. Kavraki,et al.  Analysis of probabilistic roadmaps for path planning , 1998, IEEE Trans. Robotics Autom..

[13]  Svante Carlsson,et al.  Finding the Shortest Watchman Route in a Simple Polygon , 1993, ISAAC.

[14]  David Hsu,et al.  Monte Carlo Value Iteration with Macro-Actions , 2011, NIPS.

[15]  Ryan M. Eustice,et al.  Active visual SLAM for robotic area coverage: Theory and experiment , 2015, Int. J. Robotics Res..

[16]  Volkan Isler,et al.  Optimal Reconstruction with a Small Number of Views , 2017, ArXiv.

[17]  Volkan Isler,et al.  Capturing an Evader in a Polygonal Environment with Obstacles , 2011, IJCAI.

[18]  Zhihua Qu,et al.  A heuristic task scheduling for multi-pursuer multi-evader games , 2011, 2011 IEEE International Conference on Information and Automation.

[19]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[20]  Vijay Kumar,et al.  Visibility-based persistent monitoring with robot teams , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Steven M. LaValle,et al.  Planning algorithms , 2006 .

[22]  Volkan Isler,et al.  View Selection with Geometric Uncertainty Modeling , 2018, Robotics: Science and Systems.

[23]  Pieter Spronck,et al.  Monte-Carlo Tree Search: A New Framework for Game AI , 2008, AIIDE.

[24]  J. O'Rourke Art gallery theorems and algorithms , 1987 .

[25]  Lydia E. Kavraki,et al.  Analysis of probabilistic roadmaps for path planning , 1996, Proceedings of IEEE International Conference on Robotics and Automation.