Optimizing Evasive Strategies for an Evader with Imperfect Vision Capacity

The multiagent pursuit-evasion problem has attracted considerable interest during recent years, and a general assumption is that the evader has perfect vision capacity. However, in the real world, the vision capacity of the evader is always imperfect, and it may have noisy observation within its limited field of view. Such an imperfect vision capacity makes the evader sense incomplete and inaccurate information from the environment, and thus, the evader will achieve suboptimal decisions. To address this challenge, we decompose this problem into two subproblems: 1) optimizing evasive strategies with a limited field of view, and 2) optimizing evasive strategies with noisy observation. For the evader with a limited field of view, we propose a memory-based ‘worst case’ algorithm, the idea of which is to store the locations of the pursuers seen before and estimate the possible region of the pursuers outside the sight of the evader. For the evader with noisy observation, we propose a value-based reinforcement learning algorithm that trains the evader offline and applies the learned strategy to the actual environment, aiming at reducing the impact of uncertainty created by inaccurate information. Furthermore, we combine and make a trade-off between the above two algorithms and propose a memory-based reinforcement learning algorithm that utilizes the estimated locations to modify the input of the state set in the reinforcement learning algorithm. Finally, we extensively evaluate our algorithms in simulation, concluding that in this imperfect vision capacity setting, our algorithms significantly improve the escape success rate of the evader.

[1]  Tamer Basar,et al.  Numerical approximation for a visibility based pursuit-evasion game , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Zhengyuan Zhou,et al.  Evasion of a team of dubins vehicles from a hidden pursuer , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Ming Xu,et al.  Dynamics and Control for Nonideal Solar Sails Around Artificial Lagrangian Points , 2018 .

[4]  Jason M. O'Kane,et al.  A sampling-based algorithm for multi-robot visibility-based pursuit-evasion , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Hong Bing-rong,et al.  Research on High Speed Evader vs. Multi Lower Speed Pursuers in Multi Pursuit-evasion Games , 2012 .

[6]  S. Shankar Sastry,et al.  Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation , 2002, IEEE Trans. Robotics Autom..

[7]  Wei Sun,et al.  Sequential pursuit of multiple targets under external disturbances via Zermelo-Voronoi diagrams , 2017, Autom..

[8]  Dongbing Gu,et al.  Construction of Barrier in a Fishing Game With Point Capture , 2017, IEEE Transactions on Cybernetics.

[9]  Dongbing Gu,et al.  Multi-player pursuit-evasion games with one superior evader , 2016, Autom..

[10]  Jean Rouat,et al.  Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[11]  Jonathan P. How,et al.  Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Ioannis Ch. Paschalidis,et al.  Cooperative multi-quadrotor pursuit of an evader in an environment with no-fly zones , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Tucker R. Balch,et al.  Distributed sensor fusion for object position estimation by multi-robot systems , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[14]  Essameddin Badreddin,et al.  Cooperative pursue in pursuit-evasion games with unmanned aerial vehicles , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Jason M. O'Kane,et al.  Pursuit-evasion with fixed beams , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[16]  Jason M. O'Kane,et al.  Complete and optimal visibility-based pursuit-evasion , 2017, Int. J. Robotics Res..

[17]  Zhifeng Hao,et al.  Multiagent Pursuit-Evasion Problem with the Pursuers Moving at Uncertain Speeds , 2019, J. Intell. Robotic Syst..

[18]  Steven M. LaValle,et al.  Visibility-based pursuit-evasion: the case of curved environments , 2001, IEEE Trans. Robotics Autom..

[19]  Guoliang Xing,et al.  Exploiting Reactive Mobility for Collaborative Target Detection in Wireless Sensor Networks , 2010, IEEE Transactions on Mobile Computing.

[20]  João Pedro Hespanha,et al.  On Discrete-Time Pursuit-Evasion Games With Sensing Limitations , 2008, IEEE Transactions on Robotics.

[21]  Prasanta K. Bose,et al.  Swarm Coordination for Pursuit Evasion Games using Sensor Networks , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[22]  Xin Yao,et al.  Turning High-Dimensional Optimization Into Computationally Expensive Optimization , 2018, IEEE Transactions on Evolutionary Computation.

[23]  Alejandro Sarmiento,et al.  Surveillance Strategies for a Pursuer with Finite Sensor Range , 2007, Int. J. Robotics Res..

[24]  Eithan Ephrati,et al.  Divide and Conquer in Multi-Agent Planning , 1994, AAAI.

[25]  Sonia Martínez,et al.  Distributed robust data fusion based on dynamic voting , 2011, 2011 IEEE International Conference on Robotics and Automation.

[26]  Julie A. Adams,et al.  Hybrid mission planning with coalition formation , 2017, Autonomous Agents and Multi-Agent Systems.

[27]  Mangal Kothari,et al.  Pursuit-Evasion Games of High Speed Evader , 2017, J. Intell. Robotic Syst..

[28]  Jie Chen,et al.  Construction of Barrier in a Three-Player Pursuit-Evasion Game , 2016, Unmanned Syst..

[29]  Masafumi Yamashita,et al.  Searching for a Mobile Intruder in a Polygonal Region , 1992, SIAM J. Comput..

[30]  Mangal Kothari,et al.  A cooperative pursuit-evasion game of a high speed evader , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[31]  Jason M. O'Kane,et al.  Visibility-based pursuit-evasion with probabilistic evader models , 2011, 2011 IEEE International Conference on Robotics and Automation.

[32]  D. Renshaw,et al.  A Comprehensive Tool for Modeling CMOS Image-Sensor-Noise Performance , 2007, IEEE Transactions on Electron Devices.

[33]  Jason M. O'Kane,et al.  Persistent pursuit-evasion: The case of the preoccupied pursuer , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Frank Chongwoo Park,et al.  Robot sensor calibration: solving AX=XB on the Euclidean group , 1994, IEEE Trans. Robotics Autom..

[35]  Yu Zhang,et al.  IQ-ASyMTRe: Forming Executable Coalitions for Tightly Coupled Multirobot Tasks , 2013, IEEE Transactions on Robotics.

[36]  Shahram Izadi,et al.  Modeling Kinect Sensor Noise for Improved 3D Reconstruction and Tracking , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[37]  Sebastian Thrun,et al.  Visibility-based Pursuit-evasion with Limited Field of View , 2004, Int. J. Robotics Res..

[38]  Tamer Basar,et al.  Surveillance for Security as a Pursuit-Evasion Game , 2014, GameSec.

[39]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[40]  Stergios I. Roumeliotis,et al.  Weighted range sensor matching algorithms for mobile robot displacement estimation , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[41]  Yichuan Jiang,et al.  Pursuing a Faster Evader Based on an Agent Team with Unstable Speeds , 2017, AAMAS.

[42]  Glenn Healey,et al.  Radiometric CCD camera calibration and noise estimation , 1994, IEEE Trans. Pattern Anal. Mach. Intell..