Patrolling task planning for the multi-layer multi-agent system based on sequential allocation method

The unmanned aerial vehicle (UAV) swarm has developed rapidly in recent years, especially the UAV swarm with sensors which is becoming common means of achieving situational awareness. In this paper, we develop a scalable, online and myopic algorithm for the multi-layer multi-agent system continuously patrolling problem. The main goal of the multi-agent system is to collect information as much as possible. We formulate this problem as Partially Observable Markov Decision Process (POMDP). The algorithm includes information dimensionality reduction representation, inter-layer information interaction, online heuristic function and sequential allocation method, which effectively improve the collected information and reduces the computational complexity. In addition, as the layer increases, this algorithm can guarantee the patrolling performance of the multi-agent system without increasing the computational complexity for each sub-leader. Finally, the empirical analysis shows that our algorithm has many advantages, which has theoretical and practical significance.

[1]  Nicholas R. Jennings,et al.  Near-optimal continuous patrolling with teams of mobile information gathering agents , 2013, Artif. Intell..

[2]  Matthijs T. J. Spaan,et al.  Partially Observable Markov Decision Processes , 2010, Encyclopedia of Machine Learning.

[3]  Stéphane Galland,et al.  Organizational and Holonic Modelling of a Simulated and Synthetic Spatial Environment , 2014, E4MAS.

[4]  Sarvapali D. Ramchurn,et al.  Multi-Agent Patrolling under Uncertainty and Threats , 2015, PloS one.

[5]  Charles Lesire,et al.  Multi-Target Detection and Recognition by UAVs Using Online POMDPs , 2013, AAAI.

[6]  Lincheng Shen,et al.  Robust Satisficing Decision Making for Unmanned Aerial Vehicle Complex Missions under Severe Uncertainty , 2016, PloS one.

[7]  Yi Ouyang,et al.  On the Optimality of Myopic Sensing in Multi-State Channels , 2013, IEEE Transactions on Information Theory.

[8]  Sarvapali D. Ramchurn,et al.  Decentralized Patrolling Under Constraints in Dynamic Environments , 2016, IEEE Transactions on Cybernetics.

[9]  Stéphane Galland,et al.  SARL: A General-Purpose Agent-Oriented Programming Language , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[10]  Anders Lyhne Christensen,et al.  Generic, scalable and decentralized fault detection for robot swarms , 2017, PloS one.

[11]  Alborz Geramifard,et al.  Cooperative Mission Planning for Multi-UAV Teams , 2015 .

[12]  William H. Sandholm,et al.  Orders of limits for stationary distributions, stochastic dominance, and stochastic stability , 2010 .

[13]  Joel Veness,et al.  Monte-Carlo Planning in Large POMDPs , 2010, NIPS.