Learned Search Parameters For Cooperating Vehicles using Gaussian Process Regressions

Unmanned vehicles are capable of working as teams to accomplish a wide variety of mission objectives, such as searching for and tracking targets. In this paper, vehicle search paths are dictated by a joint cost function which maximizes the reward earned from partitioned sections across the search area. In previous work, these rewards were assigned based on the elapsed time since the section had last been searched. This approach is effective in rewarding vehicles to search out areas which haven't been visited in a long time, yet it lacks the ability to weight grid cells differently based on the probability that targets will be in that section. This paper proposes a method of using accumulated knowledge of the average density of targets within an area, along with a Gaussian process regression to assign rewards. Vehicles then choose paths that are more likely to find targets rather than seeking areas which have not been searched recently. Through numerical simulations we show that this method increases the number of targets seen by cooperating UAVs and provides an accurate estimate of target density within a search area.

[1]  Daniel Krajzewicz,et al.  Recent Development and Applications of SUMO - Simulation of Urban MObility , 2012 .

[2]  Carl E. Rasmussen,et al.  Gaussian Process Training with Input Noise , 2011, NIPS.

[3]  J.K. Hedrick,et al.  An overview of emerging results in cooperative UAV control , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[4]  Soo Jeon,et al.  Real-time optimal path planning and wind estimation using Gaussian process regression for precision airdrop , 2017, 2017 American Control Conference (ACC).

[5]  Fakhrul Alam,et al.  Gaussian Process Model Predictive Control of an Unmanned Quadrotor , 2016, Journal of Intelligent & Robotic Systems.

[6]  Olivier Morel,et al.  Autonomous patrol and surveillance system using unmanned aerial vehicles , 2015, 2015 IEEE 15th International Conference on Environment and Electrical Engineering (EEEIC).

[7]  Bernard Ghanem,et al.  Persistent Aerial Tracking system for UAVs , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Krishna R. Pattipati,et al.  Multi-step look-ahead policy for autonomous cooperative surveillance by UAVs in hostile environments , 2008, 2008 47th IEEE Conference on Decision and Control.

[9]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[10]  Agathe Girard,et al.  Prediction at an Uncertain Input for Gaussian Processes and Relevance Vector Machines Application to Multiple-Step Ahead Time-Series Forecasting , 2002 .

[11]  Christopher K. I. Williams,et al.  Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) , 2005 .

[12]  Timothy W. McLain,et al.  Cooperative forest fire surveillance using a team of small unmanned air vehicles , 2006, Int. J. Syst. Sci..

[13]  Nikhil Nigam,et al.  The Multiple Unmanned Air Vehicle Persistent Surveillance Problem: A Review , 2014 .

[14]  Cameron K. Peterson Dynamic grouping of cooperating vehicles using a receding horizon controller for ground target search and track missions , 2017, 2017 IEEE Conference on Control Technology and Applications (CCTA).

[15]  A. Ollero,et al.  The block-sharing strategy for area monitoring missions using a decentralized multi-UAV system , 2014, 2014 International Conference on Unmanned Aircraft Systems (ICUAS).

[16]  J. Meguro,et al.  Real-time hazard map generation using small unmanned aerial vehicle , 2008, 2008 SICE Annual Conference.

[17]  Nicholas R. J. Lawrance,et al.  Autonomous Exploration of a Wind Field with a Gliding Aircraft , 2011 .

[18]  Dongbin Zhao,et al.  Reinforcement learning for multi-agent patrol policy , 2010, 9th IEEE International Conference on Cognitive Informatics (ICCI'10).

[19]  Andrew Gordon Wilson,et al.  Gaussian Process Kernels for Pattern Discovery and Extrapolation , 2013, ICML.

[20]  Yehuda Elmaliach,et al.  A realistic model of frequency-based multi-robot polyline patrolling , 2008, AAMAS.

[21]  Sean R. Martin,et al.  Receding horizon controller using particle swarm optimization for closed-loop ground target surveillance and tracking , 2009, Defense + Commercial Sensing.

[22]  Agathoniki Trigoni,et al.  Supporting Search and Rescue Operations with UAVs , 2010, 2010 International Conference on Emerging Security Technologies.

[23]  Mohsen Afsharchi,et al.  Decentralized area patrolling for teams of UAVs , 2016, 2016 4th International Conference on Control, Instrumentation, and Automation (ICCIA).

[24]  J. Kocijan,et al.  Gaussian process model based predictive control , 2004, Proceedings of the 2004 American Control Conference.