Maximum reward collection problem: a cooperative receding horizon approach for dynamic clustering

In this paper, the Maximum Reward Collection Problem (MRCP) in uncertain environments is investigated where multiple agents cooperate to maximize the total reward collected from a set of moving targets in the mission space with unknown arrival times, trajectories and dynamics. The reward with respect to each of the targets has a time discounting value and can be collected only if a cluster of agents with proper number of elements visits the targets. Meanwhile, in each cluster, it is assumed that agents are able to extract a larger fraction of reward when their configuration in the cluster is close to specific configuration around the respective target. The inherited uncertainty in the environment and the dynamic clustering factor render the one-shot optimization in MRCP rather impractical. Therefore, a Cooperative Receding Horizon (CRH) controller is utilized toward maximizing the collected reward and based on the prediction of the future positions of targets with the given limited information. Some analytical aspects of problem is discussed and the effectiveness and advantages of the proposed algorithm is demonstrated via numerical simulations.

[1]  Christos G. Cassandras,et al.  Distributed coverage control and data collection with mobile sensor networks , 2010, 49th IEEE Conference on Decision and Control (CDC).

[2]  Richard M. Murray,et al.  Recent Research in Cooperative Control of Multivehicle Systems , 2007 .

[3]  Amir G. Aghdam,et al.  Stability analysis of dynamic decision-making for vehicle heading control , 2015, 2015 American Control Conference (ACC).

[4]  Christos G. Cassandras,et al.  Distributed Coverage Control and Data Collection With Mobile Sensor Networks , 2010, IEEE Transactions on Automatic Control.

[5]  Efstathios Bakolas,et al.  Optimal pursuit of moving targets using dynamic Voronoi diagrams , 2010, 49th IEEE Conference on Decision and Control (CDC).

[6]  Ali Ekici,et al.  Multiple agents maximum collection problem with time dependent rewards , 2013, Comput. Ind. Eng..

[7]  Amir G. Aghdam,et al.  Cooperative receding horizon control for multi-target interception in uncertain environments , 2014, 53rd IEEE Conference on Decision and Control.

[8]  Francesco Bullo,et al.  Vehicle routing algorithms to intercept escaping targets , 2014, 2014 American Control Conference.

[9]  Elise Miller-Hooks,et al.  Scheduling technicians for planned maintenance of geographically distributed equipment , 2007 .

[10]  Ying Lan Multiple mobile robot cooperative target intercept with local coordination , 2012, 2012 24th Chinese Control and Decision Conference (CCDC).

[11]  Hassan Rivaz,et al.  Cooperative control for multi-target interception with sensing and communication limitations: A game-theoretic approach , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[12]  Yasaman Khazaeni,et al.  A new event-driven Cooperative Receding Horizon controller for multi-agent systems in uncertain environments , 2014, 53rd IEEE Conference on Decision and Control.

[13]  Christos G. Cassandras,et al.  A Cooperative receding horizon controller for multivehicle uncertain environments , 2006, IEEE Transactions on Automatic Control.