Nash Bargaining Solution based rendezvous guidance of unmanned aerial vehicles

This paper addresses a finite-time rendezvous problem for a group of unmanned aerial vehicles (UAVs), in the absence of a leader or a reference trajectory. When the UAVs do not cooperate, they are assumed to use Nash equilibrium strategies (NES). However, when the UAVs can communicate among themselves, they can implement cooperative game theoretic strategies for mutual benefit. In a convex linear quadratic differential game (LQDG), a Pareto-optimal solution (POS) is obtained when the UAVs jointly minimize a team cost functional, which is constructed through a convex combination of individual cost functionals. This paper proposes an algorithm to determine the convex combination of weights corresponding to the Pareto-optimal Nash Bargaining Solution (NBS), which offers each UAV a lower cost than that incurred from the NES. Conditions on the cost functions that make the proposed algorithm converge to the NBS are presented. A UAV, programmed to choose its strategies at a given time based upon cost-to-go estimates for the rest of the game duration, may switch to NES finding it to be more beneficial than continuing with a cooperative strategy it previously agreed upon with the other UAVs. For such scenarios, a renegotiation method, that makes use of the proposed algorithm to obtain the NBS corresponding to the state of the game at an intermediate time, is proposed. This renegotiation method helps to establish cooperation between UAVs and prevents non-cooperative behaviour. In this context, the conditions of time consistency of a cooperative solution have been derived in connection to LQDG. The efficacy of the guidance law derived from the proposed algorithm is illustrated through simulations. (C) 2018 Published by Elsevier Ltd on behalf of The Franklin Institute.

[1]  Lesley A. Weitz,et al.  Decentralized Cooperative-Control Design for Multivehicle Formations , 2007 .

[2]  Tal Shima,et al.  Formation-Flying Guidance for Cooperative Radar Deception , 2012 .

[3]  Khashayar Khorasani,et al.  Multi-agent team cooperation: A game theory approach , 2009, Autom..

[4]  Mark R. Anderson,et al.  FORMATION FLIGHT AS A COOPERATIVE GAME , 1998 .

[5]  Debasish Ghose,et al.  Team, Game, and Negotiation based Intelligent Autonomous UAV Task Allocation for Wide Area Applications , 2007, Innovations in Intelligent Machines.

[6]  Yisheng Zhong,et al.  Time-varying formation control for unmanned aerial vehicles with switching interaction topologies , 2014, 2014 International Conference on Unmanned Aircraft Systems (ICUAS).

[7]  Rodney Teo,et al.  Decentralized overlapping control of a formation of unmanned aerial vehicles , 2004, Autom..

[8]  Dongbing Gu,et al.  A Differential Game Approach to Formation Control , 2008, IEEE Transactions on Control Systems Technology.

[9]  Jacob Engwerda,et al.  LQ Dynamic Optimization and Differential Games , 2005 .

[10]  Youdan Kim,et al.  Controller Design for UAV Formation Flight Using Consensus based Decentralized Approach , 2009 .

[11]  Dimitri P. Bertsekas,et al.  Convex Optimization Algorithms , 2015 .

[12]  R. Pesenti,et al.  Mechanism Design for Optimal Consensus Problems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[13]  Arshad Mahmood,et al.  Decentrailized formation flight control of quadcopters using robust feedback linearization , 2017, J. Frankl. Inst..

[14]  William B. Dunbar,et al.  Distributed receding horizon control for multi-vehicle formation stabilization , 2006, Autom..

[15]  F. Abdollahi,et al.  Time varying formation control using feedback information differential game approach , 2011, 2011 19th Iranian Conference on Electrical Engineering.

[16]  T.H. Lee,et al.  A leader-follower formation flight control scheme for UAV helicopters , 2008, 2008 IEEE International Conference on Automation and Logistics.

[17]  Robert F. Stengel,et al.  Optimization and Coordination of Multiagent Systems Using Principled Negotiation , 1999 .

[18]  Zhan Li,et al.  Decentralized output-feedback formation control of multiple 3-DOF laboratory helicopters , 2015, J. Frankl. Inst..

[19]  Georges Zaccour,et al.  Time Consistency in Cooperative Differential Games: A Tutorial , 2007, INFOR Inf. Syst. Oper. Res..

[20]  Changchun Hua,et al.  Nonlinear protocols for distributed consensus in directed networks of dynamic agents , 2015, J. Frankl. Inst..

[21]  John Leif Jørgensen,et al.  Noncooperative Rendezvous Using Angles-Only Optical Navigation: System Design and Flight Results , 2013 .

[22]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[23]  Yuanqing Xia,et al.  Coordinated formation control design with obstacle avoidance in three-dimensional space , 2015, J. Frankl. Inst..

[24]  Austin L. Smith Proportional Navigation With Adaptive Terminal Guidance For Aircraft Rendezvous (Preprint) , 2007 .

[25]  Wei Ren Coordination of Multiple Micro Air Vehicles Using Consensus Schemes , 2005 .

[26]  Ashwini Ratnoo Variable Deviated Pursuit for Rendezvous Guidance , 2015 .

[27]  Hugh H. T. Liu,et al.  Formation UAV flight control using virtual structure and motion synchronization , 2008, 2008 American Control Conference.

[28]  Bruno Sinopoli,et al.  Distributed control applications within sensor networks , 2003, Proc. IEEE.

[29]  Jason R. Marden,et al.  Game Theory and Distributed Control , 2015 .

[30]  Stephen P. Boyd,et al.  Distributed optimization for cooperative agents: application to formation flight , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[31]  C. Tomlin,et al.  Decentralized optimization, with application to multiple aircraft coordination , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..

[32]  Timothy W. McLain,et al.  Aerial rendezvous of small unmanned aircraft using a passive towed cable system , 2014 .

[33]  Kamesh Subbarao,et al.  Nonlinear Guidance and Consensus for Unmanned Vehicles with Time Varying Connection Topologies , 2011 .

[34]  Wenwu Yu,et al.  Distributed leader-follower flocking control for multi-agent dynamical systems with time-varying velocities , 2010, Syst. Control. Lett..

[35]  P. Dutta A Folk Theorem for Stochastic Games , 1995 .

[36]  Wenwu Yu,et al.  An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination , 2012, IEEE Transactions on Industrial Informatics.

[37]  Yuanqing Xia,et al.  Distributed MPC for formation of multi-agent systems with collision avoidance and obstacle avoidance , 2017, J. Frankl. Inst..

[38]  Wei Lin,et al.  Distributed UAV formation control using differential game approach , 2014 .

[39]  Debasish Ghose,et al.  Sliding Mode Control-Based Autopilots for Leaderless Consensus of Unmanned Aerial Vehicles , 2014, IEEE Transactions on Control Systems Technology.

[40]  Francesco Borrelli,et al.  Decentralized receding horizon control for large scale dynamically decoupled systems , 2009, Autom..

[41]  J. Nash Two-Person Cooperative Games , 1953 .

[42]  Isabelle Fantoni,et al.  Flocking of multiple Unmanned Aerial Vehicles by LQR control , 2014, 2014 International Conference on Unmanned Aircraft Systems (ICUAS).

[43]  Weihua Zhao,et al.  Quadcopter formation flight control combining MPC and robust feedback linearization , 2014, J. Frankl. Inst..

[44]  Mohammadreza Radmanesh,et al.  Flight formation of UAVs in presence of moving obstacles using fast-dynamic mixed integer linear programming , 2016 .

[45]  Khanh Pham,et al.  Bio-Inspired Rendezvous Strategies and Respondent Detections , 2013 .

[46]  Jingrui Zhang,et al.  Autonomous Guidance for Rendezvous Phasing Based on Special-Point-Based Maneuvers , 2015 .

[47]  Claire J. Tomlin,et al.  DECENTRALIZED OPTIMIZATION VIA NASH BARGAINING , 2004 .

[48]  Xingping Chen,et al.  Control of leader-follower formations of terrestrial UAVs , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).