Massive Autonomous UAV Path Planning: A Neural Network Based Mean-Field Game Theoretic Approach

This paper investigates the autonomous control of massive unmanned aerial vehicles (UAVs) for mission-critical applications (e.g., dispatching many UAVs from a source to a destination for firefighting). Achieving their fast travel and low motion energy without inter-UAV collision under wind perturbation is a daunting control task, which incurs huge communication energy for exchanging UAV states in real time. We tackle this problem by exploiting a mean-field game (MFG) theoretic control method that requires the UAV state exchanges only once at the initial source. Afterwards, each UAV can control its acceleration by locally solving two partial differential equations (PDEs), known as the Hamilton-Jacobi- Bellman (HJB) and Fokker-Planck-Kolmogorov (FPK) equations. This approach, however, brings about huge computation energy for solving the PDEs, particularly under multi-dimensional UAV states. We address this issue by utilizing a machine learning (ML) method where two separate ML models approximate the solutions of the HJB and FPK equations. These ML models are trained and exploited using an online gradient descent method with low computational complexity. Numerical evaluations validate that the proposed ML aided MFG theoretic algorithm, referred to as \emph{MFG learning control}, is effective in collision avoidance with low communication energy and acceptable computation energy.

[1]  Felipe Cucker,et al.  Avoiding Collisions in Flocks , 2010, IEEE Transactions on Automatic Control.

[2]  Mehdi Bennis,et al.  Wireless Network Intelligence at the Edge , 2018, Proceedings of the IEEE.

[3]  Federico Milano,et al.  SDE-based wind speed models with Weibull distribution and exponential autocorrelation , 2016, 2016 IEEE Power and Energy Society General Meeting (PESGM).

[4]  Edwin K. P. Chong,et al.  UAV Path Planning in a Dynamic Environment via Partially Observable Markov Decision Process , 2013, IEEE Transactions on Aerospace and Electronic Systems.

[5]  Derong Liu,et al.  Neural-Network-Based Online HJB Solution for Optimal Robust Guaranteed Cost Control of Continuous-Time Uncertain Nonlinear Systems , 2014, IEEE Transactions on Cybernetics.

[6]  Peter E. Caines,et al.  Mean Field Analysis of Controlled Cucker-Smale Type Flocking: Linear Analysis and Perturbation Equations , 2011 .

[7]  Kimon P. Valavanis,et al.  Evolutionary algorithm based offline/online path planner for UAV navigation , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[8]  Mehdi Bennis,et al.  Massive UAV-to-Ground Communication and its Stable Movement Control: A Mean-Field Approach , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[9]  Walid Saad,et al.  Efficient Deployment of Multiple Unmanned Aerial Vehicles for Optimal Wireless Coverage , 2016, IEEE Communications Letters.

[10]  J. Karl Hedrick,et al.  Autonomous UAV path planning and estimation , 2009, IEEE Robotics & Automation Magazine.

[11]  E. Ackerman,et al.  Medical delivery drones take flight in east africa , 2018, IEEE Spectrum.

[12]  Hazer Inaltekin,et al.  Virtualized Control Over Fog: Interplay Between Reliability and Latency , 2017, IEEE Internet of Things Journal.

[13]  R. Courant,et al.  On the Partial Difference Equations, of Mathematical Physics , 2015 .

[14]  Walid Saad,et al.  A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems , 2018, IEEE Communications Surveys & Tutorials.

[15]  P. Lions,et al.  Mean field games , 2007 .

[16]  Soummya Kar,et al.  Revisiting Normalized Gradient Descent: Fast Evasion of Saddle Points , 2017, IEEE Transactions on Automatic Control.

[17]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.