Multi-Agent Deep Reinforcement Learning For Optimising Energy Efficiency of Fixed-Wing UAV Cellular Access Points

Unmanned Aerial Vehicles (UAVs) promise to become an intrinsic part of next-generation communications, as they can be deployed to provide wireless connectivity to ground users and supplement existing terrestrial networks. Most existing research into UAV access points for cellular coverage considers rotary-wing designs such as quadcopters. However, we expect fixed-wing UAVs to be more appropriate for connectivity in scenarios where long flight times are necessary (such as rural coverage), as they rely on a more energy-efficient form of flight than rotary-wing designs. Because fixed-wing UAVs are typically incapable of hovering in place, optimising their deployment means optimising their individual flight trajectories so that they deliver high-quality service to ground users in an energy-efficient manner. In this paper, we propose a multi-agent deep reinforcement learning approach to optimise the energy efficiency of fixed-wing UAV cellular access points while still allowing them to deliver high-quality service to users on the ground. In our decentralised approach, each UAV is equipped with a Dueling Deep Q-Network (DDQN) agent which can adjust the 3D trajectory of the UAV over a series of timesteps. By coordinating with their neighbours, the UAVs adjust their individual flight trajectories in a manner that optimises the total system energy efficiency. We benchmark the performance of our approach against a series of heuristic trajectory planning strategies, and demonstrate that our method can improve the system energy efficiency by as much as 70%.
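
As a rough illustration of the dueling architecture named above, the sketch below shows the standard mean-subtracted aggregation from the dueling network literature, in which a scalar state value and per-action advantages are combined into Q-values. It is not the authors' implementation: the function names, the three-action example, and the numbers are hypothetical, and in a real agent both inputs would come from the two output heads of a neural network.

```python
from statistics import fmean


def dueling_q_values(state_value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a) = V(s) + A(s, a) - mean_a A(s, a).

    Subtracting the mean advantage keeps the value/advantage split
    identifiable. Both inputs are plain numbers here (an assumption made
    for illustration); a real agent would take them from network outputs.
    """
    mean_advantage = fmean(advantages)
    return [state_value + a - mean_advantage for a in advantages]


def greedy_action(q_values):
    """Return the index of the action with the largest Q-value."""
    return max(range(len(q_values)), key=q_values.__getitem__)


# Hypothetical example: three candidate trajectory adjustments for one UAV.
q = dueling_q_values(2.0, [1.0, 0.0, -1.0])
best = greedy_action(q)
```

In a decentralised setting such as the one described, each UAV would presumably run its own copy of this selection step on its local observations; how the agents exchange information with their neighbours is not specified in the abstract and is left out of the sketch.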
