Trajectory Optimization for UAV Emergency Communication With Limited User Equipment Energy: A Safe-DQN Approach

In post-disaster scenarios, it is challenging to provide reliable and flexible emergency communications, especially when the mobile infrastructure is seriously damaged. This article investigates unmanned aerial vehicle (UAV)-based emergency communication networks, in which a UAV serves as a mobile aerial base station that collects information from ground users in affected areas. Because the ground power system breaks down after a disaster, the energy available to affected user equipment (UE) is limited. In addition, the complex post-disaster terrain contains obstacles that restrict the UAV's flight. To maximize the uplink throughput of the UAV network over the flying time, we formulate a UAV trajectory optimization problem that accounts for the UE energy limitation and the locations of obstacles. Since the constraint on UE energy is dynamic and accumulates over the long term, the problem is hard to solve directly. We therefore transform it into a constrained Markov decision-making process (CMDP) with the UAV as the agent. To tackle the CMDP, we propose a safe-deep-Q-network (safe-DQN)-based UAV trajectory design algorithm, in which the UAV learns to select the optimal action within reasonable policy sets. Simulation results reveal that: 1) the uplink throughput of the proposed algorithm converges within multiple iterations and 2) compared with the benchmark algorithms, the proposed algorithm performs better in terms of uplink throughput and UE energy efficiency, achieving a good trade-off between UE energy consumption and uplink throughput.
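
The abstract does not specify how the safe-DQN restricts the UAV's choices, so the following is only a minimal sketch of one common way to pick the best action within a restricted set: mask out moves that would leave the area or hit an obstacle, then act epsilon-greedily over the remaining Q-values. The grid world, the `ACTIONS` list, and the function names `safe_actions` and `select_action` are all hypothetical and are not taken from the paper; the UE energy constraint is not modeled here.

```python
import random

# Hypothetical action set (assumption): unit moves on a 2-D grid, plus hover.
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0), (0, 0)]


def safe_actions(pos, obstacles, grid_size):
    """Return indices of actions that keep the UAV on the grid and off obstacles."""
    allowed = []
    for i, (dx, dy) in enumerate(ACTIONS):
        nx, ny = pos[0] + dx, pos[1] + dy
        on_grid = 0 <= nx < grid_size and 0 <= ny < grid_size
        if on_grid and (nx, ny) not in obstacles:
            allowed.append(i)
    return allowed


def select_action(q_values, allowed, epsilon, rng=random):
    """Epsilon-greedy choice restricted to the allowed (safe) action indices."""
    if not allowed:
        raise ValueError("no safe action available")
    if rng.random() < epsilon:
        # Explore, but only among safe actions.
        return rng.choice(allowed)
    # Exploit: highest Q-value among safe actions.
    return max(allowed, key=lambda i: q_values[i])


if __name__ == "__main__":
    # Illustrative values only: UAV in a corner with an obstacle to its right.
    allowed = safe_actions((0, 0), {(1, 0)}, grid_size=3)
    q = [0.1, 9.0, 9.0, 9.0, 0.5]  # placeholder Q-values, not learned
    print(allowed, select_action(q, allowed, epsilon=0.0))
```

In this sketch the unsafe actions are never chosen even when their Q-values are the largest, which is the sense in which the selection stays inside a restricted set. A long-term cumulative constraint such as the UE energy budget would need additional machinery (for example a cost signal tracked alongside the reward), which is left out here.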
