Energy Efficient 3-D UAV Control for Persistent Communication Service and Fairness: A Deep Reinforcement Learning Approach

Recently, unmanned aerial vehicles (UAVs) as flying wireless communication platform have attracted much attention. Benefiting from the mobility, UAV aerial base stations can be deployed quickly and flexibly, and can effectively establish Line-of-Sight communication links. However, there are many challenges in UAV communication system. The first challenge is energy constraint, where the UAV battery lifetime is in the order of fraction of an hour. The second challenge is that the coverage area of UAV aerial base station is limited and the commercial UAV is usually expensive. Thus, covering a large target region all the time with sufficient UAVs is quite challenging. To solve above challenges, in this paper, we propose energy efficient and fair 3-D UAV scheduling with energy replenishment, where UAVs move around to serve users and recharge timely to replenish energy. Inspired by the success of deep reinforcement learning, we propose a UAV Control policy based on Deep Deterministic Policy Gradient (UC-DDPG) to address the combination problem of 3-D mobility of multiple UAVs and energy replenishment scheduling, which ensures energy efficient and fair coverage of each user in a large region and maintains the persistent service. Simulation results reveal that UC-DDPG shows a good convergence and outperforms other scheduling algorithms in terms of data volume, energy efficiency and fairness.

[1]  Walid Saad,et al.  Efficient Deployment of Multiple Unmanned Aerial Vehicles for Optimal Wireless Coverage , 2016, IEEE Communications Letters.

[2]  Ying-Chang Liang,et al.  Deep Reinforcement Learning for Multi-User Access Control in UAV Networks , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[3]  Luciano Bononi,et al.  Fly and recharge: Achieving persistent coverage using Small Unmanned Aerial Vehicles (SUAVs) , 2017, 2017 IEEE International Conference on Communications (ICC).

[4]  Jie Xu,et al.  Energy Beamforming With One-Bit Feedback , 2013, IEEE Transactions on Signal Processing.

[5]  David Hyunchul Shim,et al.  Persistent UAV Service: An Improved Scheduling Formulation and Prototypes of System Components , 2013, 2013 International Conference on Unmanned Aircraft Systems (ICUAS).

[6]  Kandeepan Sithamparanathan,et al.  Optimal LAP Altitude for Maximum Coverage , 2014, IEEE Wireless Communications Letters.

[7]  Sofie Pollin,et al.  Ultra Reliable UAV Communication Using Altitude and Cooperation Diversity , 2017, IEEE Transactions on Communications.

[8]  Michael Achtelik,et al.  Design of a flexible high performance quadcopter platform breaking the MAV endurance record with laser power beaming , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Issa M. Khalil,et al.  On the continuous coverage problem for a swarm of UAVs , 2016, 2016 IEEE 37th Sarnoff Symposium.

[10]  Lav Gupta,et al.  Survey of Important Issues in UAV Communication Networks , 2016, IEEE Communications Surveys & Tutorials.

[11]  Khaled Ben Letaief,et al.  Beyond Empirical Models: Pattern Formation Driven Placement of UAV Base Stations , 2017, IEEE Transactions on Wireless Communications.

[12]  Sofie Pollin,et al.  Joint Sum-Rate and Power Gain Analysis of an Aerial Base Station , 2016, 2016 IEEE Globecom Workshops (GC Wkshps).

[13]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[14]  Anjan Chakrabarty,et al.  Energy Maps for Long-Range Path Planning for Small- and Micro - UAVs , 2009 .

[15]  Walid Saad,et al.  Caching in the Sky: Proactive Deployment of Cache-Enabled Unmanned Aerial Vehicles for Optimized Quality-of-Experience , 2016, IEEE Journal on Selected Areas in Communications.

[16]  Rui Zhang,et al.  Placement Optimization of UAV-Mounted Mobile Base Stations , 2016, IEEE Communications Letters.

[17]  Halim Yanikomeroglu,et al.  On the Number and 3D Placement of Drone Base Stations in Wireless Cellular Networks , 2016, 2016 IEEE 84th Vehicular Technology Conference (VTC-Fall).

[18]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[19]  S. O. Rice,et al.  Statistical properties of a sine wave plus random noise , 1948, Bell Syst. Tech. J..

[20]  Tarik Taleb,et al.  A green strategic activity scheduling for UAV networks: A sub-modular game perspective , 2016, IEEE Communications Magazine.

[21]  Rui Zhang,et al.  Energy-Efficient UAV Communication With Trajectory Optimization , 2016, IEEE Transactions on Wireless Communications.

[22]  J.W. Langelaan Tree-based trajectory planning to exploit atmospheric energy , 2008, 2008 American Control Conference.

[23]  Abdallah Khreishah,et al.  Providing wireless coverage to high-rise buildings using UAVs , 2017, 2017 IEEE International Conference on Communications (ICC).

[24]  Weidong Wang,et al.  Deep Reinforcement Learning Based Dynamic Channel Allocation Algorithm in Multibeam Satellite Systems , 2018, IEEE Access.

[25]  Chi Harold Liu,et al.  Energy-Efficient UAV Control for Effective and Fair Communication Coverage: A Deep Reinforcement Learning Approach , 2018, IEEE Journal on Selected Areas in Communications.

[26]  Wei Ni,et al.  Capacity Analysis of UAV Communications: Cases of Random Trajectories , 2018, IEEE Transactions on Vehicular Technology.

[27]  Gang Cao,et al.  AIF: An Artificial Intelligence Framework for Smart Wireless Network Management , 2018, IEEE Communications Letters.

[28]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.

[29]  Antonios Tsourdos,et al.  Energy Management in Swarm of Unmanned Aerial Vehicles , 2013, 2013 International Conference on Unmanned Aircraft Systems (ICUAS).

[30]  Mohamed-Slim Alouini,et al.  A unified approach to the performance analysis of digital communication over generalized fading channels , 1998, Proc. IEEE.

[31]  Xiao Liu,et al.  Reinforcement Learning in Multiple-UAV Networks: Deployment and Movement Design , 2019, IEEE Transactions on Vehicular Technology.

[32]  D. B. D. Costa,et al.  Capacity analysis of cooperative systems with relay selection in Nakagami-m fading , 2009, IEEE Commun. Lett..

[33]  Sofie Pollin,et al.  Micro aerial vehicle networks: an experimental analysis of challenges and opportunities , 2014, IEEE Communications Magazine.

[34]  Zhisheng Niu,et al.  DeepNap: Data-Driven Base Station Sleeping Operations Through Deep Reinforcement Learning , 2018, IEEE Internet of Things Journal.

[35]  Halim Yanikomeroglu,et al.  3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage , 2017, IEEE Wireless Communications Letters.

[36]  Dimitrios Zorbas,et al.  Modelling the mobile target covering problem using flying drones , 2016, Optim. Lett..

[37]  Rui Zhang,et al.  Wireless communications with unmanned aerial vehicles: opportunities and challenges , 2016, IEEE Communications Magazine.

[38]  Sebastian Magierowski,et al.  Vehicle Routing Problems for Drone Delivery , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[39]  Wei Zheng,et al.  Artificial Intelligence-Based Handoff Management for Dense WLANs: A Deep Reinforcement Learning Approach , 2019, IEEE Access.

[40]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[41]  Grzegorz Bocewicz,et al.  Energy Consumption in Unmanned Aerial Vehicles: A Review of Energy Consumption Models and Their Relation to the UAV Routing , 2018, Advances in Intelligent Systems and Computing.

[42]  James R. Morrison,et al.  On the Scheduling of Systems of UAVs and Fuel Service Stations for Long-Term Mission Fulfillment , 2013, J. Intell. Robotic Syst..

[43]  Walid Saad,et al.  Mobile Unmanned Aerial Vehicles (UAVs) for Energy-Efficient Internet of Things Communications , 2017, IEEE Transactions on Wireless Communications.

[44]  Ryan N. Smith,et al.  Wind-energy based path planning for Unmanned Aerial Vehicles using Markov Decision Processes , 2013, 2013 IEEE International Conference on Robotics and Automation.

[45]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[46]  Giorgio C. Buttazzo,et al.  Energy-Aware Coverage Path Planning of UAVs , 2015, 2015 IEEE International Conference on Autonomous Robot Systems and Competitions.

[47]  Halim Yanikomeroglu,et al.  Efficient 3-D placement of an aerial base station in next generation cellular networks , 2016, 2016 IEEE International Conference on Communications (ICC).

[48]  Ian F. Akyildiz,et al.  Help from the Sky: Leveraging UAVs for Disaster Management , 2017, IEEE Pervasive Computing.

[49]  Walid Saad,et al.  Wireless Communication Using Unmanned Aerial Vehicles (UAVs): Optimal Transport Theory for Hover Time Optimization , 2017, IEEE Transactions on Wireless Communications.

[50]  Luciano Bononi,et al.  Joint Coverage, Connectivity, and Charging Strategies for Distributed UAV Networks , 2018, IEEE Transactions on Robotics.

[51]  Taoka Hidekazu,et al.  Scenarios for 5G mobile and wireless communications: the vision of the METIS project , 2014, IEEE Communications Magazine.

[52]  Robert Sowah,et al.  Rotational energy harvesting to prolong flight duration of quadcopters , 2017, 2015 IEEE Industry Applications Society Annual Meeting.

[53]  Rui Zhang,et al.  Throughput Maximization for UAV-Enabled Mobile Relaying Systems , 2016, IEEE Transactions on Communications.

[54]  Walid Saad,et al.  Optimal transport theory for power-efficient deployment of unmanned aerial vehicles , 2016, 2016 IEEE International Conference on Communications (ICC).