Optimal UAV Base Station Trajectories Using Flow-Level Models for Reinforcement Learning

Cellular base stations (BS) and remote radio heads can be mounted on unmanned aerial vehicles (UAV) for flexible, traffic-aware deployment. These UAV base station networks (UAVBSN) promise an unprecendented degree of freedom that can be exploited for spectral efficiency gains as well as optimal network utilization. However, the current literature lacks realistic radio and traffic models for UAVBSN deployment planning and for performance evaluation. In this paper, we propose flow-level models (FLM) for realistically characterizing the UAVBSN performance in terms of a broad range of flow- and system-level metrics. Further, we propose a deep reinforcement learning (DRL) approach that relies on the UAVBSN FLM for learning the optimal traffic-aware UAV trajectories. For a given user traffic density and starting UAV locations, our RL approach learns the optimal UAV trajectories offline that maximizes a cumulative performance metric. We then execute the learned UAV trajectories in a discrete event simulator to evaluate online UAVBSN performance. For ${M}\pmb {=}$ 9 UAVs deployed in a simulated Downtown San Francisco model, where the UAV trajectories are defined by ${N}\pmb {=}$ 20 discrete actions, our approach achieves approximately a three-fold increase in the average user throughput compared to the initial UAV placement, while simultaneously balancing traffic loads across the BSs.

[1]  Halim Yanikomeroglu,et al.  Efficient 3D aerial base station placement considering users mobility by reinforcement learning , 2018, 2018 IEEE Wireless Communications and Networking Conference (WCNC).

[2]  Walid Saad,et al.  Optimal transport theory for power-efficient deployment of unmanned aerial vehicles , 2016, 2016 IEEE International Conference on Communications (ICC).

[3]  Qingqing Wu,et al.  Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks , 2017, IEEE Transactions on Wireless Communications.

[4]  Halim Yanikomeroglu,et al.  Environment-Aware Drone-Base-Station Placements in Modern Metropolitans , 2018, IEEE Wireless Communications Letters.

[5]  Alejandro Aragn-Zavala,et al.  High-altitude platforms for wireless communications , 2008 .

[6]  Zhisheng Niu,et al.  Spatial modeling of Scalable Spatially-correlated Log-normal distributed traffic inhomogeneity and energy-efficient network planning , 2013, 2013 IEEE Wireless Communications and Networking Conference (WCNC).

[7]  Walid Saad,et al.  Drone Small Cells in the Clouds: Design, Deployment and Performance Analysis , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[8]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[9]  Abbas Jamalipour,et al.  Modeling air-to-ground path loss for low altitude platforms in urban environments , 2014, 2014 IEEE Global Communications Conference.

[10]  Halim Yanikomeroglu,et al.  3-D Placement of an Unmanned Aerial Vehicle Base Station for Maximum Coverage of Users With Different QoS Requirements , 2017, IEEE Wireless Communications Letters.

[11]  Paul de Kerret,et al.  Trajectory optimization for mobile access point , 2017, 2017 51st Asilomar Conference on Signals, Systems, and Computers.

[12]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[13]  Halim Yanikomeroglu,et al.  The New Frontier in RAN Heterogeneity: Multi-Tier Drone-Cells , 2016, IEEE Communications Magazine.

[14]  Walid Saad,et al.  Wireless Communication Using Unmanned Aerial Vehicles (UAVs): Optimal Transport Theory for Hover Time Optimization , 2017, IEEE Transactions on Wireless Communications.

[15]  Joonhyuk Kang,et al.  Mobile Edge Computing via a UAV-Mounted Cloudlet: Optimization of Bit Allocation and Path Planning , 2016, IEEE Transactions on Vehicular Technology.

[16]  Kandeepan Sithamparanathan,et al.  Optimal LAP Altitude for Maximum Coverage , 2014, IEEE Wireless Communications Letters.

[17]  David Gesbert,et al.  A Landing Spot Approach for Enhancing the Performance of UAV-Aided Wireless Networks , 2018, 2018 IEEE International Conference on Communications Workshops (ICC Workshops).

[18]  Michael I. Jordan,et al.  RLlib: Abstractions for Distributed Reinforcement Learning , 2017, ICML.

[19]  Paul de Kerret,et al.  Trajectory Optimization for Autonomous Flying Base Station via Reinforcement Learning , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[20]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[21]  Gerhard Fettweis,et al.  A Framework Enabling Spatial Analysis of Mobile Traffic Hot Spots , 2014, IEEE Wireless Communications Letters.

[22]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[23]  Halim Yanikomeroglu,et al.  Strategic Densification With UAV-BSs in Cellular Networks , 2018, IEEE Wireless Communications Letters.

[24]  Salman Durrani,et al.  Uplink Coverage Performance of an Underlay Drone Cell for Temporary Events , 2018, 2018 IEEE International Conference on Communications Workshops (ICC Workshops).

[25]  Victor C. M. Leung,et al.  UAV Trajectory Optimization for Data Offloading at the Edge of Multiple Cells , 2018, IEEE Transactions on Vehicular Technology.

[26]  Roberto Sabatini,et al.  UAVs Assisted Delay Optimization in Heterogeneous Wireless Networks , 2016, IEEE Communications Letters.

[27]  Walid Saad,et al.  Drone-Based Antenna Array for Service Time Minimization in Wireless Networks , 2018, 2018 IEEE International Conference on Communications (ICC).

[28]  Bülent Tavli,et al.  UAV Base Station Location Optimization for Next Generation Wireless Networks: Overview and Future Research Directions , 2018, 2019 1st International Conference on Unmanned Vehicle Systems-Oman (UVS).

[29]  Walid Saad,et al.  Communications and Control for Wireless Drone-Based Antenna Array , 2017, IEEE Transactions on Communications.

[30]  Gerhard Fettweis,et al.  A Performance Evaluation Framework for Interference-Coupled Cellular Data Networks , 2016, IEEE Transactions on Wireless Communications.

[31]  J. W. Roberts Traffic theory and the Internet , 2001 .

[32]  Xingqin Lin,et al.  An Overview of 3GPP Release-15 Study on Enhanced LTE Support for Connected Drones , 2018, IEEE Communications Standards Magazine.

[33]  Markus Gruber Role of altitude when exploring optimal placement of UAV access points , 2016, 2016 IEEE Wireless Communications and Networking Conference.

[34]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[35]  Walid Saad,et al.  Efficient Deployment of Multiple Unmanned Aerial Vehicles for Optimal Wireless Coverage , 2016, IEEE Communications Letters.

[36]  Halim Yanikomeroglu,et al.  Efficient 3-D placement of an aerial base station in next generation cellular networks , 2016, 2016 IEEE International Conference on Communications (ICC).

[37]  Li-Hsing Yen,et al.  Autonomous Deployment of UAVs as Access Points to Serve Wireless Terminals , 2017, 2017 IEEE 86th Vehicular Technology Conference (VTC-Fall).

[38]  Ryu Miura,et al.  On A Novel Adaptive UAV-Mounted Cloudlet-Aided Recommendation System for LBSNs , 2019, IEEE Transactions on Emerging Topics in Computing.

[39]  Ekram Hossain,et al.  Multi-Tier Drone Architecture for 5G/B5G Cellular Networks: Challenges, Trends, and Prospects , 2017, IEEE Communications Magazine.

[40]  Mehdi Bennis,et al.  UAV-Assisted Heterogeneous Networks for Capacity Enhancement , 2016, IEEE Communications Letters.

[41]  Halim Yanikomeroglu,et al.  Backhaul-aware robust 3D drone placement in 5G+ wireless networks , 2017, 2017 IEEE International Conference on Communications Workshops (ICC Workshops).

[42]  Halim Yanikomeroglu,et al.  3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage , 2017, IEEE Wireless Communications Letters.

[43]  Bart Custers,et al.  Drone Technology: Types, Payloads, Applications, Frequency Spectrum Issues and Future Developments , 2016 .

[44]  Rui Zhang,et al.  Placement Optimization of UAV-Mounted Mobile Base Stations , 2016, IEEE Communications Letters.

[45]  Halim Yanikomeroglu,et al.  On the Number and 3D Placement of Drone Base Stations in Wireless Cellular Networks , 2016, 2016 IEEE 84th Vehicular Technology Conference (VTC-Fall).

[46]  Shuowen Zhang,et al.  Cellular-Enabled UAV Communication: Trajectory Optimization under Connectivity Constraint , 2017, 2018 IEEE International Conference on Communications (ICC).

[47]  Rui Zhang,et al.  Energy-Efficient UAV Communication With Trajectory Optimization , 2016, IEEE Transactions on Wireless Communications.

[48]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[49]  Mohamed-Slim Alouini,et al.  FSO-Based Vertical Backhaul/Fronthaul Framework for 5G+ Wireless Networks , 2016, IEEE Communications Magazine.

[50]  Rui Zhang,et al.  Wireless communications with unmanned aerial vehicles: opportunities and challenges , 2016, IEEE Communications Magazine.

[51]  Walid Saad,et al.  Unmanned Aerial Vehicle With Underlaid Device-to-Device Communications: Performance and Tradeoffs , 2015, IEEE Transactions on Wireless Communications.

[52]  Marco A. Panduro,et al.  Design of UAVs-Based 3D Antenna Arrays for a Maximum Performance in Terms of Directivity and SLL , 2016 .

[53]  Wei Zhang,et al.  Spectrum Sharing for Drone Networks , 2017, IEEE Journal on Selected Areas in Communications.

[54]  Sergey Levine,et al.  Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).