Learning-Based UAV Trajectory Optimization With Collision Avoidance and Connectivity Constraints

Unmanned aerial vehicles (UAVs) are expected to be an integral part of wireless networks, and determining collision-free trajectories for multiple UAVs while satisfying requirements of connectivity with ground base stations (GBSs) is a challenging task. In this paper, we consider non-cooperative multi-UAV scenarios, in which multiple UAVs need to fly from initial locations to destinations, while satisfying collision avoidance, wireless connectivity, and kinematic constraints. We aim to find trajectories for the UAVs with the goal to minimize their mission completion time. We first formulate the multi-UAV trajectory optimization problem as a sequential decision making problem. We, then, propose a decentralized deep reinforcement learning approach to solve the problem. More specifically, a value network is developed to obtain values given the agent’s joint state (including the agent’s information, the nearby agents’ observable information, and the locations of the nearby GBSs). A signal-to-interference-plus-noise ratio (SINR)-prediction neural network is also designed, using accumulated SINR measurements obtained when interacting with the cellular network, to map the GBSs’ locations into the SINR levels in order to predict the UAV’s SINR. Numerical results show that with the value network and SINR-prediction network, real-time navigation for multi-UAVs can be efficiently performed in various environments with high success rate.

[1]  Lin X. Cai,et al.  Learning to Be Proactive: Self-Regulation of UAV Based Networks With UAV and User Dynamics , 2021, IEEE Transactions on Wireless Communications.

[2]  Xueyuan Wang,et al.  Learning-Based UAV Trajectory Optimization with Collision Avoidance and Connectivity Constraints , 2021, ICC 2021 - IEEE International Conference on Communications.

[3]  Shuowen Zhang,et al.  Radio Map-Based 3D Path Planning for Cellular-Connected UAV , 2019, IEEE Transactions on Wireless Communications.

[4]  M. Caccamo,et al.  Multi-UAV Path Planning for Wireless Data Harvesting With Deep Reinforcement Learning , 2020, IEEE Open Journal of the Communications Society.

[5]  Rui Zhang,et al.  Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV With Deep Reinforcement Learning , 2020, IEEE Transactions on Wireless Communications.

[6]  Elvino S. Sousa,et al.  A Double Q-Learning Approach for Navigation of Aerial Vehicles with Connectivity Constraint , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[7]  Yu Cheng,et al.  Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA , 2020, IEEE Journal on Selected Areas in Communications.

[8]  Elvino S. Sousa,et al.  Power Efficient Trajectory Optimization for the Cellular-Connected Aerial Vehicles , 2019, 2019 IEEE 30th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[9]  Yuanwei Liu,et al.  Non-Orthogonal Multiple Access for Air-to-Ground Communication , 2019, IEEE Transactions on Communications.

[10]  Sofie Pollin,et al.  Cellular Coverage-Aware Path Planning for UAVs , 2019, 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[11]  Khaled Ben Letaief,et al.  Connectivity-Aware UAV Path Planning with Aerial Coverage Maps , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[12]  Qingqing Wu,et al.  Accessing From the Sky: A Tutorial on UAV Communications for 5G and Beyond , 2019, Proceedings of the IEEE.

[13]  Shuowen Zhang,et al.  Trajectory Optimization for Cellular-Connected UAV Under Outage Duration Constraint , 2019, J. Commun. Inf. Networks.

[14]  Xiao Liu,et al.  Trajectory Design and Power Control for Multi-UAV Assisted Wireless Networks: A Machine Learning Approach , 2018, IEEE Transactions on Vehicular Technology.

[15]  Alexandre Alahi,et al.  Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[16]  Ismail Güvenç,et al.  Impact of 3D UWB Antenna Radiation Pattern on Air-to-Ground Drone Connectivity , 2018, 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall).

[17]  Ismail Güvenç,et al.  Trajectory Optimization for Cellular-Connected UAVs with Disconnectivity Constraint , 2018, 2018 IEEE International Conference on Communications Workshops (ICC Workshops).

[18]  Shuowen Zhang,et al.  Cellular-Enabled UAV Communication: A Connectivity-Constrained Trajectory Optimization Perspective , 2018, IEEE Transactions on Communications.

[19]  Jonathan P. How,et al.  Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  Sofie Pollin,et al.  Cellular Connectivity for UAVs: Network Modeling, Performance Analysis, and Design Guidelines , 2018, IEEE Transactions on Wireless Communications.

[21]  Rui Zhang,et al.  Cellular-Connected UAV: Potential, Challenges, and Promising Technologies , 2018, IEEE Wireless Communications.

[22]  Walid Saad,et al.  Interference Management for Cellular-Connected UAVs: A Deep Reinforcement Learning Approach , 2018, IEEE Transactions on Wireless Communications.

[23]  Xiaoli Xu,et al.  Trajectory Design for Completion Time Minimization in UAV-Enabled Multicasting , 2018, IEEE Transactions on Wireless Communications.

[24]  Songyang Lao,et al.  Collision Avoidance for Cooperative UAVs with Rolling Optimization Algorithm Based on Predictive State Space , 2017 .

[25]  Jonathan P. How,et al.  Socially aware motion planning with deep reinforcement learning , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[26]  J. R. Morrison,et al.  Rolling Horizon Path Planning of an Autonomous System of UAVs for Persistent Cooperative Service: MILP Formulation and Efficient Heuristics , 2016, J. Intell. Robotic Syst..

[27]  Wolfram Burgard,et al.  Socially compliant mobile robot navigation via inverse reinforcement learning , 2016, Int. J. Robotics Res..

[28]  Rui Zhang,et al.  Energy-Efficient UAV Communication With Trajectory Optimization , 2016, IEEE Transactions on Wireless Communications.

[29]  Srikanth Saripalli,et al.  Collision avoidance for UAVs using reachable sets , 2015, 2015 International Conference on Unmanned Aircraft Systems (ICUAS).

[30]  Mykel J. Kochenderfer,et al.  Aircraft Collision Avoidance Using Monte Carlo Real-Time Belief Space Search , 2011, J. Intell. Robotic Syst..

[31]  Dinesh Manocha,et al.  The Hybrid Reciprocal Velocity Obstacle , 2011, IEEE Transactions on Robotics.

[32]  Mykel J. Kochenderfer,et al.  Unmanned Aircraft Collision Avoidance using Continuous-State POMDPs , 2011, Robotics: Science and Systems.

[33]  Dinesh Manocha,et al.  Reciprocal n-Body Collision Avoidance , 2011, ISRR.

[34]  Leslie Pack Kaelbling,et al.  Collision Avoidance for Unmanned Aircraft using Markov Decision Processes , 2010 .

[35]  Ian Postlethwaite,et al.  Multi-UAV path planning in obstacle rich environments using Rapidly-exploring Random Trees , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[36]  Dinesh Manocha,et al.  Reciprocal Velocity Obstacles for real-time multi-agent navigation , 2008, 2008 IEEE International Conference on Robotics and Automation.

[37]  R. Pesenti,et al.  Multiple UAV cooperative path planning via neuro-dynamic programming , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[38]  Timothy W. McLain,et al.  Multiple UAV cooperative search under collision avoidance and limited range communication constraints , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[39]  Jianqiao Yu,et al.  Path Planning for Multi-UAV Formation , 2015, J. Intell. Robotic Syst..