A DDPG-based Approach for Energy-aware UAV Navigation in Obstacle-constrained Environment

In this paper, we propose a three-dimensional autonomous UAV navigation framework using Deep Deterministic Policy Gradient (DDPG) learning approach. The objective is to employ a self-trained UAV as an airborne Internet of Things (IoT) unit to navigate obstacles and reach a destination point, where it can communicate with a ground sensor node with sufficiently high data rate. We develop a customized reward function which aims to minimize the distance separating the UAV and its destination while penalizing collisions. A dynamic energy threshold is also set to redirect the UAV towards the charging station in case of battery depletion. We numerically simulate the behavior of the UAV when learning the environmental obstacles, and autonomously selecting trajectories for selected scenarios. Finally, we show that our learning approach achieves close performance to the one of the graph-based Dijkstra's algorithm.

[1]  Abolfazl Razi,et al.  A Path Planning Algorithm for Collective Monitoring Using Autonomous Drones , 2019, 2019 53rd Annual Conference on Information Sciences and Systems (CISS).

[2]  Hakim Ghazzai,et al.  A Generic Spatiotemporal Scheduling for Autonomous UAVs: A Reinforcement Learning-Based Approach , 2020, IEEE Open Journal of Vehicular Technology.

[3]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[4]  Chi Harold Liu,et al.  Energy-Efficient Distributed Mobile Crowd Sensing: A Deep Learning Approach , 2019, IEEE Journal on Selected Areas in Communications.

[5]  Sherin Abdelhamid UAV Path Planning for Emergency Management in IoT , 2018, 2018 IEEE International Conference on Communications Workshops (ICC Workshops).

[6]  Kandeepan Sithamparanathan,et al.  Optimal LAP Altitude for Maximum Coverage , 2014, IEEE Wireless Communications Letters.

[7]  Abbas Harifi,et al.  UAV Path Planning for Data Gathering of IoT Nodes: Ant Colony or Simulated Annealing Optimization , 2019, 2019 3rd International Conference on Internet of Things and Applications (IoT).

[8]  Halima Elbiaze,et al.  A Queuing Theoretic Framework for Flying Mesh Network Assisted IoT Environments , 2019, 2019 IEEE 5th World Forum on Internet of Things (WF-IoT).

[9]  Jie Xu,et al.  Energy Minimization for Wireless Communication With Rotary-Wing UAV , 2018, IEEE Transactions on Wireless Communications.

[10]  Hendrik F. Hamann,et al.  Drone-based reconstruction for 3D geospatial data processing , 2016, 2016 IEEE 3rd World Forum on Internet of Things (WF-IoT).

[11]  Tao Jiang,et al.  Path planning under obstacle-avoidance constraints based on ant colony optimization algorithm , 2017, 2017 IEEE 17th International Conference on Communication Technology (ICCT).

[12]  Hakim Ghazzai,et al.  A Generic Spatiotemporal UAV Scheduling Framework for Multi-Event Applications , 2019, IEEE Access.

[13]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[14]  Hakim Ghazzai,et al.  Q-learning based Routing Scheduling For a Multi-Task Autonomous Agent , 2019, 2019 IEEE 62nd International Midwest Symposium on Circuits and Systems (MWSCAS).