Reconfigurable Intelligent Surface-Assisted Multi-UAV Networks: Efficient Resource Allocation With Deep Reinforcement Learning

In this paper, we propose reconfigurable intelligent surface (RIS)-assisted unmanned aerial vehicle (UAV) networks that exploit both the agility of UAVs and the reflection capability of the RIS to enhance network performance. To maximise the energy efficiency (EE) of the considered networks, we jointly optimise the power allocation of the UAVs and the phase-shift matrix of the RIS. A deep reinforcement learning (DRL) approach is proposed for solving the continuous optimisation problem with time-varying channels in a centralised fashion. Moreover, a parallel learning approach is proposed to reduce the information transmission requirement of the centralised approach. Numerical results show that the proposed schemes significantly outperform conventional approaches in terms of EE, flexibility, and processing time. The proposed DRL methods for RIS-assisted UAV networks are suitable for real-time applications because they make decisions instantly and handle time-varying channels in dynamic environments.

Keywords: Deep reinforcement learning, multi-UAV, reconfigurable intelligent surface, resource allocation.
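To make the optimisation objective concrete, the following is a minimal sketch of how an EE value could be evaluated for a given power allocation and set of RIS phase shifts. It is not the paper's system model: the one-user-per-UAV pairing, the absence of inter-UAV interference, the constant circuit power, and all names (`energy_efficiency`, `direct`, `uav_to_ris`, `ris_to_user`, `noise_power`, `circuit_power`) are assumptions made purely for illustration.

```python
import numpy as np

def energy_efficiency(powers, thetas, direct, uav_to_ris, ris_to_user,
                      noise_power=1.0, circuit_power=1.0):
    """Illustrative EE in bits/s/Hz per watt (hypothetical model, not the paper's).

    powers:       (K,) transmit power of each UAV
    thetas:       (N,) phase shift of each RIS element, in radians
    direct:       (K,) complex direct UAV-to-user channel gains
    uav_to_ris:   (K, N) complex UAV-to-RIS channel gains
    ris_to_user:  (K, N) complex RIS-to-user channel gains
    """
    powers = np.asarray(powers, dtype=float)
    # Each RIS element applies a unit-modulus coefficient exp(j * theta_n).
    phase = np.exp(1j * np.asarray(thetas, dtype=float))
    # Cascaded UAV -> RIS -> user channel, summed over the N elements.
    reflected = np.sum(
        np.conj(np.asarray(ris_to_user)) * phase * np.asarray(uav_to_ris),
        axis=1,
    )
    effective = np.asarray(direct) + reflected
    # Assumption: interference between UAV links is ignored, so this is an SNR.
    snr = powers * np.abs(effective) ** 2 / noise_power
    rates = np.log2(1.0 + snr)
    # EE = total rate divided by total consumed power (transmit + circuit).
    return rates.sum() / (powers.sum() + circuit_power)

# Two RIS elements, one UAV-user link, no direct path.
aligned = energy_efficiency([1.0], [0.0, 0.0], [0.0], [[1.0, 1.0]], [[1.0, 1.0]])
opposed = energy_efficiency([1.0], [0.0, np.pi], [0.0], [[1.0, 1.0]], [[1.0, 1.0]])
print(aligned, opposed)
```

In this toy setting, aligned phases add the two reflected paths coherently, while opposed phases cancel them, which illustrates why the phase shifts and the transmit powers are worth optimising jointly. A DRL agent of the kind the abstract describes would treat `powers` and `thetas` as its continuous actions and a quantity like this as its reward; that mapping is likewise an assumption here.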
