Joint User, Channel, Modulation-Coding Selection, and RIS Configuration for Jamming Resistance in Multiuser OFDMA Systems

Reconfigurable intelligent surfaces (RISs) can potentially combat jamming. It is non-trivial to perform holistic selections of users, data streams, and modulation-coding modes for all subchannels, and RIS configuration in a downlink multiuser OFDMA system under jamming attacks, because of a mixed-integer program nature and difficulties in acquiring the channel state information (CSI) of the channels to and from the RIS and from an uncooperative jammer. We propose a new deep reinforcement learning (DRL)-based approach that learns through changes in the data rates of the users to reject jamming and maximize the sum rate. The key idea is to decouple the continuous RIS configuration from the discrete selections of users, data streams, subchannels, and modulation-coding modes. Another critical aspect is that we show the optimal selections almost surely follow a winner-takes-all strategy. Accordingly, the new DRL framework learns the RIS configuration with a twin-delayed deep deterministic policy gradient and takes the winner-takes-all strategy to evaluate the reward, thereby reducing the action space and accelerating learning. Simulations show the framework converges fast and fulfills the benefit of the RIS. With no need for the CSI of the channels to and from the RIS and from the jammer, the framework offers practical value.

[1]  Chih-Peng Li,et al.  DRL-Based Resource Allocation for Computation Offloading in IoV Networks , 2022, IEEE Transactions on Industrial Informatics.

[2]  Arunselvan Ramaswamy,et al.  3DPG: Distributed Deep Deterministic Policy Gradient Algorithms for Networked Multi-Agent Systems , 2022, 2201.00570.

[3]  Chau Yuen,et al.  Scalable Channel Estimation and Reflection Optimization for Reconfigurable Intelligent Surface-Enhanced OFDM Systems , 2021, IEEE Wireless Communications Letters.

[4]  Symeon Chatzinotas,et al.  Intelligent Reflecting Surface Enhanced Secure Transmission Against Both Jamming and Eavesdropping Attacks , 2021, IEEE Transactions on Vehicular Technology.

[5]  Josep Miquel Jornet,et al.  ADAPT: An Adaptive Directional Antenna Protocol for medium access control in Terahertz communication networks , 2021, Ad Hoc Networks.

[6]  Mengnan Jian,et al.  Baseband Signal Processing for Terahertz: Waveform Design, Modulation and Coding , 2021, 2021 International Wireless Communications and Mobile Computing (IWCMC).

[7]  Yang Yi,et al.  Deep Echo State Q-Network (DEQN) and Its Application in Dynamic Spectrum Sharing for 5G and Beyond , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Liang Xiao,et al.  Reinforcement Learning-Based Mobile Offloading for Edge Computing Against Jamming and Interference , 2020, IEEE Transactions on Communications.

[9]  Qian Liu,et al.  Intelligent Reflecting Surface Enhanced Wideband MIMO-OFDM Communications: From Practical Model to Reflection Optimization , 2020, IEEE Transactions on Communications.

[10]  Yiyang Pei,et al.  Robust Beamforming and Phase Shift Design for IRS-Enhanced Multi-User MISO Downlink Communication , 2020, ICC 2020 - 2020 IEEE International Conference on Communications (ICC).

[11]  Dinh Thai Hoang,et al.  Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications , 2020, ArXiv.

[12]  Yonggang Wen,et al.  DeepComfort: Energy-Efficient Thermal Comfort Control in Buildings Via Reinforcement Learning , 2020, IEEE Internet of Things Journal.

[13]  Yuguang Fang,et al.  Augmenting Transmission Environments for Better Communications: Tunable Reflector Assisted MmWave WLANs , 2020, IEEE Transactions on Vehicular Technology.

[14]  Mohamed-Slim Alouini,et al.  Smart Radio Environments Empowered by Reconfigurable Intelligent Surfaces: How it Works, State of Research, and Road Ahead , 2020, ArXiv.

[15]  Jun Zhao,et al.  Intelligent Reflecting Surface Assisted Anti-Jamming Communications: A Fast Reinforcement Learning Approach , 2020, IEEE Transactions on Wireless Communications.

[16]  Liang Xiao,et al.  Deep Reinforcement Learning-Based Intelligent Reflecting Surface for Secure Wireless Communications , 2020, IEEE Transactions on Wireless Communications.

[17]  Kezhi Wang,et al.  Artificial-Noise-Aided Secure MIMO Wireless Communications via Intelligent Reflecting Surface , 2020, IEEE Transactions on Communications.

[18]  Pei Xiao,et al.  Intelligent Reflecting Surface Aided Multi-Antenna Secure Transmission , 2020, IEEE Wireless Communications Letters.

[19]  Linglong Dai,et al.  Two-Timescale Channel Estimation for Reconfigurable Intelligent Surface Aided Wireless Communications , 2019, IEEE Transactions on Communications.

[20]  Derrick Wing Kwan Ng,et al.  Robust and Secure Wireless Communications via Intelligent Reflecting Surfaces , 2019, IEEE Journal on Selected Areas in Communications.

[21]  Rui Zhang,et al.  Intelligent Reflecting Surface-Enhanced OFDM: Channel Estimation and Reflection Optimization , 2019, IEEE Wireless Communications Letters.

[22]  Wenfeng Zheng,et al.  Twin-Delayed DDPG: A Deep Reinforcement Learning Technique to Model a Continuous Movement of an Intelligent Robot Agent , 2019, ICVISP.

[23]  Mohamed-Slim Alouini,et al.  Wireless Communications Through Reconfigurable Intelligent Surfaces , 2019, IEEE Access.

[24]  Rui Zhang,et al.  Secure Wireless Communication via Intelligent Reflecting Surface , 2019, IEEE Wireless Communications Letters.

[25]  Wei Xu,et al.  Secrecy Rate Maximization for Intelligent Reflecting Surface Assisted Multi-Antenna Communications , 2019, IEEE Communications Letters.

[26]  Qiang Ni,et al.  Interference-Aware Radio Resource Allocation for 5G Ultra-Reliable Low-Latency Communication , 2018, 2018 IEEE Globecom Workshops (GC Wkshps).

[27]  Qingqing Wu,et al.  Intelligent Reflecting Surface Enhanced Wireless Network via Joint Active and Passive Beamforming , 2018, IEEE Transactions on Wireless Communications.

[28]  Herke van Hoof,et al.  Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[29]  Chunlin Chen,et al.  A novel DDPG method with prioritized experience replay , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[30]  Ming Li,et al.  Jamming Resilient Communication Using MIMO Interference Cancellation , 2016, IEEE Transactions on Information Forensics and Security.

[31]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[32]  R. Michael Buehrer,et al.  Optimal Jamming Against Digital Modulation , 2015, IEEE Transactions on Information Forensics and Security.

[33]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[34]  Xin Wang,et al.  Optimal chunk-based resource allocation for OFDMA systems with multiple BER requirements , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[35]  Hiroyo Ogawa,et al.  Implementation of an OFDM baseband with adaptive modulations to grouped subcarriers for millimeter-wave wireless indoor networks , 2011, IEEE Transactions on Consumer Electronics.

[36]  S. Aissa,et al.  Adaptive modulation using orthogonal STBC in MIMO Nakagami fading channels , 2004, Eighth IEEE International Symposium on Spread Spectrum Techniques and Applications - Programme and Book of Abstracts (IEEE Cat. No.04TH8738).

[37]  D. Lenz,et al.  Singular Spectrum of Lebesgue Measure Zero¶for One-Dimensional Quasicrystals , 2001, math-ph/0106012.

[38]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[39]  H. Viswanathan,et al.  Capacity of Markov channels with receiver CSI and delayed feedback , 1998, Proceedings. 1998 IEEE International Symposium on Information Theory (Cat. No.98CH36252).

[40]  Andrea J. Goldsmith,et al.  Adaptive coded modulation for fading channels , 1997, Proceedings of ICC'97 - International Conference on Communications.

[41]  Jiancun Fan,et al.  Robust Secure Beamforming for Intelligent Reflecting Surface Assisted Full-Duplex MISO Systems , 2022, IEEE Transactions on Information Forensics and Security.

[42]  Qiang Li,et al.  Distributionally Robust Secure Multicast Beamforming With Intelligent Reflecting Surface , 2021, IEEE Transactions on Information Forensics and Security.

[43]  R. Bass,et al.  Review: P. Billingsley, Convergence of probability measures , 1971 .