Intelligent spectrum management based on reinforcement learning schemes in cooperative cognitive radio networks

Abstract Cognitive Radio (CR) and cooperative communication are key technologies for efficiently utilizing available unused spectrum bands (called resources) to achieve a spectrally efficient system with high throughput. To reach its full potential, however, the brain of the CR, the Cognitive Engine (CE), must be empowered with machine learning algorithms that control its operation and adapt its parameters to the dynamic environment. In practical scenarios, it is difficult to formulate a network model beforehand because of complex network dynamics. To address this issue, Reinforcement Learning (RL) based schemes are proposed to empower the CE without forming an explicit network model. The proposed schemes for resource allocation, Comparison based Cooperative Q-Learning (CCopQL) and Comparison based Cooperative State-Action-Reward-(next) State-(next) Action (CCopSARSA), allow each CR to learn cooperatively. The cooperation among CRs takes the form of comparing and then exchanging Q-values to obtain an optimal policy. Although these schemes require information exchange among CRs, unlike independent Q-Learning and SARSA, they improve system performance and achieve high system throughput. Numerical results reveal significant benefits from combining the cooperative feature with RL: both proposed schemes, CCopSARSA and CCopQL, outperform existing schemes in terms of system throughput and converge faster than individual CR learning.
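To make the learning rules named in the abstract concrete, the following is a minimal sketch of the standard tabular Q-Learning and SARSA updates, followed by one possible "compare then exchange" step. The abstract does not specify the comparison rule, so the element-wise maximum used in `compare_and_exchange` is an assumption made for illustration, as are the function names, the table sizes, and the values of the learning rate `alpha` and discount factor `gamma`. It should not be read as the authors' implementation.

```python
import numpy as np


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard off-policy update: bootstrap from the best action in s_next."""
    target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """Standard on-policy update: bootstrap from the action actually taken next."""
    target = r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (target - Q[s, a])


def compare_and_exchange(q_tables):
    """Hypothetical cooperation step (assumption, not from the paper):
    each CR compares its Q-values with the others' and adopts the larger
    value for every (state, action) pair."""
    best = np.max(np.stack(q_tables), axis=0)
    return [best.copy() for _ in q_tables]


# Two CRs, each with 2 states and 3 actions (illustrative sizes only).
Q_a = np.zeros((2, 3))
Q_b = np.zeros((2, 3))

# Each CR learns from its own experience with Q-Learning.
q_learning_update(Q_a, s=0, a=1, r=1.0, s_next=1)
q_learning_update(Q_b, s=1, a=2, r=2.0, s_next=0)

# Cooperative step: after the exchange both CRs hold both learned values.
Q_a, Q_b = compare_and_exchange([Q_a, Q_b])

# SARSA on a separate table, to show how the two targets differ.
Q_s = np.zeros((2, 3))
Q_s[1, 0] = 1.0
sarsa_update(Q_s, s=0, a=1, r=1.0, s_next=1, a_next=0)
```

In this sketch the exchange lets each CR benefit from state-action pairs it has not visited itself, which is the intuition behind the faster convergence reported for the cooperative schemes. The cost is the extra signalling needed to share Q-values among CRs.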
