Paper information - Intelligent spectrum management based on reinforcement learning schemes in cooperative cognitive radio networks

Abstract Cognitive Radio (CR) and cooperative communication are key technologies for efficiently using available unused spectrum bands (called resources) to achieve a spectrally efficient system with high throughput. To reach its full potential, however, the brain of the CR, the Cognitive Engine (CE), must be empowered with machine learning algorithms that control its operation and adapt parameters to the dynamic environment. In practical scenarios it is difficult to formulate a network model beforehand because of complex network dynamics. To address this issue, schemes based on Reinforcement Learning (RL), described as the most favorable machine learning approach, are proposed to empower the CE without forming an explicit network model. The proposed schemes for resource allocation, Comparison based Cooperative Q-Learning (CCopQL) and Comparison based Cooperative State-Action-Reward-(next) State-(next) Action (CCopSARSA), allow each CR to learn cooperatively. Cooperation among CRs takes the form of comparing and then exchanging Q-values to obtain an optimal policy. Although these schemes require information exchange among CRs, unlike independent Q-Learning and SARSA, they provide improved system performance with high system throughput. Numerical results reveal significant benefits from combining the cooperative feature with RL: both proposed schemes outperform other existing schemes in terms of system throughput, and CCopSARSA and CCopQL each converge faster than individual CR learning.
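
As a rough illustration of the ideas named in the abstract, the sketch below shows the standard tabular Q-Learning and SARSA updates, followed by one possible reading of "comparing and then exchanging Q-values". The abstract does not specify the exchange rule, so `compare_and_exchange` is an assumption made only for this example (each agent adopts the largest Q-value any agent holds for a state-action pair). The function names, the list-of-lists Q-table layout, and the `alpha` and `gamma` defaults are also hypothetical and not taken from the paper.

```python
def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the greedy action in s_next."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually chosen in s_next."""
    target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


def compare_and_exchange(q_tables):
    """ASSUMED cooperation step (not specified in the abstract):
    for every (state, action) pair, all agents adopt the largest
    Q-value held by any agent."""
    n_states = len(q_tables[0])
    n_actions = len(q_tables[0][0])
    for s in range(n_states):
        for a in range(n_actions):
            best = max(q[s][a] for q in q_tables)
            for q in q_tables:
                q[s][a] = best


# Two agents, each with 2 states and 2 actions (e.g. candidate channels).
agent_a = [[0.0, 0.0], [1.0, 2.0]]
agent_b = [[0.0, 0.0], [1.0, 2.0]]

q_learning_update(agent_a, s=0, a=0, r=1.0, s_next=1)      # uses max(Q[1]) = 2.0
sarsa_update(agent_b, s=0, a=0, r=1.0, s_next=1, a_next=0)  # uses Q[1][0] = 1.0

compare_and_exchange([agent_a, agent_b])  # both tables now agree entry by entry
```

The only difference between the two updates is the bootstrap term: Q-Learning uses the best next action regardless of what the agent does next, while SARSA uses the action actually selected. Under the assumed exchange rule, a cooperative variant would call `compare_and_exchange` periodically between local updates.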

Amandeep Kaur | Krishan Kumar
