Multichannel spectrum access based on reinforcement learning in cognitive internet of things

Abstract With the development of Internet of Things(IoT), the demands for communication spectrum have increased rapidly, resulting in the shortage of limited spectrum resources. Cognitive IoT (CIoT) based on cognitive radio (CR) can improve the spectrum utilization by accessing the idle spectrum licensed to a primary user (PU). In this paper, a multichannel spectrum access scheme based on reinforcement learning (RL) is proposed to improve the spectrum access of CIoT, wherein the CIoT can use multiple channels for transmissions to reduce the communication interruptions. The channels are ranked in the decreasing order of their predicted idle probabilities, which can make the CIoT find enough idle channels quickly via decreasing the number of sensing operations and spectrum handoffs. The simulation results show that our proposed scheme is superior to the single-channel spectrum access scheme in terms of throughput, communication interruption, average collision probability and average spectrum switching frequency.

[1]  Zhu Han,et al.  QoE-Driven Channel Allocation and Handoff Management for Seamless Multimedia in Cognitive 5G Cellular Networks , 2017, IEEE Transactions on Vehicular Technology.

[2]  Nazanin Rahnavard,et al.  A Learning-Based QoE-Driven Spectrum Handoff Scheme for Multimedia Transmissions over Cognitive Radio Networks , 2014, IEEE Journal on Selected Areas in Communications.

[3]  Kok-Lim Alvin Yau,et al.  Route Selection for Multi-Hop Cognitive Radio Networks Using Reinforcement Learning: An Experimental Study , 2016, IEEE Access.

[4]  Vishnu Raj,et al.  Spectrum Access In Cognitive Radio Using a Two-Stage Reinforcement Learning Approach , 2017, IEEE Journal of Selected Topics in Signal Processing.

[5]  Adisorn Lertsinsrubtavee,et al.  Hybrid Spectrum Sharing through Adaptive Spectrum Handoff and Selection , 2016, IEEE Transactions on Mobile Computing.

[6]  Hidetomo Ichihashi,et al.  Simple Reinforcement Learning for Small-Memory Agent , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[7]  Weidang Lu,et al.  QoS-Guarantee Resource Allocation for Multibeam Satellite Industrial Internet of Things With NOMA , 2021, IEEE Transactions on Industrial Informatics.

[8]  Petri Mähönen,et al.  Channel Selection Algorithm for Cognitive Radio Networks with Heavy-Tailed Idle Times , 2017, IEEE Transactions on Mobile Computing.

[9]  Visa Koivunen,et al.  An Order Optimal Policy for Exploiting Idle Spectrum in Cognitive Radio Networks , 2015, IEEE Transactions on Signal Processing.

[10]  Jian Yang,et al.  Enhanced Throughput of Cognitive Radio Networks by Imperfect Spectrum Prediction , 2015, IEEE Communications Letters.

[11]  Mu Zhou,et al.  Reinforcement Learning-Based Multislot Double-Threshold Spectrum Sensing With Bayesian Fusion for Industrial Big Spectrum Data , 2021, IEEE Transactions on Industrial Informatics.

[12]  Sang-Jo Yoo,et al.  Q-Learning Based Multi-Objective Clustering Algorithm for Cognitive Radio Ad Hoc Networks , 2019, IEEE Access.

[13]  Wei Shao,et al.  Reinforcement learning-based spectrum handoff scheme with measured PDR in cognitive radio networks , 2019 .

[14]  H. Vincent Poor,et al.  Multiagent Reinforcement Learning Based Spectrum Sensing Policies for Cognitive Radio Networks , 2013, IEEE Journal of Selected Topics in Signal Processing.

[15]  Xueyan Zhang,et al.  NOMA-Based Resource Allocation for Cluster-Based Cognitive Industrial Internet of Things , 2020, IEEE Transactions on Industrial Informatics.

[16]  Yonghong Zeng,et al.  Sensing-Throughput Tradeoff for Cognitive Radio Networks , 2008, IEEE Trans. Wirel. Commun..

[17]  Dianjie Lu,et al.  Interference-aware spectrum handover for cognitive radio networks , 2014, Wirel. Commun. Mob. Comput..

[18]  Weidang Lu,et al.  A Novel Multichannel Internet of Things Based on Dynamic Spectrum Sharing in 5G Communication , 2019, IEEE Internet of Things Journal.

[19]  Jie Tang,et al.  Joint Precoding Optimization for Secure SWIPT in UAV-Aided NOMA Networks , 2020, IEEE Transactions on Communications.