Joint Channel Allocation and Power Control Based on Long Short-Term Memory Deep Q Network in Cognitive Radio Networks

Efficient spectrum resource management in cognitive radio networks (CRNs) is a promising method that improves the utilization of spectrum resource. In particular, the power control and channel allocation are of top priorities in spectrum resource management. Nevertheless, the joint design of power control and channel allocation is an NP-hard problem and the research is still in the preliminary stage. In this paper, we propose a novel joint approach based on long short-term memory deep Q network (LSTM-DQN). Our objective is to obtain the channel allocation schemes of the access points (APs) and the power control strategies of the secondary users (SUs). Specifically, the received signal strength information (RSSI) collected by the microbase stations is used as the input of LSTM-DQN. In this way, the collection of RSSI can be shared between users. After the training is completed, the APs are capable of selecting channels with small interference while the SUs may access the authorized channels in an underlay operation mode without knowing any knowledge about the primary users (PUs). Experimental results show that the channels are allocated to the APs with a lower probability of collision. Moreover, the SUs can adjust their power control strategies quickly to avoid the harmful interference to the PUs when the environment parameters change randomly. Consequently, the overall performance of CRNs and the utilization of spectrum resources are improved significantly compared to existing popular solutions.

[1]  H. Vincent Poor,et al.  Reinforcement learning based distributed multiagent sensing policy for cognitive radio networks , 2011, 2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN).

[2]  Roy D. Yates,et al.  Constrained power control , 1994, Wirel. Pers. Commun..

[3]  Arumugam Nallanathan,et al.  On the Throughput and Spectrum Sensing Enhancement of Opportunistic Spectrum Access Cognitive Radio Networks , 2012, IEEE Transactions on Wireless Communications.

[4]  Zhi Chen,et al.  Intelligent Power Control for Spectrum Sharing in Cognitive Radios: A Deep Reinforcement Learning Approach , 2017, IEEE Access.

[5]  Kyriakos G. Vamvoudakis,et al.  Safe reinforcement learning for dynamical games , 2020, International Journal of Robust and Nonlinear Control.

[6]  Yongwei Zhang,et al.  A cooperative spectrum sensing method based on signal decomposition and K-medoids algorithm , 2019 .

[7]  Wei Cheng,et al.  Spectrum prediction in cognitive radio networks , 2013, IEEE Wireless Communications.

[8]  Peter Stone,et al.  Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.

[9]  Kobi Cohen,et al.  Deep Multi-User Reinforcement Learning for Distributed Dynamic Spectrum Access , 2017, IEEE Transactions on Wireless Communications.

[10]  James R. Zeidler,et al.  Opportunistic Spectrum Access for Cognitive Radio Networks withMultiple Secondary Users , 2013, IEEE Transactions on Wireless Communications.

[11]  Syed Ali Jafar,et al.  COGNITIVE RADIOS FOR DYNAMIC SPECTRUM ACCESS - The Throughput Potential of Cognitive Radio: A Theoretical Perspective , 2007, IEEE Communications Magazine.

[12]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[13]  Xin-Ping Guan,et al.  Nonconvex dynamic spectrum allocation for cognitive radio networks via particle swarm optimization and simulated annealing , 2012, Comput. Networks.

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  John M. Chapin,et al.  COGNITIVE RADIOS FOR DYNAMIC SPECTRUM ACCESS - The Path to Market Success for Dynamic Spectrum Access Technology , 2007, IEEE Communications Magazine.

[16]  Pin Wan,et al.  A survey of dynamic spectrum allocation based on reinforcement learning algorithms in cognitive radio networks , 2018, Artif. Intell. Rev..

[17]  A. Wolisz,et al.  Primary Users in Cellular Networks: A Large-Scale Measurement Study , 2008, 2008 3rd IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[18]  Ángel G. Andrade,et al.  Comparing particle swarm optimization variants for a cognitive radio network , 2013, Appl. Soft Comput..

[19]  Zhengtao Ding,et al.  Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information , 2019, Neural Comput. Appl..

[20]  Mohsen Nader Tehrani,et al.  Auction Based Spectrum Trading for Cognitive Radio Networks , 2013, IEEE Communications Letters.

[21]  Zhu Han,et al.  Dynamic spectrum access in IEEE 802.22- based cognitive wireless networks: a game theoretic model for competitive spectrum bidding and pricing , 2009, IEEE Wireless Communications.

[22]  Chenglong Wang,et al.  Adaptive optimal controller design for a class of LDI-based neural network systems with input time-delays , 2020, Neurocomputing.