Paper information - Reinforcement Learning for Mixed Cooperative/Competitive Dynamic Spectrum Access

Reinforcement Learning for Mixed Cooperative/Competitive Dynamic Spectrum Access

A dynamic spectrum sharing problem with a mixed collaborative/competitive objective and partial information about peers’ performances that arises from the DARPA Spectrum Collaboration Challenge is considered. Because of the very high complexity of the problem and the enormous size of the state space, it is broken down into the subproblems of channel selection, flow admission control, and transmission schedule assignment. The channel selection problem is the focus of this paper. A reinforcement learning algorithm based on a reduced state is developed to select channels, and a neural network is used as a function approximator to fill in missing values in the resulting input-action matrix. The performance is compared with that obtained by a hand-tuned expert system.
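The channel-selection idea summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a tabular Q-learning update over an integer-indexed reduced state, a hypothetical feature encoding (normalized state and action indices), and a small one-hidden-layer network written in NumPy that is fit only on visited entries and then used to predict the unvisited ones. The names `q_update` and `fill_missing` and all hyperparameters are illustrative assumptions.

```python
import numpy as np


def q_update(q, visited, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update and mark the entry as visited."""
    target = reward + gamma * q[next_state].max()
    q[state, action] += alpha * (target - q[state, action])
    visited[state, action] = True


def fill_missing(q, visited, hidden=16, epochs=2000, lr=0.05, seed=0):
    """Fit a small network on visited entries and predict the unvisited ones.

    Assumption: each (state, action) pair is encoded as two normalized indices.
    A real system would likely use richer features of the reduced state.
    """
    mask = visited.ravel()
    if not mask.any():
        return q.copy()  # nothing to learn from yet

    rng = np.random.default_rng(seed)
    n_states, n_actions = q.shape
    s, a = np.meshgrid(np.arange(n_states), np.arange(n_actions), indexing="ij")
    x = np.stack(
        [s.ravel() / max(n_states - 1, 1), a.ravel() / max(n_actions - 1, 1)],
        axis=1,
    )
    xt = x[mask]
    yt = q.ravel()[mask][:, None]

    # One hidden tanh layer, trained by full-batch gradient descent on MSE.
    w1 = rng.normal(0.0, 0.5, (2, hidden))
    b1 = np.zeros(hidden)
    w2 = rng.normal(0.0, 0.5, (hidden, 1))
    b2 = np.zeros(1)
    for _ in range(epochs):
        h = np.tanh(xt @ w1 + b1)
        err = (h @ w2 + b2 - yt) / len(xt)   # gradient of 0.5 * mean squared error
        dh = (err @ w2.T) * (1.0 - h ** 2)   # backpropagate through tanh
        w2 -= lr * (h.T @ err)
        b2 -= lr * err.sum(axis=0)
        w1 -= lr * (xt.T @ dh)
        b1 -= lr * dh.sum(axis=0)

    predicted = (np.tanh(x @ w1 + b1) @ w2 + b2).reshape(q.shape)
    # Keep learned values where available; use predictions only for the gaps.
    return np.where(visited, q, predicted)
```

In this sketch the learned table stays authoritative for every visited state-action pair, and the approximator only supplies estimates for pairs the agent has not yet tried, so channel selection can still rank all actions in a sparsely explored state.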
