Multi-task Deep Reinforcement Learning for Cognitive Spectrum-agile Communications

This paper introduces a cognitive engine design to achieve spectrum-agile communications over a heterogeneous wideband spectrum. The proposed cognitive approach has the ability to learn and avoid interference signals and other harmful signals. The targeted spectrum in this work is much wider than the ones proposed in the literature, most likely covering several hundreds of MHz. The proposed approach is based on deep reinforcement learning (DRL), more specifically on a double deep Q-network (DDQN) made of a convolutional neural network (CNN). The wideband spectrum is divided into a number of sub-bands and each sub-band consists of a number of channels. The problem is modeled as a multi-task DRL, where each sub-band represents a single task. Transfer learning is used between tasks to speed up the learning process. It is shown, through simulations, that the proposed technique can efficiently learn an effective strategy to avoid harmful signals in a noncontiguous wideband spectrum. Furthermore, it outperforms other DRL-based approaches in the literature while operating in a much wider spectrum and maintaining low computational complexity.

[1]  Joseph Mitola,et al.  Cognitive radio: making software radios more personal , 1999, IEEE Wirel. Commun..

[2]  Sudharman K. Jayaweera,et al.  Learning-Aided Sub-Band Selection Algorithms for Spectrum Sensing in Wide-Band Cognitive Radios , 2014, IEEE Transactions on Wireless Communications.

[3]  Behrouz Farhang-Boroujeny,et al.  Filter Bank Spectrum Sensing for Cognitive Radios , 2008, IEEE Transactions on Signal Processing.

[4]  H. Vincent Poor,et al.  Two-dimensional anti-jamming communication based on deep reinforcement learning , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Sudharman K. Jayaweera,et al.  Spectrum-Agile Cognitive Interference Avoidance Through Deep Reinforcement Learning , 2019, CrownCom.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  Liang Xiao,et al.  Anti-Jamming Underwater Transmission With Mobility and Learning , 2018, IEEE Communications Letters.

[8]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[9]  Sudharman K. Jayaweera,et al.  Replicated Q-learning based sub-band selection for wideband spectrum sensing in cognitive radios , 2016, 2016 IEEE/CIC International Conference on Communications in China (ICCC).

[10]  H. T. Kung,et al.  Competing Cognitive Resilient Networks , 2016, IEEE Transactions on Cognitive Communications and Networking.

[11]  Cheng-Xiang Wang,et al.  Wideband spectrum sensing for cognitive radio networks: a survey , 2013, IEEE Wireless Communications.

[12]  Georgios B. Giannakis,et al.  Compressed Sensing for Wideband Cognitive Radios , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[13]  Sudharman K. Jayaweera,et al.  Multi-Agent Reinforcement Learning Based Cognitive Anti-Jamming , 2017, 2017 IEEE Wireless Communications and Networking Conference (WCNC).

[14]  Martine Villegas,et al.  Survey on spectrum utilization in Europe: Measurements, analyses and observations , 2010, 2010 Proceedings of the Fifth International Conference on Cognitive Radio Oriented Wireless Networks and Communications.

[15]  Alagan Anpalagan,et al.  Anti-Jamming Communications Using Spectrum Waterfall: A Deep Reinforcement Learning Approach , 2017, IEEE Communications Letters.

[16]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.