Multi-Agent Reinforcement Learning Based Cognitive Anti-Jamming

This paper proposes a reinforcement learning based approach to anti-jamming communications with wideband autonomous cognitive radios (WACRs) in a multi-agent environment. Assumed system model allows multiple WACRs to simultaneously operate over the same (wide) spectrum band. Each radio attempts to evade the transmissions of other WACRs as well as avoiding a jammer signal that sweeps across the whole spectrum band of interest. The WACR makes use of its spectrum knowledge acquisition ability to detect and identify the location (in frequency) of this sweeping jammer and the signals of other WACRs. This information and reinforcement learning is used to successfully learn a sub-band selection policy to avoid both the jammer signal as well as interference from other radios. It is shown, through simulations, that the proposed learning-based sub-band selection policy has low computational complexity and significantly outperforms the random sub-band selection policy.

[1]  Sudharman K. Jayaweera,et al.  A Survey on Machine-Learning Techniques in Cognitive Radios , 2013, IEEE Communications Surveys & Tutorials.

[2]  Sudharman K. Jayaweera,et al.  Replicated Q-learning based sub-band selection for wideband spectrum sensing in cognitive radios , 2016, 2016 IEEE/CIC International Conference on Communications in China (ICCC).

[3]  K. J. Ray Liu,et al.  An anti-jamming stochastic game for cognitive radio networks , 2011, IEEE Journal on Selected Areas in Communications.

[4]  Fangli Xu,et al.  Overview of 3GPP LTE-advanced carrier aggregation for 4G wireless communications , 2012, IEEE Communications Magazine.

[5]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[6]  Roberto Di Pietro,et al.  Jamming mitigation in cognitive radio networks , 2013, IEEE Network.

[7]  Sudharman K. Jayaweera,et al.  Reinforcement learning based anti-jamming with wideband autonomous cognitive radios , 2016, 2016 IEEE/CIC International Conference on Communications in China (ICCC).

[8]  Aditya Trivedi,et al.  Anti-jamming in cognitive radio networks using reinforcement learning algorithms , 2012, 2012 Ninth International Conference on Wireless and Optical Communications Networks (WOCN).

[9]  Husheng Li,et al.  Multi-agent Q-learning of channel selection in multi-user cognitive radio systems: A two by two case , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[10]  Husheng Li,et al.  Multi-Agent Q-Learning for Competitive Spectrum Access in Cognitive Radio Systems , 2010, 2010 Fifth IEEE Workshop on Networking Technologies for Software Defined Radio Networks (SDR).

[11]  Sudharman K. Jayaweera,et al.  Learning-Aided Sub-Band Selection Algorithms for Spectrum Sensing in Wide-Band Cognitive Radios , 2014, IEEE Transactions on Wireless Communications.

[12]  Ben Y. Zhao,et al.  Multi-channel Jamming Attacks using Cognitive Radios , 2007, 2007 16th International Conference on Computer Communications and Networks.

[13]  Joseph Mitola,et al.  Cognitive radio: making software radios more personal , 1999, IEEE Wirel. Commun..

[14]  Kok-Lim Alvin Yau,et al.  Applications of Reinforcement Learning to Cognitive Radio Networks , 2010, 2010 IEEE International Conference on Communications Workshops.

[15]  Ian F. Akyildiz,et al.  Multiagent jamming-resilient control channel game for cognitive radio ad hoc networks , 2012, 2012 IEEE International Conference on Communications (ICC).