Decentralized Reinforcement Learning Based Anti-Jamming Communication for Self-Organizing Networks

This paper investigates the problem of decentralized spectrum sharing in self-organizing networks against a dynamic and unknown jamming environment using reinforcement learning. In the network, the anti-jamming spectrum sharing has to not only coordinate spectrum access of users, but also combat the malicious jamming. However, most existing anti-jamming approaches are centralized and require information exchange, which are not suitable for decentralized self-organizing networks in the jamming environment. We formulate the multiuser anti-jamming channel selection problem as a Markov game, and propose a decentralized deep reinforcement learning based collaborative anti-jamming algorithm to achieve the equilibrium solution. It is shown in the simulation part that without information exchange, the approach enables multiple users to independently explore the spectrum environment and obtain effective (close to optimal) collaborative anti-jamming strategies against unknown and dynamic jamming.

[1]  Wei Li,et al.  On the Secrecy Capacity of 5G MmWave Small Cell Networks , 2018, IEEE Wireless Communications.

[2]  Kavosh Asadi,et al.  DeepMellow: Removing the Need for a Target Network in Deep Q-Learning , 2019, IJCAI.

[3]  Georges Kaddoum,et al.  A Survey on Intelligent MAC Layer Jamming Attacks and Countermeasures in WSNs , 2016, 2016 IEEE 84th Vehicular Technology Conference (VTC-Fall).

[4]  Charalampos Konstantopoulos,et al.  A survey on jamming attacks and countermeasures in WSNs , 2009, IEEE Communications Surveys & Tutorials.

[5]  Alagan Anpalagan,et al.  Self-Organizing Relay Selection in UAV Communication Networks: A Matching Game Perspective , 2018, IEEE Wireless Communications.

[6]  Alagan Anpalagan,et al.  Anti-Jamming Communications Using Spectrum Waterfall: A Deep Reinforcement Learning Approach , 2017, IEEE Communications Letters.

[7]  Ximing Wang,et al.  Mean Field Reinforcement Learning Based Anti-Jamming Communications for Ultra-Dense Internet of Things in 6G , 2020, 2020 International Conference on Wireless Communications and Signal Processing (WCSP).

[8]  Alagan Anpalagan,et al.  Dynamic Spectrum Anti-Jamming in Broadband Communications: A Hierarchical Deep Reinforcement Learning Approach , 2020, IEEE Wireless Communications Letters.

[9]  Srikanth V. Krishnamurthy,et al.  Denial of Service Attacks in Wireless Networks: The Case of Jammers , 2011, IEEE Communications Surveys & Tutorials.

[10]  Sneha A. Dalvi,et al.  Internet of Things for Smart Cities , 2017 .

[11]  Alagan Anpalagan,et al.  Opportunistic UAV Utilization in Wireless Networks: Motivations, Applications, and Challenges , 2020, IEEE Communications Magazine.

[12]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[13]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[14]  Luliang Jia,et al.  A Collaborative Multi-Agent Reinforcement Learning Anti-Jamming Algorithm in Wireless Networks , 2018, IEEE Wireless Communications Letters.

[15]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[17]  Maria Rita Palattella,et al.  Internet of Things in the 5G Era: Enablers, Architecture, and Business Models , 2016, IEEE Journal on Selected Areas in Communications.

[18]  Muhammad Ali Imran,et al.  A Survey of Self Organisation in Future Cellular Networks , 2013, IEEE Communications Surveys & Tutorials.

[19]  Xin Liu,et al.  Dynamic Spectrum Anti-Jamming Communications: Challenges and Opportunities , 2020, IEEE Communications Magazine.

[20]  Wu He,et al.  Internet of Things in Industries: A Survey , 2014, IEEE Transactions on Industrial Informatics.