Can Reinforcement Learning Address Security Issues? An Investigation into a Clustering Scheme in Distributed Cognitive Radio Networks

This paper investigates the effectiveness of a reinforcement learning (RL) model in clustering as an approach to achieving higher network scalability in distributed cognitive radio networks. Specifically, it analyzes the effects of two RL parameters, the learning rate and the discount factor, in a volatile environment consisting of member nodes (secondary users, or SUs) that launch attacks with various probabilities. The clusterhead, which resides in an operating region (environment) characterized by the probability of attacks, counters the malicious SUs by leveraging an RL model. Simulation results show that, in a volatile operating environment, the RL model with learning rate α = 1 provides the highest network scalability when the probability of attacks ranges between 0.3 and 0.7, while the discount factor γ does not play a significant role in learning in an operating environment that is volatile due to attacks.
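
The abstract does not state which RL update rule the clusterhead uses. As a rough illustration only, the sketch below assumes the textbook Q-learning update, Q ← (1 − α)·Q + α·(r + γ·max Q'), to show what the learning rate α and discount factor γ control. The function name, the reward values, and the scenario are hypothetical and are not taken from the paper.

```python
def q_update(q_old, reward, max_next_q, alpha, gamma):
    """One Q-value update in the standard Q-learning form (assumed, not from the paper).

    alpha (learning rate) weights the new observation against the old estimate;
    gamma (discount factor) weights the estimated future value.
    """
    target = reward + gamma * max_next_q
    return (1.0 - alpha) * q_old + alpha * target


# Hypothetical scenario: a member node previously rated 0.8 is now observed
# attacking (reward 0.0). Future value is set to 0.0 to isolate the effect of alpha.
for alpha in (0.1, 0.5, 1.0):
    new_q = q_update(q_old=0.8, reward=0.0, max_next_q=0.0, alpha=alpha, gamma=0.9)
    print(f"alpha={alpha}: updated estimate = {new_q}")
```

Under this assumed rule, α = 1 discards the old estimate entirely and keeps only the latest observation, which is one plausible reading of why a high learning rate would suit an environment whose attack behavior changes quickly.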
