Routing based on layered stochastic games for multi-hop cognitive radio networks

This paper proposes a distributed routing mechanism for the multi-hop Cognitive Radio Networks (CRNs) over multiple primary channels. The Secondary Users (SUs) attempts to utilize the channels and minimize their delay along the route while avoiding causing interference to the Primary Users (PUs). In order to address the problem of time-varying channel condition due to the PU dynamics, the route-selection process is modeled as a global Markov Decision Process (MDP). We show that such a global routing MDP can be decomposed into the layered MDPs, in which the interactions between neighbor SUs with their local next-hop selection are modeled as the local stochastic games. By applying reinforcement learning with utility based fictitious play, the best response of each SU can be learned from the local game with the only need for the information exchange from next-hop SUs. The proposed algorithm is evaluated through simulations and is shown to be effective in reducing the delays for multiple flows in the CRN.

[1]  Xuemin Shen,et al.  Spectrum-Aware Opportunistic Routing in Multi-Hop Cognitive Radio Networks , 2012, IEEE Journal on Selected Areas in Communications.

[2]  Quanyan Zhu,et al.  Interference Aware Routing Game for Cognitive Radio Multi-Hop Networks , 2012, IEEE Journal on Selected Areas in Communications.

[3]  Honggang Zhang,et al.  Topology Management in CogMesh: A Cluster-Based Cognitive Radio Mesh Network , 2007, 2007 IEEE International Conference on Communications.

[4]  Wei Yuan,et al.  Local Coordination Based Routing and Spectrum Assignment in Multi-hop Cognitive Radio Networks , 2008, Mob. Networks Appl..

[5]  Byung-Jae Kwak,et al.  Performance analysis of exponential backoff , 2005, IEEE/ACM Transactions on Networking.

[6]  Jian Tang,et al.  QoS Routing in Wireless Mesh Networks with Cognitive Radios , 2008, IEEE GLOBECOM 2008 - 2008 IEEE Global Telecommunications Conference.

[7]  Brian M. Sadler,et al.  Cognitive Medium Access: Constraining Interference Based on Experimental Models , 2008, IEEE Journal on Selected Areas in Communications.

[8]  Xianfu Chen,et al.  Stochastic Power Adaptation with Multiagent Reinforcement Learning for Cognitive Wireless Mesh Networks , 2013, IEEE Transactions on Mobile Computing.

[9]  Michael P. Wellman,et al.  Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Francesca Cuomo,et al.  Routing in cognitive radio networks: Challenges and solutions , 2011, Ad Hoc Networks.