Reinforcement Learning for Routing in Cognitive Radio Ad Hoc Networks

Cognitive radio (CR) enables unlicensed users (or secondary users, SUs) to sense for and exploit underutilized licensed spectrum owned by the licensed users (or primary users, PUs). Reinforcement learning (RL) is an artificial intelligence approach that enables a node to observe, learn, and make appropriate decisions on action selection in order to maximize network performance. Routing enables a source node to search for a least-cost route to its destination node. While there have been increasing efforts to enhance the traditional RL approach for routing in wireless networks, this research area remains largely unexplored in the domain of routing in CR networks. This paper applies RL in routing and investigates the effects of various features of RL (i.e., reward function, exploitation, and exploration, as well as learning rate) through simulation. New approaches and recommendations are proposed to enhance the features in order to improve the network performance brought about by RL to routing. Simulation results show that the RL parameters of the reward function, exploitation, and exploration, as well as learning rate, must be well regulated, and the new approaches proposed in this paper improves SUs' network performance without significantly jeopardizing PUs' network performance, specifically SUs' interference to PUs.

[1]  Jianli Zhao,et al.  An Improved Centralized Cognitive Radio Network Spectrum Allocation Algorithm Based on the Allocation Sequence , 2013, Int. J. Distributed Sens. Networks.

[2]  M. E. Muller,et al.  A Note on the Generation of Random Normal Deviates , 1958 .

[3]  Michael L. Littman,et al.  Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[4]  Kok-Lim Alvin Yau,et al.  Routing in Distributed Cognitive Radio Networks: A Survey , 2012, Wireless Personal Communications.

[5]  F. Richard Yu,et al.  Prediction-Based Topology Control and Routing in Cognitive Radio Mobile Ad Hoc Networks , 2010, 2010 INFOCOM IEEE Conference on Computer Communications Workshops.

[6]  Manuela M. Veloso,et al.  Multiagent learning using a variable learning rate , 2002, Artif. Intell..

[7]  Kok-Lim Alvin Yau,et al.  Application of reinforcement learning to routing in distributed wireless networks: a review , 2013, Artificial Intelligence Review.

[8]  Mitesh P. Patel,et al.  Tuning of reinforcement learning parameters applied to OLSR using a cognitive network design tool , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  Quanyan Zhu,et al.  Dynamic Interference Minimization Routing Game for On-Demand Cognitive Pilot Channel , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[11]  Farbod Razzazi,et al.  Power and Time Slot Allocation in Cognitive Relay Networks Using Particle Swarm Optimization , 2013, TheScientificWorldJournal.

[12]  Wei Zhang,et al.  A geometric approach to improve spectrum efficiency for cognitive relay networks , 2010, IEEE Transactions on Wireless Communications.

[13]  Dragan Jevtic,et al.  Reinforcement learning as adaptive network routing of mobile agents , 2010, The 33rd International Convention MIPRO.

[14]  Ying Zhang,et al.  Constrained flooding: a robust and efficient routing framework for wireless sensor networks , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).

[15]  Wahidah Hashim,et al.  A reinforcement learning-based routing scheme for cognitive radio ad hoc networks , 2014, 2014 7th IFIP Wireless and Mobile Networking Conference (WMNC).

[16]  Yang Yang,et al.  Reinforcement learning based spectrum-aware routing in multi-hop cognitive radio networks , 2009, 2009 4th International Conference on Cognitive Radio Oriented Wireless Networks and Communications.

[17]  Ian F. Akyildiz,et al.  CRP: A Routing Protocol for Cognitive Radio Ad Hoc Networks , 2011, IEEE Journal on Selected Areas in Communications.

[18]  Petteri Nurmi,et al.  Reinforcement Learning for Routing in Ad Hoc Networks , 2007, 2007 5th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks and Workshops.

[19]  A. Forstert,et al.  FROMS: Feedback Routing for Optimizing Multiple Sinks in WSN with Reinforcement Learning , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[20]  Jim Dowling,et al.  Using feedback in collaborative reinforcement learning to adaptively optimize MANET routing , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.