Effect of reinforcement learning on routing of cognitive radio ad-hoc networks

Today's network control systems have very limited ability to adapt the changes in network. The addition of reinforcement learning (RL) based network management agents can improve Quality of Service (QoS) by reconfiguring the network layer protocol parameters in response to observed network performance conditions. This paper presents a closed-loop approach to tuning the parameters of the protocol of network layer based on current and previous network state observation for user and channel interference, specifically by modifying some parameters of Ad-Hoc On-Demand Distance Vector (AODV) routing protocol for Cognitive Radio Ad-Hoc Network (CRAHN) environment. In this work, we provide a self-contained learning method based on machine-learning techniques that have been or can be used for developing cognitive routing protocols. Generally, the developed mathematical model based on the one RL technique to handle the route decision in channel switching and user mobility situation so that the overall end-to-end delay can be minimized and the overall throughput of the network can be maximized according to the application requirement in CRAHN environment. Here is the proposed self-configuration method based on RL technique can improve the performance of the original AODV protocol, reducing protocol overhead and end-to-end delay for CRAHN while increasing the packet delivery ratio depending upon the traffic model. Simulation results are shown using NS-2 which shows the proposed model performance is much better than the previous AODV protocol.

[1]  Dimitri P. Bertsekas,et al.  Approximate Dynamic Programming , 2017, Encyclopedia of Machine Learning and Data Mining.

[2]  Sampath Rangarajan,et al.  Cross-layer optimization for streaming scalable video over fading wireless networks , 2010, IEEE Journal on Selected Areas in Communications.

[3]  Marco Conti,et al.  MobileMAN: integration and experimentation of legacy mobile multihop ad hoc networks , 2006, IEEE Communications Magazine.

[4]  Ian F. Akyildiz,et al.  NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey , 2006, Comput. Networks.

[5]  Weihua Zhuang,et al.  Cross-layer design for resource allocation in 3G wireless networks and beyond , 2005, IEEE Communications Magazine.

[6]  M. Motani,et al.  Cross-layer design: a survey and the road ahead , 2005, IEEE Communications Magazine.

[7]  Yuguang Fang,et al.  Coolest Path: Spectrum Mobility Aware Routing Metrics in Cognitive Ad Hoc Networks , 2011, 2011 31st International Conference on Distributed Computing Systems.

[8]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[9]  Li Sun,et al.  Performance Comparison of Routing Protocols for Cognitive Radio Networks , 2013, 2013 IEEE 21st International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems.

[10]  Amr Mohamed,et al.  Joint Routing and Resource Allocation for Delay Minimization in Cognitive Radio Based Mesh Networks , 2014, IEEE Transactions on Wireless Communications.

[11]  Michael L. Littman,et al.  Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[12]  Abhijit Gosavi,et al.  Reinforcement Learning: A Tutorial Survey and Recent Advances , 2009, INFORMS J. Comput..

[13]  Roman Levenshteyn,et al.  Mobile services interworking for IMS and XML WebServices , 2006, IEEE Communications Magazine.

[14]  Markus Fiedler,et al.  A Management Architecture for Multimedia Communication in Cognitive Radio Networks , 2015 .

[15]  Richard E. Tuggle Cognitive multipath routing for mission critical multi-hop wireless networks , 2010, 2010 42nd Southeastern Symposium on System Theory (SSST).

[16]  Alexandru Popescu,et al.  Cognitive Radio Networks: Elements and Architectures , 2014 .

[17]  Marco Conti,et al.  MobileMAN: Design, Integration, and Experimentation of Cross-Layer Mobile Multihop Ad Hoc Networks , 2006 .

[18]  Ting Wang,et al.  Adaptive Routing for Sensor Networks using Reinforcement Learning , 2006, The Sixth IEEE International Conference on Computer and Information Technology (CIT'06).

[19]  Rafael P. Laufer,et al.  XPRESS: a cross-layer backpressure architecture for wireless multi-hop networks , 2011, MobiCom '11.

[20]  Violet R. Syrotiuk,et al.  On Timers of Routing Protocols in MANETs , 2004, ADHOC-NOW.

[21]  Riaan Wolhuter,et al.  Traffic Class Prediction and Prioritization on a Diversified IP Network Using Machine Learning , 2009, 2009 IEEE Globecom Workshops.

[22]  Brandon F. Lo A survey of common control channel design in cognitive radio networks , 2011, Phys. Commun..

[23]  Miao Ma,et al.  Joint Spectrum Sharing and Fair Routing in Cognitive Radio Networks , 2008, 2008 5th IEEE Consumer Communications and Networking Conference.

[24]  Songwu Lu,et al.  SAMER: Spectrum Aware Mesh Routing in Cognitive Radio Networks , 2008, 2008 3rd IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks.

[25]  Gerald Tesauro,et al.  Programming backgammon using self-teaching neural nets , 2002, Artif. Intell..

[26]  Zhi-Hua Zhou,et al.  Resource Allocation for Heterogeneous Cognitive Radio Networks with Imperfect Spectrum Sensing , 2013, IEEE Journal on Selected Areas in Communications.

[27]  Jim Dowling,et al.  Using feedback in collaborative reinforcement learning to adaptively optimize MANET routing , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[28]  Yiwei Thomas Hou,et al.  A Distributed Optimization Algorithm for Multi-Hop Cognitive Radio Networks , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[29]  Detlef D. Nauck,et al.  Machine learning based Call Admission Control approaches: A comparative study , 2010, 2010 International Conference on Network and Service Management.

[30]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[31]  Marco Conti,et al.  Cross-layering in mobile ad hoc network design , 2004, Computer.