A self-adaptive routing paradigm for wireless mesh networks based on reinforcement learning

Classical routing protocols for WMNs are typically designed to achieve specific target objectives (e.g., maximum throughput), and they offer very limited flexibility. As a consequence, more intelligent and adaptive mesh networking solutions are needed to obtain high performance in diverse network conditions. To this end, we propose a reinforcement learning-based routing framework that allows each mesh device to dynamically select at run time a routing protocol from a pre-defined set of routing options, which provides the best performance. The most salient advantages of our solution are: i) it can maximize routing performance considering different optimization goals, ii) it relies on a compact representation of the network state and it does not need any model of its evolution, and iii) it efficiently applies Q-learning methods to guarantee convergence of the routing decision process. Through extensive ns-2 simulations we show the superior performance of the proposed routing approach in comparison with two alternative routing schemes.

[1]  Robert Tappan Morris,et al.  ExOR: opportunistic multi-hop routing for wireless networks , 2005, SIGCOMM '05.

[2]  Katia Obraczka,et al.  Combining on-demand and opportunistic routing for intermittently connected networks , 2009, Ad Hoc Networks.

[3]  Robert Tappan Morris,et al.  Link-level measurements from an 802.11b mesh network , 2004, SIGCOMM '04.

[4]  Philippe Jacquet,et al.  Optimized Link State Routing Protocol (OLSR) , 2003, RFC.

[5]  Yunhao Liu,et al.  Joint Throughput Optimization for Wireless Mesh Networks , 2009, IEEE Transactions on Mobile Computing.

[6]  Jonathan Wellons,et al.  Augmenting predictive with oblivious routing for wireless mesh networks under traffic uncertainty , 2010, Comput. Networks.

[7]  A. Forster,et al.  Machine Learning Techniques Applied to Wireless Ad-Hoc Networks: Guide and Survey , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[8]  Jitendra Padhye,et al.  Routing in multi-radio, multi-hop wireless mesh networks , 2004, MobiCom '04.

[9]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[10]  Marco Conti,et al.  Mesh networks: commodity multihop ad hoc networks , 2005, IEEE Communications Magazine.

[11]  Marco Conti,et al.  Opportunistic packet scheduling and routing in wireless mesh networks , 2010, 2010 IFIP Wireless Days.

[12]  Ping Zhou,et al.  Asymptotic Capacity of Infrastructure Wireless Mesh Networks , 2008, IEEE Transactions on Mobile Computing.

[13]  Charles E. Perkins,et al.  Ad hoc On-Demand Distance Vector (AODV) Routing , 2001, RFC.

[14]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  Raffaele Bruno,et al.  Survey on diversity-based routing in wireless mesh networks: Challenges and solutions , 2010, Comput. Commun..

[17]  Vasilios A. Siris,et al.  A Contention-Aware Routing Metric for Multi-Rate Multi-Radio Mesh Networks , 2008, 2008 5th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[18]  Robert Morris,et al.  Link-level measurements from an 802.11b mesh network , 2004, SIGCOMM 2004.