Design and performance analysis of inductive QoS scheduling for dynamic network routing

In the last decade, due to emerging real-time and multimedia applications, there has been much interest for developing mechanisms to take into account the quality of service required by these applications. We have proposed earlier an approach used an adaptive algorithm for packet routing using reinforcement learning called K-optimal path Q-routing algorithm (KOQRA) which optimizes simultaneously two additive QoS criteria: cumulative cost path and end-to-end delay. The approach developed here adds a new module to KOQRA dealing with the packet scheduling topic in order to achieve QoS differentiation and to optimize the queuing delay in a dynamically wireless changing environment. This module uses a multi-agent system in which each agent tries to optimize its own behaviour and communicate with other agents to make global coordination possible. This communication is done by mobile agents. In this paper, we adopt the framework of Markov decision process applied to multi-agent system and present a pheromone-Q learning approach which combines the standard Q-learning technique with a synthetic pheromone that acts as a communication medium speeding up the learning process of cooperating agents. Numerical results obtained with OPNET simulator for different levels of trafficpsilas load show that adaptive scheduling improves clearly performances of our earlier KOQRA.

[1]  Risto Miikkulainen,et al.  Confidence Based Dual Reinforcement Q-Routing: An adaptive online network routing algorithm , 1999, IJCAI.

[2]  Sartaj Sahni,et al.  An online heuristic for maximum lifetime routing in wireless sensor networks , 2006, IEEE Transactions on Computers.

[3]  Marwan Krunz,et al.  A randomized algorithm for finding a path subject to multiple QoS requirements , 2001, Comput. Networks.

[4]  Said Hoceini,et al.  Reinforcing Probabilistic Selective Quality of service Routes in Dynamic Heterogeneous Networks , 2008 .

[5]  Sartaj Sahni,et al.  Approximation algorithms for multiconstrained quality-of-service routing , 2006, IEEE Transactions on Computers.

[6]  Sartaj Sahni,et al.  Data Structures, Algorithms, and Applications in C++ , 1997 .

[7]  Dorothy Ndedi Monekosso,et al.  The analysis and performance evaluation of the pheromone‐Q‐learning algorithm , 2004, Expert Syst. J. Knowl. Eng..

[8]  Daniel Kudenko,et al.  Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[9]  Jeffrey M. Jaffe,et al.  Algorithms for finding paths with multiple constraints , 1984, Networks.

[10]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[11]  Marta M. B. Pascoal,et al.  The k Shortest Paths Problem , 1998 .

[12]  Piet Van Mieghem,et al.  Conditions that impact the complexity of QoS routing , 2005, IEEE/ACM Transactions on Networking.

[13]  Said Hoceini,et al.  K-Shortest Paths Q-Routing: A New QoS Routing Algorithm in Telecommunication Networks , 2005, ICN.

[14]  Said Hoceini,et al.  Reinforcing probabilistic selective Quality of Service routes in dynamic irregular networks , 2008, Comput. Commun..

[15]  Azzedine Boukerche,et al.  Quality of service based routing algorithms for heterogeneous networks [Guest editorial] , 2007, IEEE Communications Magazine.

[16]  Daniel Kudenko,et al.  Reinforcement learning of coordination in cooperative multi-agent systems , 2002, AAAI/IAAI.

[17]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.