A reinforcement learning approach for and scheduling packets in dynamic networks

Actually, various kinds of sources (such as voice, video, or data) with diverse traffic characteristics and quality of service requirements (QoS), which are multiplexed at very high rates, leads to significant traffic problems such as packet losses, transmission delays, delay variations, etc, caused mainly by congestion in the networks. The prediction of these problems in real time is quite difficult, making the effectiveness of "traditional" methodologies based on analytical models questionable. Effective network routing means selecting the optimal communication paths. It can be modeled as a multiagent RL problem. We propose an adaptive routing and scheduling algorithm based on reinforcement learning techniques.

[1]  Dit-Yan Yeung,et al.  Predictive Q-Routing: A Memory-based Reinforcement Learning Approach to Adaptive Traffic Control , 1995, NIPS.

[2]  Klara Nahrstedt,et al.  Hop-by-hop routing algorithms for premium-class traffic in DiffServ networks , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[3]  Cui Yong Research on Internetwork QoS Routing Algorithms: a Survey , 2002 .

[4]  P. Mars,et al.  Satisfying QoS with a learning based scheduling algorithm , 1998, 1998 Sixth International Workshop on Quality of Service (IWQoS'98) (Cat. No.98EX136).

[5]  P. Gallinari,et al.  Discriminative training for improved neural prediction systems , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Serge Fdida,et al.  A scalable algorithm for link-state QoS-based routing with three metrics , 2001, ICC 2001. IEEE International Conference on Communications. Conference Record (Cat. No.01CH37240).

[7]  Devika Subramanian,et al.  Ants and Reinforcement Learning: A Case Study in Routing in Dynamic Networks , 1997, IJCAI.

[8]  Marco Dorigo,et al.  An adaptive multi-agent routing algorithm inspired by ants behavior , 1998 .

[9]  Stephen E. Deering,et al.  Distance Vector Multicast Routing Protocol , 1988, RFC.

[10]  Ian F. Akyildiz,et al.  A new preemption policy for DiffServ-aware traffic engineering to minimize rerouting , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[11]  Kang G. Shin,et al.  Adaptive-weighted packet scheduling for premium service , 2001, ICC.

[12]  Erol Gelenbe,et al.  Towards Networks with Cognitive Packets , 2001 .

[13]  Michael L. Littman,et al.  Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach , 1993, NIPS.

[14]  Martin Heusse,et al.  A New Distributed and Adaptive Approach to Routing and Load Balancing in Dynamic Communication Networks , .