论文信息 - A Distributed Reinforcement Learning Scheme for Network Routing

A Distributed Reinforcement Learning Scheme for Network Routing

In this paper we describe a self-adjusting algorithm for packet routing, in which a reinforcement learning module is embedded into each node of a switching network. Only local communication is used to keep accurate statistics at each node on which routing policies lead to minimal delivery times. In simple experiments involving a 36-node, irregularly connected network, this learning approach proves superior to a nonadaptive algorithm based on precomputed shortest paths.

Michael L. Littman | Justin A. Boyan | M. Littman | J. Boyan

[1] Harry Rudin,et al. On Routing and "Delta Routing": A Taxonomy and Performance Comparison of Techniques for Packet-Switched Networks , 1976, IEEE Trans. Commun..

[2] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .

[3] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..