A Distributed Reinforcement Learning Scheme for Network Routing
暂无分享,去创建一个
In this paper we describe a self-adjusting algorithm for packet routing, in which a reinforcement learning module is embedded into each node of a switching network. Only local communication is used to keep accurate statistics at each node on which routing policies lead to minimal delivery times. In simple experiments involving a 36-node, irregularly connected network, this learning approach proves superior to a nonadaptive algorithm based on precomputed shortest paths.
[1] Harry Rudin,et al. On Routing and "Delta Routing": A Taxonomy and Performance Comparison of Techniques for Packet-Switched Networks , 1976, IEEE Trans. Commun..
[2] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[3] Ben J. A. Kröse,et al. Learning from delayed rewards , 1995, Robotics Auton. Syst..