A Multi-Agent Reinforcement Learning Approach to Path Selection in Optical Burst Switching Networks

An important issue of research in optical burst switching (OBS) networks is to minimize the loss of bursts due to contention at the intermediate nodes. These contention losses can be minimized with the design of efficient path selection algorithms at the ingress node. Path selection algorithms that learn the optimal path dynamically with the changing traffic conditions outperform the deterministic path selection algorithms. Usually in the single agent path selection algorithms, a path is selected by the agent based on the feedback received at the ingress node which does not capture the effect of the paths selected by the other nodes in the network. We develop a multi-agent approach for path selection that includes the effect of the selection made by all the other nodes in the network. The proposed path selection algorithm uses agents at different source nodes to collectively learn the network dynamics and select the best outgoing path for each burst. We present simulation results to demonstrate the effectiveness of the proposed algorithm over the other similar algorithms in the literature.

[1]  Vinod Vokkarane,et al.  Dynamic congestion-based load balanced routing in optical burst-switched networks , 2003, GLOBECOM '03. IEEE Global Telecommunications Conference (IEEE Cat. No.03CH37489).

[2]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[3]  Jason P. Jue,et al.  Optical Burst Switched Networks , 2004 .

[4]  Chunming Qiao,et al.  Assembling TCP/IP packets in optical burst switched networks , 2002, Global Telecommunications Conference, 2002. GLOBECOM '02. IEEE.

[5]  Yijun Xiong,et al.  Control architecture in optical burst-switched WDM networks , 2000, IEEE Journal on Selected Areas in Communications.

[6]  C. Siva Ram Murthy,et al.  A Reinforcement Learning Framework for Path Selection and Wavelength Selection in Optical Burst Switched Networks , 2007, IEEE Journal on Selected Areas in Communications.

[7]  Vinod Vokkarane,et al.  Evaluation of burst retransmission in optical burst-switched networks , 2005, 2nd International Conference on Broadband Networks, 2005..

[8]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[9]  Kee Chaing Chua,et al.  Load balancing using adaptive alternate routing in IP-over-WDM optical burst switching networks , 2003, OptiComm: Optical Networking and Communications Conference.

[10]  Li Yang,et al.  Adaptive path selection in OBS networks , 2006, Journal of Lightwave Technology.

[11]  Naoaki Yamanaka,et al.  Self-Learning Route Selection Scheme Using Search Packets in an OBS Network , 2005 .

[12]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  Jing Teng,et al.  Routing path optimization in optical burst switched networks , 2005, Conference onOptical Network Design and Modeling, 2005..