论文信息 - Decomposition of Reinforcement Learning for Admission Control of Self-Similar Call Arrival Processes

Decomposition of Reinforcement Learning for Admission Control of Self-Similar Call Arrival Processes

This paper presents predictive gain scheduling, a technique for simplifying reinforcement learning problems by decomposition. Link admission control of self-similar call traffic is used to demonstrate the technique. The control problem is decomposed into on-line prediction of near-future call arrival rates, and precomputation of policies for Poisson call arrival processes. At decision time, the predictions are used to select among the policies. Simulations show that this technique results in significantly faster learning without any performance loss, compared to a reinforcement learning controller that does not decompose the problem.

Jakob Carlström

[1] Zbigniew Dziong,et al. Call admission and routing in multi-service loss networks , 1994, IEEE Trans. Commun..

[2] Timothy X. Brown,et al. Adaptive call admission control under quality of service constraints: a reinforcement learning solution , 2000, IEEE Journal on Selected Areas in Communications.

[3] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .

[4] Anja Feldmann,et al. The changing nature of network traffic: scaling phenomena , 1998, CCRV.

[5] Jakob and Nordström Ernst Carlström. Reinforcement learning for control of self-similar call traffic in broadband networks , 1999 .

[6] Sally Floyd,et al. Wide area traffic: the failure of Poisson modeling , 1995, TNET.

[7] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[8] John N. Tsitsiklis,et al. Call admission control and routing in integrated services networks using neuro-dynamic programming , 2000, IEEE Journal on Selected Areas in Communications.

[9] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[10] Karl Johan Åström,et al. Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.

[11] Walter Willinger,et al. On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[12] Zbigniew Dziong,et al. ATM Network Resource Management , 1997 .