Approximate dynamic programming techniques for the control of time-varying queuing systems applied to call centers with abandonments and retrials

In this article we develop techniques for applying Approximate Dynamic Programming (ADP) to the control of time-varying queuing systems. First, we show that the classical state space representation in queuing systems leads to approximations that can be significantly improved by disaggregating the state, thereby increasing the dimensionality of the state space. Second, we deal with time-varying parameters by adding them to the state space together with an ADP parameterization. We demonstrate these techniques for optimal admission control in a retrial queue with abandonments and time-varying parameters. Numerical experiments show that our techniques achieve near-optimal performance.
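To give a concrete flavor of the two ideas, the sketch below combines them in a simple form: a linear value-function approximation whose features keep the queue and the retrial orbit disaggregated and include the time-varying arrival rate and the time of day, trained by semi-gradient TD(0) on a slotted simulation of a small call center with abandonments and retrials, and used greedily for admission decisions. All parameter values, cost coefficients, the sinusoidal arrival-rate profile, and the feature choice are illustrative assumptions and are not taken from the paper; the paper's own parameterization and solution method may differ.

```python
import numpy as np

rng = np.random.default_rng(0)

# ---- illustrative model (all values are assumptions, not from the paper) ----
C = 3                      # number of agents
MAX_Q, MAX_ORB = 20, 20    # truncation of queue and retrial orbit
MU = 1.0                   # service rate per busy agent
ABANDON = 0.3              # abandonment rate per waiting caller
RETRY = 0.5                # retrial rate per caller in the orbit
HOLD, ORBIT, REJECT = 1.0, 0.5, 5.0   # cost coefficients
DT, HORIZON = 0.02, 24.0   # slot length and length of one "day"

def lam(t):
    # time-varying arrival rate: a sinusoidal daily profile (assumed)
    return 2.0 + 1.5 * np.sin(2.0 * np.pi * t / HORIZON)

def phi(q, orb, t):
    # features on the *disaggregated* state (queue and orbit kept separate)
    # plus the time-varying parameter lam(t) and the time of day itself
    qn, on, tau = q / MAX_Q, orb / MAX_ORB, t / HORIZON
    return np.array([1.0, qn, qn * qn, on, on * on, qn * on,
                     lam(t) / 4.0, tau, tau * qn, tau * on])

w = np.zeros(10)           # weights of the linear value-function approximation

def v(q, orb, t):
    return phi(q, orb, t) @ w

def admit(q, orb, t):
    # greedy one-step admission rule: accept iff it looks cheaper than
    # deflecting the caller into the retrial orbit
    v_accept = v(min(q + 1, MAX_Q), orb, t)
    v_reject = REJECT + v(q, min(orb + 1, MAX_ORB), t)
    return v_accept <= v_reject

ALPHA, DISCOUNT = 1e-3, 0.999
for episode in range(200):                       # TD(0) over simulated days
    q, orb, t = 0, 0, 0.0
    while t < HORIZON:
        cost = (HOLD * max(q - C, 0) + ORBIT * orb) * DT
        q2, orb2 = q, orb
        u = rng.random()
        p_arr = lam(t) * DT
        p_srv = MU * min(q, C) * DT
        p_abn = ABANDON * max(q - C, 0) * DT
        p_ret = RETRY * orb * DT
        if u < p_arr:                            # fresh arrival
            if admit(q, orb, t) and q < MAX_Q:
                q2 = q + 1
            else:
                cost += REJECT
                orb2 = min(orb + 1, MAX_ORB)
        elif u < p_arr + p_srv:                  # service completion
            q2 = q - 1
        elif u < p_arr + p_srv + p_abn:          # abandonment from the queue
            q2 = q - 1
        elif u < p_arr + p_srv + p_abn + p_ret:  # retrial from the orbit
            if admit(q, orb, t) and q < MAX_Q:
                q2, orb2 = q + 1, orb - 1
        # semi-gradient TD(0) update of the linear approximation
        f = phi(q, orb, t)
        target = cost + DISCOUNT * v(q2, orb2, t + DT)
        w += ALPHA * (target - f @ w) * f
        q, orb, t = q2, orb2, t + DT

# after training, the learned admission rule can be inspected at a few states
print(admit(2, 0, 9.0), admit(18, 10, 9.0))
```

Because the time of day and the arrival rate are themselves features of the approximation, a single weight vector yields a time-dependent admission policy without re-solving the model for each period; this is the role the abstract ascribes to adding time-varying parameters to the state space.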
