A neuro-dynamic programming approach to call admission control in integrated service networks : the single link case
暂无分享,去创建一个
John N. Tsitsiklis | Decision Systems. | Peter Marbach | J. Tsitsiklis | P. Marbach | Decision Systems.
[1] Gerald Tesauro,et al. Practical Issues in Temporal Difference Learning , 1992, Mach. Learn..
[2] G. Tesauro. Practical Issues in Temporal Difference Learning , 1992 .
[3] Andrew G. Barto,et al. Improving Elevator Performance Using Reinforcement Learning , 1995, NIPS.
[4] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[5] Lars Asplund,et al. Neural networks for adaptive traffic control in ATM networks , 1995 .
[6] Thomas G. Dietterich,et al. High-Performance Job-Shop Scheduling With A Time-Delay TD-lambda Network , 1995, NIPS.
[7] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[8] Dimitri P. Bertsekas,et al. Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems , 1996, NIPS.