Approximate receding horizon approach for Markov decision processes: average reward case

S. Marcus | H. Chang

[1] Joseph Y. Hui,et al. On computing Markov decision theory-based cost for routing in circuit-switched broadband networks , 2005, Journal of Network and Systems Management.

[2] Robert Givan,et al. Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes , 2004, Discret. Event Dyn. Syst.

[3] E. Gilbert,et al. Optimal infinite-horizon feedback laws for a general class of constrained discrete-time systems: Stability and moving-horizon approximations , 1988 .

[4] Sandjai Bhulai,et al. On the structure of value functions for threshold policies in queueing models , 2003, Journal of Applied Probability.

[5] Martin L. Puterman,et al. A probabilistic analysis of bias optimality in unichain Markov decision processes , 2001, IEEE Trans. Autom. Control.

[6] Robert Givan,et al. On-line sampling-based control for network queueing problems , 2001 .

[7] Nicola Secomandi,et al. Comparing neuro-dynamic programming algorithms for the vehicle routing problem with stochastic demands , 2000, Comput. Oper. Res.

[8] Eric Allender,et al. Complexity of finite-horizon Markov decision process problems , 2000, JACM.

[9] Ger Koole,et al. On the value function of a priority queue with an application to a controlled polling model , 1999, Queueing Syst. Theory Appl.

[10] José Luis González Velarde,et al. Computing tools for modeling, optimization and simulation : interfaces in computer science and operations research , 2000 .

[11] W. A. van den Broek,et al. Moving horizon control in dynamic games , 2002 .

[12] Thomas Parisini,et al. Neural approximators and team theory for dynamic routing: a receding-horizon approach , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[13] Ger Koole,et al. On the value function of a priority queue , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[14] Jay H. Lee,et al. Model predictive control: past, present and future , 1999 .

[15] W. van den Broek. Moving Horizon Control in Dynamic Games , 1999 .

[16] Steven I. Marcus,et al. Simulation-Based Algorithms for Average Cost Markov Decision Processes , 1999 .

[17] G. Koole. The deviation matrix of the M/M/1/∞ and M/M/1/N queue, with applications to controlled queueing models , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[18] D.A. Castanon,et al. Rollout Algorithms for Stochastic Scheduling Problems , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[19] Sean P. Meyn. The policy iteration algorithm for average reward Markov decision processes with general state space , 1997, IEEE Trans. Autom. Control.

[20] W. N. Patten,et al. A sliding horizon feedback control problem with feedforward and disturbance , 1997 .

[21] Gerald Tesauro,et al. On-line Policy Improvement using Monte-Carlo Search , 1996, NIPS.

[22] Awi Federgruen,et al. Detection of minimal forecast horizons in dynamic programs with multiple indicators of the future , 1996 .

[23] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[24] M. K. Ghosh,et al. Discrete-time controlled Markov processes with average cost criterion: a survey , 1993 .

[25] K. R. Krishnan,et al. Separable routing: A scheme for state-dependent routing of circuit switched telephone traffic , 1992, Ann. Oper. Res.

[26] Xi-Ren Cao,et al. Perturbation analysis of discrete event dynamic systems , 1991 .

[27] O. Hernández-Lerma,et al. Error bounds for rolling horizon policies in discrete-time Markov control processes , 1990 .

[28] O. Hernández-Lerma. Adaptive Markov Control Processes , 1989 .

[29] H. Michalska,et al. Receding horizon control of nonlinear systems , 1988, Proceedings of the 28th IEEE Conference on Decision and Control.

[30] O. Hernández-Lerma,et al. A forecast horizon and a stopping rule for general Markov decision processes , 1988 .

[31] J. Lasserre,et al. An on-line procedure in discounted infinite-horizon stochastic optimal control , 1986 .

[32] D. J. White,et al. The Determination of Approximately Optimal Policies in Markov Decision Processes by the Use of Bounds , 1982 .

[33] Leif Johansen,et al. Lectures on macroeconomic planning , 1980 .

[34] Richard Grinold,et al. Finite horizon approximations of infinite horizon linear programs , 1977, Math. Program.

[35] Onésimo Hernández-Lerma,et al. Controlled Markov Processes , 1965 .