论文信息 - Policy Iteration for Average Cost Markov Control Processes on Borel Spaces - 字舞流文

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

O. Hernández-Lerma | J. Lasserre

[1] Manfred Schäl,et al. Average Optimality in Dynamic Programming with General State Space , 1993, Math. Oper. Res..

[2] R. Dekker. Counter examples for compact action Markov decision chains with average reward criteria , 1987 .

[3] Irwin E. Schochetman. Pointwise versions of the maximum theorem with applications in optimization , 1990 .

[4] Martin L. Puterman,et al. On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case , 1987, Math. Oper. Res..

[5] Onésimo Hernández-Lerma. Existence of average optimal policies in Markov control processes with strictly unbounded costs , 1993, Kybernetika.

[6] Linn I. Sennott,et al. Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs , 1989, Oper. Res..

[7] O. Hernández-Lerma,et al. Discrete-time Markov control processes , 1999 .

[8] Onésimo Hernández-Lerma,et al. Average cost Markov control processes with weighted norms: existence of canonical policies , 1995 .

[9] P. Glynn. A Lyapunov Bound for Solutions of Poisson's Equation , 1989 .

[10] V. Benes,et al. Finite regular invariant measures for Feller processes , 1968, Journal of Applied Probability.

[11] Robert L. Smith,et al. Convergence of selections with applications in optimization , 1991 .

[12] G. Klimov. Existence of a final distribution for an irreducible Feller process with invariant measure , 1985 .

[13] P. Schweitzer. On undiscounted markovian decision processes with compact action spaces , 1985 .

[14] Richard L. Tweedie,et al. Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.

[15] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[16] Raúl Montes-de-Oca,et al. Value iteration in average cost Markov control processes on Borel spaces , 1996 .

[17] Onésimo Hernández-Lerma,et al. Controlled Markov Processes , 1965 .

[18] Marie Duflo. Méthodes récursives aléatoires , 1990 .

[19] O. Hernández-Lerma,et al. Average cost Markov control processes with weighted norms: value iteration , 1994 .

[20] E. Denardo,et al. Multichain Markov Renewal Programs , 1968 .

[21] O. Hernández-Lerma,et al. Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces---Unbounded Costs , 1994 .

[22] M. K. Ghosh,et al. Discrete-time controlled Markov processes with average cost criterion: a survey , 1993 .

[23] O. Hernández-Lerma,et al. Average cost optimal policies for Markov control processes with Borel state space and unbounded costs , 1990 .