Policy Iteration for Average Cost Markov Control Processes on Borel Spaces
暂无分享,去创建一个
[1] Manfred Schäl,et al. Average Optimality in Dynamic Programming with General State Space , 1993, Math. Oper. Res..
[2] R. Dekker. Counter examples for compact action Markov decision chains with average reward criteria , 1987 .
[3] Irwin E. Schochetman. Pointwise versions of the maximum theorem with applications in optimization , 1990 .
[4] Martin L. Puterman,et al. On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case , 1987, Math. Oper. Res..
[5] Onésimo Hernández-Lerma. Existence of average optimal policies in Markov control processes with strictly unbounded costs , 1993, Kybernetika.
[6] Linn I. Sennott,et al. Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs , 1989, Oper. Res..
[7] O. Hernández-Lerma,et al. Discrete-time Markov control processes , 1999 .
[8] Onésimo Hernández-Lerma,et al. Average cost Markov control processes with weighted norms: existence of canonical policies , 1995 .
[9] P. Glynn. A Lyapunov Bound for Solutions of Poisson's Equation , 1989 .
[10] V. Benes,et al. Finite regular invariant measures for Feller processes , 1968, Journal of Applied Probability.
[11] Robert L. Smith,et al. Convergence of selections with applications in optimization , 1991 .
[12] G. Klimov. Existence of a final distribution for an irreducible Feller process with invariant measure , 1985 .
[13] P. Schweitzer. On undiscounted markovian decision processes with compact action spaces , 1985 .
[14] Richard L. Tweedie,et al. Markov Chains and Stochastic Stability , 1993, Communications and Control Engineering Series.
[15] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[16] Raúl Montes-de-Oca,et al. Value iteration in average cost Markov control processes on Borel spaces , 1996 .
[17] Onésimo Hernández-Lerma,et al. Controlled Markov Processes , 1965 .
[18] Marie Duflo. Méthodes récursives aléatoires , 1990 .
[19] O. Hernández-Lerma,et al. Average cost Markov control processes with weighted norms: value iteration , 1994 .
[20] E. Denardo,et al. Multichain Markov Renewal Programs , 1968 .
[21] O. Hernández-Lerma,et al. Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces---Unbounded Costs , 1994 .
[22] M. K. Ghosh,et al. Discrete-time controlled Markov processes with average cost criterion: a survey , 1993 .
[23] O. Hernández-Lerma,et al. Average cost optimal policies for Markov control processes with Borel state space and unbounded costs , 1990 .