Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

[1]  Manfred Schäl, et al. Average Optimality in Dynamic Programming with General State Space, 1993, Math. Oper. Res.

[2]  R. Dekker. Counter examples for compact action Markov decision chains with average reward criteria, 1987.

[3]  Irwin E. Schochetman. Pointwise versions of the maximum theorem with applications in optimization, 1990.

[4]  Martin L. Puterman, et al. On the Convergence of Policy Iteration in Finite State Undiscounted Markov Decision Processes: The Unichain Case, 1987, Math. Oper. Res.

[5]  Onésimo Hernández-Lerma. Existence of average optimal policies in Markov control processes with strictly unbounded costs, 1993, Kybernetika.

[6]  Linn I. Sennott, et al. Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs, 1989, Oper. Res.

[7]  O. Hernández-Lerma, et al. Discrete-time Markov control processes, 1999.

[8]  Onésimo Hernández-Lerma, et al. Average cost Markov control processes with weighted norms: existence of canonical policies, 1995.

[9]  P. Glynn. A Lyapunov Bound for Solutions of Poisson's Equation, 1989.

[10]  V. Benes, et al. Finite regular invariant measures for Feller processes, 1968, Journal of Applied Probability.

[11]  Robert L. Smith, et al. Convergence of selections with applications in optimization, 1991.

[12]  G. Klimov. Existence of a final distribution for an irreducible Feller process with invariant measure, 1985.

[13]  P. Schweitzer. On undiscounted markovian decision processes with compact action spaces, 1985.

[14]  Richard L. Tweedie, et al. Markov Chains and Stochastic Stability, 1993, Communications and Control Engineering Series.

[15]  Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.

[16]  Raúl Montes-de-Oca, et al. Value iteration in average cost Markov control processes on Borel spaces, 1996.

[17]  Onésimo Hernández-Lerma, et al. Controlled Markov Processes, 1965.

[18]  Marie Duflo. Méthodes récursives aléatoires, 1990.

[19]  O. Hernández-Lerma, et al. Average cost Markov control processes with weighted norms: value iteration, 1994.

[20]  E. Denardo, et al. Multichain Markov Renewal Programs, 1968.

[21]  O. Hernández-Lerma, et al. Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces: Unbounded Costs, 1994.

[22]  M. K. Ghosh, et al. Discrete-time controlled Markov processes with average cost criterion: a survey, 1993.

[23]  O. Hernández-Lerma, et al. Average cost optimal policies for Markov control processes with Borel state space and unbounded costs, 1990.