Paper information - Denumerable Markov Decision Chains: Sensitive Optimality Criteria

Denumerable Markov Decision Chains: Sensitive Optimality Criteria

Assuming compact metric action spaces and the usual continuity properties of the immediate costs and of the transition probabilities, we study the existence of average and/or sensitive optimal stationary policies. We generalize results from the unichain case to the multichain case. It turns out that the simultaneous Doeblin condition is not sufficient. However, continuity of the ergodic potential guarantees not only average optimality but also bias and Blackwell optimality. Relations between these conditions and uniform strong ergodicity are discussed. The results are also extended to the case of unbounded costs.

Rommert Dekker | Arie Hordijk

[1] G. Luecke, et al. Strongly ergodic Markov chains and rates of convergence using spectral conditions, 1978.

[2] Arie Hordijk, et al. Dynamic programming and Markov potential theory, 1974.

[3] A. Federgruen, et al. Denumerable state semi-Markov decision processes with unbounded costs, average cost criterion (preprint), 1979.

[4] A. F. Veinott. On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting, 1966.

[5] A. F. Veinott. Discrete Dynamic Programming with Sensitive Discount Optimality Criteria, 1969.