论文信息 - The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains

The Laurent series, sensitive discount and Blackwell optimality for continuous-time controlled Markov chains

Abstract.This paper gives conditions for the convergence of the Laurent series expansion for a class of continuous-time controlled Markov chains with possibly unbounded reward (or cost) rates and unbounded transition rates. That series is then used to study several optimization criteria, including n-discount optimality (for n=−1,0,1,...), Blackwell optimality, and the maximization of a certain vector criterion that in particular gives gain and bias optimality.

Onésimo Hernández-Lerma | Tomás Prieto-Rumeau | O. Hernández-Lerma | T. Prieto-Rumeau

[1] M. Puterman. Sensitive Discount Optimality in Controlled One-Dimensional Diffusions , 1974 .

[2] Arie Hordijk,et al. Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards , 1999, Math. Methods Oper. Res..

[3] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[4] Howard M. Taylor,et al. A laurent series for the resolvent of a strongly continuous stochastic semi-group , 1976 .

[5] Arie Hordijk,et al. Blackwell optimality in the class of stationary policies in Markov decision chains with a Borel state space and unbounded rewards , 1999, Math. Methods Oper. Res..

[6] Onésimo Hernández-Lerma,et al. Bias Optimality for Continuous-Time Controlled Markov Chains , 2006, SIAM J. Control. Optim..

[7] B. L. Miller,et al. Discrete Dynamic Programming with a Small Interest Rate , 1969 .

[8] O. Hernández-Lerma,et al. Continuous-time controlled Markov chains , 2003 .

[9] D. Blackwell. Discrete Dynamic Programming , 1962 .

[10] A. Yushkevich. On Reducing a Jump Controllable Markov Model to a Model with Discrete Time , 1980 .

[11] Xianping Guo,et al. Drift and monotonicity conditions for continuous-time controlled Markov chains with an average criterion , 2003, IEEE Trans. Autom. Control..

[12] Xianping Guo,et al. Continuous-Time Controlled Markov Chains with Discounted Rewards , 2003 .

[13] A. F. Veinott. Discrete Dynamic Programming with Sensitive Discount Optimality Criteria , 1969 .

[14] O. Hernández-Lerma,et al. Further topics on discrete-time Markov control processes , 1999 .

[15] Onésimo Hernández-Lerma,et al. Bias Optimality versus Strong 0-Discount Optimality in Markov Control Processes with Unbounded Costs , 2003 .

[16] Tosio Kato. Perturbation theory for linear operators , 1966 .