Weak conditions for the existence of optimal stationary policies in average Markov decision chains with unbounded costs
暂无分享,去创建一个
[1] L. Sennott. A new condition for the existence of optimal stationary policies in average cost Markov decision processes , 1986 .
[2] R. Cavazos-Cadena. Necessary conditions for the optimality equation in average-reward Markov decision processes , 1989 .
[3] L. Sennott. A new condition for the existence of optimum stationary policies in average cost Markov decision processes - Unbounded cost case , 1986, 1986 25th IEEE Conference on Decision and Control.
[4] R. Cavazos-Cadena. Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains , 1988 .
[5] Daniel P. Heyman,et al. Stochastic models in operations research , 1982 .
[6] Arie Hordijk,et al. Dynamic programming and Markov potential theory , 1974 .