An expected average reward criterion