The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model : (prepublication)
暂无分享,去创建一个
[1] Kai Lai Chung,et al. Markov Chains with Stationary Transition Probabilities , 1961 .
[2] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.
[3] P. Schweitzer. Perturbation theory and Markovian decision processes. , 1965 .
[4] A. Hordijk,et al. Convergence results and approximations for optimal (s,s) policies , 1974 .
[5] Mark R. Lembersky. On Maximal Rewards and $|varepsilon$-Optimal Policies in Continuous Time Markov Decision Chains , 1974 .
[6] Paul J. Schweitzer,et al. Perturbation Theory and Undiscounted Markov Renewal Programming , 1969, Oper. Res..
[7] E. Denardo. A Markov Decision Problem , 1973 .
[8] S. Ross. Arbitrary State Markovian Decision Processes , 1968 .
[9] W. Barry. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process , 1965 .
[10] E. Lanery,et al. Étude asymptotique des systèmes markoviens à commande , 1967 .