论文信息 - The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model : (prepublication) - 字舞流文

The asymptotic behaviour of the minimal total expected cost for the denumerable state Markov decision model : (prepublication)

P. Schweitzer | A. Hordijk | H. Tijms

[1] Kai Lai Chung,et al. Markov Chains with Stationary Transition Probabilities , 1961 .

[2] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.

[3] P. Schweitzer. Perturbation theory and Markovian decision processes. , 1965 .

[4] A. Hordijk,et al. Convergence results and approximations for optimal (s,s) policies , 1974 .

[5] Mark R. Lembersky. On Maximal Rewards and $|varepsilon$-Optimal Policies in Continuous Time Markov Decision Chains , 1974 .

[6] Paul J. Schweitzer,et al. Perturbation Theory and Undiscounted Markov Renewal Programming , 1969, Oper. Res..

[7] E. Denardo. A Markov Decision Problem , 1973 .

[8] S. Ross. Arbitrary State Markovian Decision Processes , 1968 .

[9] W. Barry. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process , 1965 .

[10] E. Lanery,et al. Étude asymptotique des systèmes markoviens à commande , 1967 .