Contraction mappings underlying undiscounted Markov decision problems—II
暂无分享,去创建一个
[1] E. Denardo. CONTRACTION MAPPINGS IN THE THEORY UNDERLYING DYNAMIC PROGRAMMING , 1967 .
[2] D. White,et al. Dynamic programming, Markov chains, and the method of successive approximations , 1963 .
[3] P. Schweitzer. Iterative solution of the functional equations of undiscounted Markov renewal programming , 1971 .
[4] Cyrus Derman,et al. Finite State Markovian Decision Processes , 1970 .
[5] N. Hastings,et al. Note---A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains , 1976 .
[6] N. A. J. Hastings. Technical Note - Bounds on the Gain of a Markov Decision Process , 1971, Oper. Res..
[7] M. Bartlett,et al. Weak ergodicity in non-homogeneous Markov chains , 1958, Mathematical Proceedings of the Cambridge Philosophical Society.
[8] Evan L. Porteus. Some Bounds for Discounted Sequential Decision Processes , 1971 .
[9] William S. Jewell,et al. Markov-Renewal Programming. I: Formulation, Finite Return Models , 1963 .
[10] W. Jewell. MARKOV-RENEWAL PROGRAMMING , 1962 .
[11] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.
[12] P. Schweitzer. Perturbation theory and Markovian decision processes. , 1965 .
[13] Rolf A. Deininger,et al. Generalization of White's Method of Successive Approximations to Periodic Markovian Decision Processes , 1972, Oper. Res..
[14] Paul J. Schweitzer,et al. The Functional Equations of Undiscounted Markov Renewal Programming , 1971, Math. Oper. Res..
[15] T. Morton,et al. Discounting, Ergodicity and Convergence for Markov Decision Processes , 1977 .
[16] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .
[17] H. Tijms,et al. Exponential convergence of products of stochastic matrices , 1977 .
[18] Amedeo R. Odoni,et al. On Finding the Maximal Gain for Markov Decision Processes , 1969, Oper. Res..
[19] J. MacQueen. A MODIFIED DYNAMIC PROGRAMMING METHOD FOR MARKOVIAN DECISION PROBLEMS , 1966 .
[20] E. Denardo,et al. Multichain Markov Renewal Programs , 1968 .
[21] W. Barry. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process , 1965 .
[22] J. MacQueen,et al. Letter to the Editor - A Test for Suboptimal Actions in Markovian Decision Problems , 1967, Oper. Res..
[23] Paul J. Schweitzer,et al. The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems , 1977, Math. Oper. Res..
[24] J. Bather. Optimal decision procedures for finite markov chains. Part I: Examples , 1973, Advances in Applied Probability.
[25] E. Lanery,et al. Étude asymptotique des systèmes markoviens à commande , 1967 .
[26] P. Schweitzer,et al. Geometric convergence of value-iteration in multichain Markov decision problems , 1979, Advances in Applied Probability.