论文信息 - Contraction mappings underlying undiscounted Markov decision problems—II - 字舞流文

Contraction mappings underlying undiscounted Markov decision problems—II

[1] E. Denardo. CONTRACTION MAPPINGS IN THE THEORY UNDERLYING DYNAMIC PROGRAMMING , 1967 .

[2] D. White,et al. Dynamic programming, Markov chains, and the method of successive approximations , 1963 .

[3] P. Schweitzer. Iterative solution of the functional equations of undiscounted Markov renewal programming , 1971 .

[4] Cyrus Derman,et al. Finite State Markovian Decision Processes , 1970 .

[5] N. Hastings,et al. Note---A Test for Nonoptimal Actions in Undiscounted Finite Markov Decision Chains , 1976 .

[6] N. A. J. Hastings. Technical Note - Bounds on the Gain of a Markov Decision Process , 1971, Oper. Res..

[7] M. Bartlett,et al. Weak ergodicity in non-homogeneous Markov chains , 1958, Mathematical Proceedings of the Cambridge Philosophical Society.

[8] Evan L. Porteus. Some Bounds for Discounted Sequential Decision Processes , 1971 .

[9] William S. Jewell,et al. Markov-Renewal Programming. I: Formulation, Finite Return Models , 1963 .

[10] W. Jewell. MARKOV-RENEWAL PROGRAMMING , 1962 .

[11] J. Bather. Optimal decision procedures for finite Markov chains. Part II: Communicating systems , 1973, Advances in Applied Probability.

[12] P. Schweitzer. Perturbation theory and Markovian decision processes. , 1965 .

[13] Rolf A. Deininger,et al. Generalization of White's Method of Successive Approximations to Periodic Markovian Decision Processes , 1972, Oper. Res..

[14] Paul J. Schweitzer,et al. The Functional Equations of Undiscounted Markov Renewal Programming , 1971, Math. Oper. Res..

[15] T. Morton,et al. Discounting, Ergodicity and Convergence for Markov Decision Processes , 1977 .

[16] Ronald A. Howard,et al. Dynamic Programming and Markov Processes , 1960 .

[17] H. Tijms,et al. Exponential convergence of products of stochastic matrices , 1977 .

[18] Amedeo R. Odoni,et al. On Finding the Maximal Gain for Markov Decision Processes , 1969, Oper. Res..

[19] J. MacQueen. A MODIFIED DYNAMIC PROGRAMMING METHOD FOR MARKOVIAN DECISION PROBLEMS , 1966 .

[20] E. Denardo,et al. Multichain Markov Renewal Programs , 1968 .

[21] W. Barry. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process , 1965 .

[22] J. MacQueen,et al. Letter to the Editor - A Test for Suboptimal Actions in Markovian Decision Problems , 1967, Oper. Res..

[23] Paul J. Schweitzer,et al. The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems , 1977, Math. Oper. Res..

[24] J. Bather. Optimal decision procedures for finite markov chains. Part I: Examples , 1973, Advances in Applied Probability.

[25] E. Lanery,et al. Étude asymptotique des systèmes markoviens à commande , 1967 .

[26] P. Schweitzer,et al. Geometric convergence of value-iteration in multichain Markov decision problems , 1979, Advances in Applied Probability.