Computational comparison of value iteration algorithms for discounted Markov decision processes
[1] Ronald A. Howard, et al. Dynamic Programming and Markov Processes, 1960.
[2] D. Blackwell. Discounted Dynamic Programming, 1965.
[3] J. MacQueen, et al. Letter to the Editor - A Test for Suboptimal Actions in Markovian Decision Problems, 1967, Oper. Res.
[4] Harold J. Kushner, et al. Accelerated procedures for the solution of discrete Markov control problems, 1971.
[5] Evan L. Porteus. Some Bounds for Discounted Sequential Decision Processes, 1971.
[6] H. Kushner. Introduction to stochastic control, 1971.
[7] Evan L. Porteus. Bounds and Transformations for Discounted Finite Markov Decision Chains, 1975, Oper. Res.
[8] J.A.E.E. van Nunen, et al. The action elimination algorithm for Markov decision processes, 1976.
[9] Evan L. Porteus, et al. Technical Note - Accelerated Computation of the Expected Discounted Return in a Markov Chain, 1978, Oper. Res.
[10] L. Thomas. Second order bounds for Markov Decision Processes, 1981.