THE ANALYTIC THEORY OF POLICY ITERATION
暂无分享,去创建一个
[1] Martin L. Puterman,et al. On the Convergence of Policy Iteration in Stationary Dynamic Programming , 1979, Math. Oper. Res..
[2] M. Puterman,et al. Modified Policy Iteration Algorithms for Discounted Markov Decision Problems , 1978 .
[3] V. Hee,et al. Strongly convergent dynamic programming , 1978 .
[4] M. Puterman. Optimal control of diffusion processes with reflection , 1977 .
[5] B. Doshi. Continuous Time Control of Markov Processes on an Arbitrary State Space: Discounted Rewards , 1976 .
[6] van der J Jan Wal,et al. Strongly convergent dynamic programming : some results , 1976 .
[7] Uriel G. Rothblum,et al. Normalized Markov Decision Chains I; Sensitive Discount Optimality , 1975, Oper. Res..
[8] S. Lippman. On Dynamic Programming with Unbounded Rewards , 1975 .
[9] J Jaap Wessels,et al. Note---A Note on Dynamic Programming with Unbounded Rewards , 1975 .
[10] J. Wessels. Markov programming by successive approximations by respect to weighted supremum norms , 1976, Advances in Applied Probability.
[11] S. Pliska. Single-person controlled diffusions with discounted costs , 1973 .
[12] N. Furukawa. Markovian Decision Processes with Compact Action Spaces , 1972 .
[13] J. Harrison. Discrete Dynamic Programming with Unbounded Rewards , 1972 .
[14] P. Kakumanu. Continuously Discounted Markov Decision Model with Countable State and Action Space , 1971 .
[15] A. F. Veinott. Discrete Dynamic Programming with Sensitive Discount Optimality Criteria , 1969 .
[16] M. Pollatschek,et al. Algorithms for Stochastic Games with Geometrical Interpretation , 1969 .
[17] B. L. Miller,et al. An Optimality Condition for Discrete Dynamic Programming with no Discounting , 1968 .
[18] B. L. Miller. Finite state continuous time Markov decision processes with an infinite planning horizon , 1968 .
[19] J. Vandergraft. Newton's method for convex operators in partially ordered spaces. , 1967 .
[20] E. Denardo. CONTRACTION MAPPINGS IN THE THEORY UNDERLYING DYNAMIC PROGRAMMING , 1967 .
[21] A. F. Veinott. ON FINDING OPTIMAL POLICIES IN DISCRETE DYNAMIC PROGRAMMING WITH NO DISCOUNTING , 1966 .
[22] R. Strauch. Negative Dynamic Programming , 1966 .
[23] C. Derman. DENUMERABLE STATE MARKOVIAN DECISION PROCESSES: AVERAGE COST CRITERION. , 1966 .
[24] L. Kantorovich,et al. Functional analysis and applied mathematics , 1963 .
[25] D. Blackwell. Discrete Dynamic Programming , 1962 .
[26] R. Kalaba. ON NONLINEAR DIFFERENTIAL EQUATIONS, THE MAXIMUM OPERATION, AND MONOTONE CONVERGENCE, , 1959 .