Adaptive aggregation methods for discounted dynamic programming
暂无分享,去创建一个
[1] J. MacQueen. A MODIFIED DYNAMIC PROGRAMMING METHOD FOR MARKOVIAN DECISION PROBLEMS , 1966 .
[2] Harold J. Kushner,et al. Accelerated procedures for the solution of discrete Markov control problems , 1971 .
[3] Evan L. Porteus. Some Bounds for Discounted Sequential Decision Processes , 1971 .
[4] M. Puterman,et al. Modified Policy Iteration Algorithms for Discounted Markov Decision Problems , 1978 .
[5] W. Miranker,et al. Acceleration by aggregation of successive approximation methods , 1982 .
[6] Martin L. Puterman,et al. Action Elimination Procedures for Modified Policy Iteration Algorithms , 1982, Oper. Res..
[7] Roy Mendelssohn,et al. An Iterative Aggregation Procedure for Markov Decision Processes , 1982, Oper. Res..
[8] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .