The Shift-Function Approach for Markov Decision Processes with Unbounded Returns.
暂无分享,去创建一个
[1] J. Wessels. Markov programming by successive approximations by respect to weighted supremum norms , 1976, Advances in Applied Probability.
[2] J. Harrison. Discrete Dynamic Programming with Unbounded Rewards , 1972 .
[3] J. MacQueen. A MODIFIED DYNAMIC PROGRAMMING METHOD FOR MARKOVIAN DECISION PROBLEMS , 1966 .
[4] R. Strauch. Negative Dynamic Programming , 1966 .
[5] E. Lehmann. Ordered Families of Distributions , 1955 .
[6] D. Blackwell. Discounted Dynamic Programming , 1965 .
[7] Shaler Stidham,et al. Individual versus Social Optimization in Exponential Congestion Systems , 1977, Oper. Res..
[8] Evan L. Porteus. Bounds and Transformations for Discounted Finite Markov Decision Chains , 1975, Oper. Res..
[9] Michael J. Magazine,et al. A Classified Bibliography of Research on Optimal Design and Control of Queues , 1977, Oper. Res..