An Heuristic for Multi-Dimensional Markov Decision Processes

Abstract An heuristic procedure is presented for multi-dimensional Markov decision processes in cases where existing procedures for approximating optimal policies are computationally demanding. It is applicable when certain simple policies are easy to evaluate, and it does not require evaluation of the improved policies. Bounds on the loss of optimality arising from such policies are given and are used to accept or reject any policy derived.
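
The abstract does not specify a particular algorithm. As one illustration of the general idea only, the Python sketch below evaluates a simple base policy exactly, performs a single policy-improvement step without evaluating the improved policy, and accepts or rejects the result using the standard discounted-MDP Bellman-residual bound 2 * ||T V - V||_inf / (1 - gamma) on the loss of optimality. The tabular setting, the function names, and this particular bound are illustrative assumptions and are not taken from the paper.

import numpy as np

def evaluate_policy(P, r, policy, gamma):
    # Exact evaluation of a deterministic stationary policy on a tabular
    # discounted MDP: solve (I - gamma * P_pi) V = r_pi.
    # P: (A, S, S) transition matrices, r: (S, A) rewards,
    # policy: (S,) action indices, gamma: discount factor in (0, 1).
    S = r.shape[0]
    P_pi = P[policy, np.arange(S), :]   # next-state distributions under the policy
    r_pi = r[np.arange(S), policy]      # one-step rewards under the policy
    return np.linalg.solve(np.eye(S) - gamma * P_pi, r_pi)

def improve_and_bound(P, r, base_policy, gamma, tolerance):
    # One policy-improvement step over an easily evaluated base policy.
    # The improved policy itself is never evaluated; its loss of optimality
    # is bounded via the Bellman residual of the base policy's value
    # function, and the policy is accepted or rejected against `tolerance`.
    V = evaluate_policy(P, r, base_policy, gamma)
    Q = r + gamma * np.einsum("asn,n->sa", P, V)   # Q(s, a) computed from V
    improved_policy = Q.argmax(axis=1)
    residual = np.max(np.abs(Q.max(axis=1) - V))   # ||T V - V||_inf
    loss_bound = 2.0 * residual / (1.0 - gamma)    # bound on loss of optimality
    return improved_policy, loss_bound, loss_bound <= tolerance

if __name__ == "__main__":
    # Small random MDP as a usage example (hypothetical data).
    rng = np.random.default_rng(0)
    A, S = 3, 5
    P = rng.dirichlet(np.ones(S), size=(A, S))     # (A, S, S) transition matrices
    r = rng.random((S, A))
    base_policy = np.zeros(S, dtype=int)           # a simple, easily evaluated policy
    policy, bound, accepted = improve_and_bound(P, r, base_policy, gamma=0.9, tolerance=1.0)
    print(policy, bound, accepted)

The accept/reject step mirrors the abstract's use of optimality-loss bounds: the improved policy is kept only when the bound certifies that its value is within the stated tolerance of optimal, so no further evaluation of that policy is needed.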