Constrained Markovian decision processes: the dynamic programming approach
暂无分享,去创建一个
We consider semicontinuous controlled Markov models in discrete time with total expected losses. Only control strategies which meet a set of given constraint inequalities are admissible. One has to build an optimal admissible strategy. The main result consists in the constructive development of optimal strategy with the help of the dynamic programming method. The model studied covers the case of a finite horizon and the case of a homogeneous discounted model with different discount factors.
[1] A. Piunovskiy. Optimal Control of Random Sequences in Problems with Constraints , 1997 .
[2] P. WHITTLE,et al. Markov Processes and Their Applications , 1962, Nature.
[3] M. Schäl. On dynamic programming: Compactness of the space of policies , 1975 .
[4] M. K rn,et al. Stochastic Optimal Control , 1988 .
[5] Eugene A. Feinberg,et al. Constrained Markov Decision Models with Weighted Discounted Rewards , 1995, Math. Oper. Res..