论文信息 - Dynamic Programming for Discrete-Time Systems with Uncertain Gain

Dynamic Programming for Discrete-Time Systems with Uncertain Gain

We generalise the optimisation technique of dynamic programming for discretetime systems with an uncertain gain function. We assume that uncertainty about the gain function is described by an imprecise probability model, which generalises the well-known Bayesian, or precise, models. We compare various optimality criteria that can be associated with such a model, and which coincide in the precise case: maximality, robust optimality and maximinity. We show that (only) for the first two an optimal feedback can be constructed by solving a Bellman-like equation.

Gert de Cooman | Matthias C. M. Troffaes | G. Cooman | M. Troffaes

[1] Morgane Cheve,et al. Optimal pollution control under imprecise environmental risk and irreversibility , 2000 .

[2] G. Shafer. The Enterprise of Knowledge: An Essay on Knowledge, Credal Probability, and Chance , 1982 .

[3] R. Bellman. Dynamic programming. , 1957, Science.

[4] P. Walley. Statistical Reasoning with Imprecise Probabilities , 1990 .

[5] I. Levi,et al. The Enterprise of Knowledge: An Essay on Knowledge, Credal Probability, and Chance , 1983 .

[6] Francisco Javier Girón González-Torre,et al. Quasi-Bayesian behaviour: a more realistic approach to decision making? , 1980 .

[7] Lev V. Utkin,et al. Imprecise Reliability Models for the General Lifetime Distribution Classes , 1999, ISIPTA.

[8] F. J. Girón,et al. Quasi-Bayesian Behaviour: A more realistic approach to decision making? , 1980 .

[9] Isaac Levi,et al. The Enterprise Of Knowledge , 1980 .

[10] D. Harmanec. Generalizing Markov decision processes to imprecise probabilities , 2002 .