论文信息 - A simple condition for regularity in negative programming

A simple condition for regularity in negative programming

A simple condition (the ‘bridging condition') is given for a Markov decision problem with non-negative costs to enjoy the regularity properties enunciated in Theorem 1. The bridging condition is sufficient for regularity, and is not far from being necessary, in a sense explained in Section 2. In Section 8 we consider the different classes of terminal loss functions (domains of attraction) associated with different solutions of (14). Some conjectures concerning these domains of attraction are either proved, or disproved by counter-example.

P. Whittle

[1] D. Robinson. Markov decision chains with unbounded costs and applications to the control of queues , 1976, Advances in Applied Probability.

[2] Manfred SchÄl,et al. Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal , 1975 .

[3] Arie Hordijk,et al. Dynamic programming and Markov potential theory , 1974 .

[4] J. Harrison. Discrete Dynamic Programming with Unbounded Rewards , 1972 .

[5] K. Hinderer,et al. Foundations of Non-stationary Dynamic Programming with Discrete Time Parameter , 1970 .

[6] C. Derman,et al. A SOLUTION TO A COUNTABLE SYSTEM OF EQUATIONS ARISING IN MARKOVIAN DECISION PROCESSES. , 1966 .