A simple condition for regularity in negative programming
暂无分享,去创建一个
[1] D. Robinson. Markov decision chains with unbounded costs and applications to the control of queues , 1976, Advances in Applied Probability.
[2] Manfred SchÄl,et al. Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal , 1975 .
[3] Arie Hordijk,et al. Dynamic programming and Markov potential theory , 1974 .
[4] J. Harrison. Discrete Dynamic Programming with Unbounded Rewards , 1972 .
[5] K. Hinderer,et al. Foundations of Non-stationary Dynamic Programming with Discrete Time Parameter , 1970 .
[6] C. Derman,et al. A SOLUTION TO A COUNTABLE SYSTEM OF EQUATIONS ARISING IN MARKOVIAN DECISION PROCESSES. , 1966 .