Identities relating forward and backward treatments of optimisation
暂无分享,去创建一个
A general form of Green's theorem is used to derive relations between expected costs for an optimisation problem and the distribution of state variable. A characterisation of the optimal stopping set and an alternative proof of the Howard improvement lemma emerge as non-trivial consequences.
[1] D. W. Reid,et al. Optimal Parameter Selection of Parabolic Systems , 1980, Math. Oper. Res..
[2] P. Whittle. An approximate characterisation of optimal stopping boundaries , 1973, Journal of Applied Probability.
[3] Cyrus Derman,et al. Finite State Markovian Decision Processes , 1970 .
[4] D. Blackwell. Discounted Dynamic Programming , 1965 .
[5] C. Coulson,et al. Mathematics of Physics and Chemistry , 1957, Nature.