论文信息 - Technical Note - Identifying Forecast Horizons in Nonhomogeneous Markov Decision Processes

Technical Note - Identifying Forecast Horizons in Nonhomogeneous Markov Decision Processes

A procedure for identifying forecast horizons in nonhomogeneous Markov decision processes, based on convergence results for relative value functions, is developed. Two different algorithmic implementations of this procedure are discussed, and a closed form expression for computing sufficiently long horizons to guarantee epsilon optimality is presented.

Wallace J. Hopp | W. Hopp

[1] Thomas E. Morton,et al. Infinite-Horizon Dynamic Programming Models - A Planning-Horizon Formulation , 1979, Oper. Res..

[2] D. White,et al. Dynamic programming, Markov chains, and the method of successive approximations , 1963 .

[3] Robert L. Smith,et al. Conditions for the Existence of Planning Horizons , 1984, Math. Oper. Res..

[4] Dean Isaacson,et al. Markov Chains: Theory and Applications , 1976 .

[5] M. Bartlett,et al. Weak ergodicity in non-homogeneous Markov chains , 1958, Mathematical Proceedings of the Cambridge Philosophical Society.

[6] Shinhong Kim,et al. A Partially Observable Markov Decision Process with Lagged Information , 1987 .

[7] Chelsea C. White,et al. Parameter Imprecision in Finite State, Finite Action Dynamic Programs , 1986, Oper. Res..

[8] Marius Iosifescu,et al. Finite Markov Processes and Their Applications , 1981 .

[9] Edward J. Sondik,et al. The Optimal Control of Partially Observable Markov Processes over a Finite Horizon , 1973, Oper. Res..

[10] A. Federgruen,et al. The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms , 1978 .

[11] A. Federgruen,et al. A note on simultaneous recurrence conditions on a set of denumerable stochastic matrices : (preprint) , 1978 .