Paper information - Computing the discounted return in Markov and semi-Markov chains

Computing the discounted return in Markov and semi-Markov chains

This paper addresses the problem of computing the expected discounted return in finite Markov and semi-Markov chains. The objective is to shed light on two questions. First, which iterative methods hold the most promise? Second, when are iterative methods preferred to Gaussian elimination? A set of twenty-seven randomly generated problems is used to compare the performance of the methods considered. The following observations apply to the problems generated here. Gauss-Seidel is not preferred to Pre-Jacobi in general. However, if the matrix is reordered in a certain way and the author's row-sum extrapolation is used, then Gauss-Seidel is preferred. Converting a semi-Markov problem into a Markov one using a transformation due to Schweitzer does not yield improved performance. A method analogous to symmetric successive overrelaxation (SSOR) in numerical analysis yields improved performance, especially when the row-sum extrapolation is used only sparingly. This method is then compared to Gaussian elimination and is found to be superior for most of the problems generated.
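To make the comparison in the abstract concrete, the following is a minimal Python sketch of the underlying computation, under stated assumptions rather than the paper's actual procedure. It assumes the expected discounted return `v` of a finite Markov chain solves `v = r + beta * P v`, where the transition matrix `P`, the reward vector `r`, and the discount factor `beta` are illustrative values chosen here, not data from the paper. "Pre-Jacobi" is taken to mean plain successive approximation, and the Gauss-Seidel variant shown divides by the diagonal term; the author's row-sum extrapolation, the matrix reordering, the Schweitzer transformation, and the SSOR-like method are not reproduced.

```python
def pre_jacobi(P, r, beta, tol=1e-10, max_iter=10_000):
    """Successive approximation: v_new = r + beta * P v, using only old values."""
    n = len(r)
    v = [0.0] * n
    for k in range(1, max_iter + 1):
        new_v = [r[i] + beta * sum(P[i][j] * v[j] for j in range(n))
                 for i in range(n)]
        diff = max(abs(a - b) for a, b in zip(new_v, v))
        v = new_v
        if diff < tol:
            return v, k
    return v, max_iter


def gauss_seidel(P, r, beta, tol=1e-10, max_iter=10_000):
    """Sweep through the states in order, reusing values updated in this sweep."""
    n = len(r)
    v = [0.0] * n
    for k in range(1, max_iter + 1):
        diff = 0.0
        for i in range(n):
            off_diag = sum(P[i][j] * v[j] for j in range(n) if j != i)
            new_vi = (r[i] + beta * off_diag) / (1.0 - beta * P[i][i])
            diff = max(diff, abs(new_vi - v[i]))
            v[i] = new_vi  # later states in this sweep see the new value
        if diff < tol:
            return v, k
    return v, max_iter


def gaussian_elimination(P, r, beta):
    """Direct solve of (I - beta * P) v = r with partial pivoting."""
    n = len(r)
    # Augmented matrix [I - beta * P | r].
    A = [[(1.0 if i == j else 0.0) - beta * P[i][j] for j in range(n)] + [r[i]]
         for i in range(n)]
    for c in range(n):
        pivot = max(range(c, n), key=lambda i: abs(A[i][c]))
        A[c], A[pivot] = A[pivot], A[c]
        for i in range(c + 1, n):
            factor = A[i][c] / A[c][c]
            for j in range(c, n + 1):
                A[i][j] -= factor * A[c][j]
    v = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(A[i][j] * v[j] for j in range(i + 1, n))
        v[i] = (A[i][n] - tail) / A[i][i]
    return v


# Illustrative three-state chain (assumed values, not from the paper).
P = [[0.5, 0.3, 0.2],
     [0.1, 0.6, 0.3],
     [0.2, 0.2, 0.6]]
r = [1.0, 2.0, 0.5]
beta = 0.9

v_pj, iters_pj = pre_jacobi(P, r, beta)
v_gs, iters_gs = gauss_seidel(P, r, beta)
v_ge = gaussian_elimination(P, r, beta)
print("Pre-Jacobi iterations:", iters_pj)
print("Gauss-Seidel iterations:", iters_gs)
print("Direct solution:", v_ge)
```

All three routines should agree to within the stopping tolerance on this small example. The iteration counts printed are specific to this toy chain and say nothing about the paper's twenty-seven test problems. In the semi-Markov case the single factor `beta` would be replaced by transition-dependent discounting, which this sketch does not model.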

Evan L. Porteus
