Growth Optimality for Branching Markov Decision Chains

This paper considers a multiplicative process, called a branching Markov decision chain, in which the output at the end of the Nth period equals the product of N nonnegative matrices, chosen at the beginning of periods 1, …, N, respectively, times a fixed positive terminal reward vector. The transition matrices are assumed to be drawn from a finite set of matrices given in product form (i.e., the rows of the matrices can be selected independently from finite sets of nonnegative row vectors). For each coordinate s we define the geometric and algebraic growth rates of the sth coordinate of the output stream. These growth rates are defined so that the magnitude of the corresponding sequence is of the order $\alpha^N N^k$, where $\alpha$ is the geometric growth rate and k is the algebraic growth rate. The main result of this paper is the constructive establishment of the existence of a transition matrix whose repeated use will guarantee, for each coordinate, the achievement of the best geometri...
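
The two growth rates can be illustrated numerically for the simplest case, the repeated use of one fixed transition matrix. The sketch below is not the paper's construction: the matrix `P`, the terminal reward `r`, and the helper names `output_stream` and `estimate_growth` are hypothetical choices made for illustration, and the estimator simply assumes the output behaves like a constant times $\alpha^N N^k$ and solves for $\alpha$ and k from successive ratios at periods n and 2n. No optimization over row choices is attempted.

```python
import math


def mat_vec(P, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(p * xi for p, xi in zip(row, x)) for row in P]


def output_stream(P, r, n_periods):
    """Return [x_0, ..., x_n] with x_N = P^N r (one fixed matrix reused each period)."""
    xs = [list(r)]
    for _ in range(n_periods):
        xs.append(mat_vec(P, xs[-1]))
    return xs


def estimate_growth(xs, s, n):
    """Estimate (alpha, k) for coordinate s, assuming x_N(s) ~ c * alpha**N * N**k.

    Under that assumption log(x_{N+1}/x_N) = log(alpha) + k*log(1 + 1/N),
    so the ratios at N = n and N = 2n give two equations in two unknowns.
    Requires len(xs) > 2*n + 1 and strictly positive entries in coordinate s.
    """
    d_n = math.log(xs[n + 1][s] / xs[n][s])
    d_2n = math.log(xs[2 * n + 1][s] / xs[2 * n][s])
    c_n = math.log1p(1.0 / n)
    c_2n = math.log1p(1.0 / (2 * n))
    k = (d_n - d_2n) / (c_n - c_2n)
    alpha = math.exp(d_n - k * c_n)
    return alpha, k


# Hypothetical example: a nonnegative matrix with a repeated eigenvalue 2.
# Here P^N r = (2^N + N * 2^(N-1), 2^N), so coordinate 0 grows like 2^N * N
# and coordinate 1 grows like 2^N.
P = [[2, 1],
     [0, 2]]
r = [1, 1]  # positive terminal reward vector

xs = output_stream(P, r, 401)
alpha0, k0 = estimate_growth(xs, 0, 200)  # close to alpha = 2, k = 1
alpha1, k1 = estimate_growth(xs, 1, 200)  # close to alpha = 2, k = 0
print(alpha0, k0)
print(alpha1, k1)
```

Both coordinates share the geometric rate 2, and they differ only in the algebraic rate, which is why the two rates are tracked separately for each coordinate.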
