论文信息 - Asymptotic behavior of a hierarchical system of learning automata

Asymptotic behavior of a hierarchical system of learning automata

Learning automata arranged in a two-level hierarchy are considered. The automata operate in a stationary random environment and update their action probabilities according to the linear-reward- -penalty algorithm at each level. Unlike some hierarchical systems previously proposed, no information transfer exists from one level to another, and yet the hierarchy possesses good convergence properties. Using weak-convergence concepts it is shown that for large time and small values of parameters in the algorithm, the evolution of the optimal path probability can be represented by a diffusion whose parameters can be computed explicitly.

Mandayam A. L. Thathachar | K. M. Ramachandran

[1] P. Billingsley,et al. Convergence of Probability Measures , 1969 .

[2] Kumpati S. Narendra,et al. Learning Automata - A Survey , 1974, IEEE Trans. Syst. Man Cybern..

[3] M. Thathachar,et al. Asymptotic behaviour of a learning algorithm , 1984 .

[4] H. Kushner,et al. Averaging Methods for the Asymptotic Analysis of Learning and Adaptive Systems, with Small Adjustment Rate. Analysis of Nonlinear Stochastic Systems with Wide-Band Inputs. , 1980 .

[5] H. Kushner. Introduction to stochastic control , 1971 .

[6] M. Thathachar,et al. A Hierarchical System of Learning Automata , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[7] S. Lakshmivarahan,et al. Learning Algorithms Theory and Applications , 1981 .