Parallel algorithms for modules of learning automata

Parallel algorithms are presented for modules of learning automata with the objective of improving their speed of convergence without compromising accuracy. A general procedure suitable for parallelizing a large class of sequential learning algorithms on a shared memory system is proposed. Results are derived to show the quantitative improvements in speed obtainable using parallelization. The efficacy of the procedure is demonstrated by simulation studies on algorithms for common payoff games, parametrized learning automata and pattern classification problems with noisy classification of training samples.

[1]  P. S. Sastry,et al.  Continuous action set learning automata for stochastic optimization , 1994 .

[2]  V. V. Phansalkar,et al.  Decentralized Learning of Nash Equilibria in Multi-Person Stochastic Games With Incomplete Information , 1994, IEEE Trans. Syst. Man Cybern. Syst..

[3]  Shirley Dex,et al.  JR 旅客販売総合システム(マルス)における運用及び管理について , 1991 .

[4]  K. R. Ramakrishnan,et al.  A cooperative game of a pair of learning automata , 1984, Autom..

[5]  Mandayam A. L. Thathachar,et al.  Learning the global maximum with parameterized learning automata , 1995, IEEE Trans. Neural Networks.

[6]  Scott E. Fahlman,et al.  An empirical study of learning speed in back-propagation networks , 1988 .

[7]  S. Lakshmivarahan,et al.  Learning Algorithms Theory and Applications , 1981 .

[8]  P. Anandan,et al.  Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[9]  Harold J. Kushner,et al.  Approximation and Weak Convergence Methods for Random Processes , 1984 .

[10]  Mandayam A. L. Thathachar,et al.  Convergence of teams and hierarchies of learning automata in connectionist systems , 1995, IEEE Trans. Syst. Man Cybern..

[11]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[12]  J. Doob Stochastic processes , 1953 .

[13]  Pierre Priouret,et al.  Adaptive Algorithms and Stochastic Approximations , 1990, Applications of Mathematics.

[14]  Kumpati S. Narendra,et al.  Learning automata - an introduction , 1989 .

[15]  K. Narendra,et al.  Decentralized learning in finite Markov chains , 1985, 1985 24th IEEE Conference on Decision and Control.

[16]  F. Aluffi-Pentini,et al.  Global optimization and stochastic differential equations , 1985 .

[17]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[18]  Mandayam A. L. Thathachar,et al.  Learning Optimal Discriminant Functions through a Cooperative Game of Automata , 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[19]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[20]  Kai Hwang,et al.  Advanced computer architecture - parallelism, scalability, programmability , 1992 .

[21]  Kaddour Najim,et al.  Learning Automata: Theory and Applications , 1994 .