Recent Developments in Learning Automata

The paper surveys the major developments in learning automata theory and applications in the last two decades. Since a survey article on the subject appeared in 1977, the emphasis is on subsequent developments. Of particular importance is some recent work on the use of many automata interacting in a decentralized manner. This framework provides a conceptual focus and an analytical basis for future research on modeling and control of complex systems. Applications of the theory are also reviewed, with special attention devoted to the problem of traffic routing in telecommunication networks.

[1]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[2]  R. Bellman A Markovian Decision Process , 1957 .

[3]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[4]  M. L. Tsetlin On the Behavior of Finite Automata in Random Media , 1961 .

[5]  Harley Bornbach,et al.  An introduction to mathematical learning theory , 1967 .

[6]  Radu Theodorescu,et al.  Random processes and learning , 1969 .

[7]  M. L. Tsetlin,et al.  Automaton theory and modeling of biological systems , 1973 .

[8]  Tripathi,et al.  Application of learning automata to telephone traffic routing problems , 1974 .

[9]  Kumpati S. Narendra,et al.  Learning Automata - A Survey , 1974, IEEE Trans. Syst. Man Cybern..

[10]  Norio Baba,et al.  On the Learning Behavior of Stochastic Automata Under a Nonstationary Random Environment , 1975, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[12]  Kumpati S. Narendra,et al.  Application of Learning Automata to Telephone Traffic Routing and Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Daniel E. Koditschek,et al.  Fixed Structure Automata in a Multi-Teacher Environment , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  V. Borkar,et al.  Adaptive control of Markov chains, I: Finite parameter set , 1979 .

[15]  Kumpati S. Narendra,et al.  On the Behavior of a Learning Automaton in a Changing Environment with Application to Telephone Traffic Routing , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  Y. M. El-Fattah,et al.  Stochastic Automata Modeling of Certain Problems of Collective Behavior , 1980, IEEE Transactions on Systems, Man, and Cybernetics.

[17]  Robert M. Glorioso Engineering Intelligent Systems , 1980 .

[18]  S. Lakshmivarahan,et al.  Learning Algorithms Theory and Applications , 1981 .

[19]  M. Thathachar,et al.  A Hierarchical System of Learning Automata , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[20]  S. Lakshmivarahan,et al.  Learning Algorithms for Two-Person Zero-Sum Stochastic Games with Incomplete Information , 1981, Math. Oper. Res..

[21]  P. R. Srikantakumar,et al.  A LEARNING MODEL FOR ROUTING IN TELEPHONE NETWORKS , 1982 .

[22]  K. Narendra,et al.  Learning Algorithms for Two-Person Zero-Sum Stochastic Games with Incomplete Information: A Unified Approach , 1982 .

[23]  P. Kumar,et al.  Optimal adaptive controllers for unknown Markov chains , 1982 .

[24]  Y. M. Abdel-Fattah Multi-automaton games: A rationale for expedient collective behavior , 1982 .

[25]  K. R. Ramakrishnan,et al.  Hierarchical Systems and Cooperative Games of Learning Automata , 1982 .

[26]  Kumpati S. Narendra,et al.  The use of learning algorithms in telephone traffic routing - A methodology , 1983, Autom..

[27]  Norio Baba,et al.  On the learning behaviors of variable-structure stochastic automaton in the general n-teacher environment , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[28]  Yousri M. Abdel-Fattah Fairness and mutual profitability in collective behavior of automata , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[29]  Kumpati S. Narendra,et al.  An N-player sequential stochastic game with identical payoffs , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[30]  D. I. Kountanis,et al.  A reorganization scheme for a hierarchical system of learning automata , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[31]  P. Anandan,et al.  Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[32]  Kumpati S. Narendra,et al.  Learning Models for Decentralized Decision Making , 1985, 1985 American Control Conference.

[33]  Richard Wheeler,et al.  Decentralized learning in finite Markov chains , 1985, 1985 24th IEEE Conference on Decision and Control.

[34]  Stewart W. Wilson Knowledge Growth in an Artificial Animal , 1985, ICGA.

[35]  M. A. L. THATHACHAR,et al.  A new approach to the design of reinforcement schemes for learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[36]  K. S. Narenda,et al.  Routing in communication networks, a case study of learning in large scale systems , 1985 .

[37]  P. Anandan,et al.  Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata , 1986 .

[38]  P. Mars,et al.  Application of Learning Automata to Image Data Compression , 1986 .

[39]  L. G. Mason,et al.  Learning Automata Models for Adaptive Flow Control in Packet-Switching Networks , 1986 .