Associative search network: A reinforcement learning associative memory

An associative memory system is presented which does not require a “teacher” to provide the desired associations. For each input key it conducts a search for the output pattern which optimizes an external payoff or reinforcement signal. The associative search network (ASN) combines pattern recognition and function optimization capabilities in a simple and effective way. We define the associative search problem, discuss conditions under which the associative search network is capable of solving it, and present results from computer simulations. The synthesis of sensory-motor control surfaces is discussed as an example of the associative search problem.

[1]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[2]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[3]  F. Downton Stochastic Approximation , 1969, Nature.

[4]  Marvin Minsky,et al.  Perceptrons: An Introduction to Computational Geometry , 1969 .

[5]  H. C. LONGUET-HIGGINS,et al.  Non-Holographic Associative Memory , 1969, Nature.

[6]  Jerry M. Mendel,et al.  Adaptive, learning, and pattern recognition systems : theory and applications , 1970 .

[7]  A. H. Klopf,et al.  Brain Function and Adaptive Systems: A Heterostatic Theory , 1972 .

[8]  Kaoru Nakano,et al.  Associatron-A Model of Associative Memory , 1972, IEEE Trans. Syst. Man Cybern..

[9]  Bernard Widrow,et al.  Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[10]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[11]  M. L. Tsetlin,et al.  Automaton theory and modeling of biological systems , 1973 .

[12]  Leon N. Cooper,et al.  A possible organization of animal memory and learning , 1973 .

[13]  E Harth,et al.  Alopex: a stochastic method for determining visual receptive fields. , 1974, Vision research.

[14]  K. Narendra,et al.  Learning AutomataA Survey , 1974 .

[15]  Kumpati S. Narendra,et al.  Learning Automata - A Survey , 1974, IEEE Trans. Syst. Man Cybern..

[16]  R. Jindra Mass action in the nervous system W. J. Freeman, Academic Press, New York (1975), 489 pp., (hard covers). $34.50 , 1976, Neuroscience.

[17]  R. Didday A model of visuomotor mechanisms in the frog optic tectum , 1976 .

[18]  Stephen A. Ritz,et al.  Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .

[19]  Teuvo Kohonen,et al.  Associative memory. A system-theoretical approach , 1977 .

[20]  E. John,et al.  The neurophysiology of information processing and cognition. , 1978, Annual review of psychology.

[21]  C. C. Wood Variations on a theme by Lashley: lesion experiments on the neural model of Anderson, Silverstein, Ritz, and Jones. , 1978, Psychological review.

[22]  J. Albus Mechanisms of planning and problem solving in the brain , 1979 .

[23]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[24]  John S. Edwards,et al.  The Hedonistic Neuron: A Theory of Memory, Learning and Intelligence , 1983 .

[25]  Ina Ruck,et al.  USA , 1969, The Lancet.

[26]  Jerry M. Mendel,et al.  Reinforcement-learning control and pattern recognition systems , 1994 .

[27]  S. Cleveland,et al.  Dynamic properties of Renshaw cells: Frequency response characteristics , 1977, Biological Cybernetics.

[28]  H. Wigström,et al.  A neuron model with learning capability and its relation to mechanisms of association , 1973, Kybernetik.

[29]  S.-I. Amari,et al.  Neural theory of association and concept-formation , 1977, Biological Cybernetics.

[30]  S. Grossberg,et al.  Adaptive pattern classification and universal recoding: I. Parallel development and coding of neural feature detectors , 1976, Biological Cybernetics.

[31]  Stephen Grossberg,et al.  Adaptive pattern classification and universal recoding: II. Feedback, expectation, olfaction, illusions , 1976, Biological Cybernetics.

[32]  T. Poggio,et al.  On optimal nonlinear associative recall , 1975, Biological Cybernetics.

[33]  E. Oja,et al.  Fast adaptive formation of orthogonalizing filters and associative memory in recurrent networks of neuron-like elements , 1976, Biological Cybernetics.