Combining Hebbian and reinforcement learning in a minibrain model

A toy model of a neural network in which both Hebbian learning and reinforcement learning occur is studied. The problem of 'path interference', which makes that the neural net quickly forgets previously learned input-output relations is tackled by adding a Hebbian term (proportional to the learning rate nu) to the reinforcement term (proportional to delta) in the learning rule. It is shown that the number of learning steps is reduced considerably if 1/4<nu/delta<1/2, i.e. if the Hebbian term is neither too small nor too large compared to the reinforcement term.

[1]  Marvin Minsky,et al.  An introduction to computational geometry , 1969 .

[2]  R. Palmer,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[3]  M. Heerema,et al.  Derivation of Hebb's rule , 1999 .

[4]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[5]  Marvin Minsky,et al.  Perceptrons: An Introduction to Computational Geometry , 1969 .

[6]  P. Bak,et al.  Adaptive learning by extremal dynamics and negative feedback. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[8]  I. T. Draper Human Neuropsychology , 1979 .

[9]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Berndt Müller,et al.  Neural networks: an introduction , 1990 .

[12]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[13]  Alstrom,et al.  Versatility and adaptive performance. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[14]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[15]  P. Bak,et al.  Learning from mistakes , 1997, Neuroscience.