Neural network models for zebra finch song production and reinforcement learning

The zebra finch is a standard experimental system for studying learning and generation of temporally extended motor patterns. The first part of this project concerned the evaluation of simple models for the operation and structure of the network in the motor nucleus RA. A directed excitatory chain with a global inhibitory network, for which experimental evidence exists, was found to produce waves of activity similar to those observed in RA; this similarity included one particularly important feature of the measured activity, synchrony between the onset of bursting in one neuron and the offset of bursting in another. Other models, which were simpler and more analytically tractable, were also able to exhibit this feature, but not for parameter values quantitatively close to those observed. Another issue of interest concerns how these networks are initially learned by the bird during song acquisition. The second part of the project concerned the analysis of exemplars of REINFORCE algorithms, a general class of algorithms for reinforcement learning in neural networks, which are on several counts more biologically plausible than standard prescriptions such as backpropagation. The former compared favorably with backpropagation on tasks involving single input-output pairs, though a noise analysis suggested it should not perform so well. On tasks involving trajectory learning, REINFORCE algorithms meet with some success, though the analysis that predicts their success on input-output-pair tasks fails to explain it for trajectories. Thesis Supervisor: H. Sebastian Seung Title: Robert A. Swanson Career Development Assistant Professor in Life Sciences

[1]  Gert Cauwenberghs,et al.  An analog VLSI recurrent neural network learning a continuous-time trajectory , 1996, IEEE Trans. Neural Networks.

[2]  J. Hertz,et al.  Learning short synfire chains by self-organization. , 1996, Network.

[3]  A. C. Yu,et al.  Temporal Hierarchical Control of Singing in Birds , 1996, Science.

[4]  Xiaohui Xie,et al.  Spike-based Learning Rules and Stabilization of Persistent Neural Activity , 1999, NIPS.

[5]  M. Dalva,et al.  Long-range inhibition within the zebra finch song nucleus RA can coordinate the firing of multiple projection neurons. , 1999, Journal of neurophysiology.

[6]  Marwan A. Jabri,et al.  Weight perturbation: an optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks , 1992, IEEE Trans. Neural Networks.

[7]  D. Margoliash,et al.  Song replay during sleep and computational rules for sensorimotor vocal learning. , 2000, Science.

[8]  T. Troyer,et al.  An associational model of birdsong sensorimotor learning II. Temporal hierarchies and the learning of song sequence. , 2000, Journal of neurophysiology.

[9]  T W Troyer,et al.  An associational model of birdsong sensorimotor learning I. Efference copy and the learning of song syllables. , 2000, Journal of neurophysiology.

[10]  W. Schultz Dopamine neurons and their role in reward mechanisms , 1997, Current Opinion in Neurobiology.

[11]  Javier R. Movellan,et al.  Contrastive Hebbian Learning in the Continuous Hopfield Model , 1991 .

[12]  A. Doupe,et al.  Interruption of a basal ganglia–forebrain circuit prevents plasticity of learned vocalizations , 2000, Nature.

[13]  Robert W. Warner,et al.  The anatomy of the syrinx in passerine birds , 1972 .

[14]  D. Vicario,et al.  Contributions of syringeal muscles to respiration and vocalization in the zebra finch. , 1991, Journal of neurobiology.

[15]  Adam Prügel-Bennett,et al.  Analysis of synfire chains , 1995 .

[16]  Oh,et al.  Generalization in a two-layer neural network. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[17]  A. Doupe Song- and Order-Selective Neurons in the Songbird Anterior Forebrain and their Emergence during Vocal Development , 1997, The Journal of Neuroscience.

[18]  A. Doupe,et al.  Song- and order-selective neurons develop in the songbird anterior forebrain during vocal learning. , 1997, Journal of neurobiology.

[19]  R. J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[20]  John J. Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities , 1999 .

[21]  Moshe Abeles,et al.  Corticonics: Neural Circuits of Cerebral Cortex , 1991 .

[22]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[23]  S Dehaene,et al.  Neural networks that learn temporal sequences by selection. , 1987, Proceedings of the National Academy of Sciences of the United States of America.