Paper information - Efficient Reward-Based Learning through Body Representation in a Spiking Neural Network

Efficient Reward-Based Learning through Body Representation in a Spiking Neural Network

Brain-body interactions guide the development of behavioral and cognitive functions. Sensory signals generated during behavior are relayed to the brain and evoke neural activity. This feedback is important for the organization of neural networks via neural plasticity, which in turn facilitates the generation of motor commands for new behaviors. In this study, we investigated how brain-body interactions develop and how they affect reward-based learning. We constructed a spiking neural network (SNN) model for the reward-based learning of canonical babbling, i.e., a combination of a vowel and a consonant. The SNN output generated motor commands for a vocal simulator, and auditory signals representing the vocalized sound were fed back into the SNN. Synaptic weights within the SNN were updated using spike-timing-dependent plasticity (STDP). Connections from the SNN to the vocal simulator were modulated by reward signals based on the saliency of the vocalized sound. Our results showed that, under auditory feedback, STDP enabled the model to rapidly acquire babbling-like vocalization. We also found that some neurons in the SNN were more strongly activated during the vocalization of a consonant than during other sounds; that is, the neural dynamics in the SNN adapted to task-related articulator movements. Accordingly, body representation in the SNN facilitated brain-body interaction and accelerated the acquisition of babbling behavior.
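The two learning rules named in the abstract can be illustrated with a minimal sketch. It assumes a standard pair-based STDP window for the synapses inside the SNN and a simple reward-gated increment for the output connections. The function names, time constant, amplitudes, and learning rate are hypothetical placeholders chosen for illustration and are not taken from the paper.

```python
import math


def stdp_dw(dt_ms, a_plus=0.01, a_minus=0.012, tau_ms=20.0):
    """Pair-based STDP weight change for one pre/post spike pair.

    dt_ms is t_post - t_pre in milliseconds. All default values are
    illustrative assumptions, not the paper's parameters.
    """
    if dt_ms > 0:
        # Pre fires before post: potentiation, decaying with the gap.
        return a_plus * math.exp(-dt_ms / tau_ms)
    if dt_ms < 0:
        # Post fires before pre: depression, decaying with the gap.
        return -a_minus * math.exp(dt_ms / tau_ms)
    return 0.0


def reward_modulated_update(w, pre_active, post_active, reward,
                            lr=0.1, w_max=1.0):
    """Reward-gated update for an SNN-to-motor connection (assumed form).

    The weight grows only when a reward arrives and both the SNN neuron
    and the motor unit were active; the result is clipped to [0, w_max].
    """
    if reward and pre_active and post_active:
        w = w + lr
    return min(max(w, 0.0), w_max)


if __name__ == "__main__":
    # Pre-before-post pairs strengthen the synapse, post-before-pre weaken it.
    print(stdp_dw(10.0), stdp_dw(-10.0))
    # Without a reward the output weight is left unchanged.
    print(reward_modulated_update(0.5, True, True, reward=False))
```

In this sketch the STDP rule runs regardless of reward, which mirrors the abstract's separation between plasticity inside the SNN and reward modulation of the output connections; how the saliency-based reward is computed is left out because the abstract does not specify it.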
