论文信息 - Theoretical Analysis of Learning with Reward-Modulated Spike-Timing-Dependent Plasticity

Theoretical Analysis of Learning with Reward-Modulated Spike-Timing-Dependent Plasticity

Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a learning rule that could explain how local learning rules at single synapses support behaviorally relevant adaptive changes in complex networks of spiking neurons. However the potential and limitations of this learning rule could so far only be tested through computer simulations. This article provides tools for an analytic treatment of reward-modulated STDP, which allow us to predict under which conditions reward-modulated STDP will be able to achieve a desired learning effect. In particular, we can produce in this way a theoretical explanation and a computer model for a fundamental experimental finding on biofeedback in monkeys (reported in [1]).

Robert A. Legenstein | Wolfgang Maass | Dejan Pecevski

[1] Robert A. Legenstein,et al. A Learning Theory for Reward-Modulated Spike-Timing-Dependent Plasticity with Application to Biofeedback , 2008, PLoS Comput. Biol..

[2] Razvan V. Florian,et al. Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity , 2007, Neural Computation.

[3] E. Fetz,et al. Operantly conditioned patterns on precentral unit activity and correlated responses in adjacent cells and contralateral muscles. , 1973, Journal of neurophysiology.

[4] Wulfram Gerstner,et al. Spiking Neuron Models , 2002 .

[5] T. Sejnowski,et al. Fluctuating synaptic conductances recreate in vivo-like activity in neocortical neurons , 2001, Neuroscience.

[6] E. Kandel,et al. Is Heterosynaptic modulation essential for stabilizing hebbian plasiticity and memory , 2000, Nature Reviews Neuroscience.

[7] Wulfram Gerstner,et al. Spiking Neuron Models: An Introduction , 2002 .

[8] E. Izhikevich. Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.