A Computational Model of Match Decision-Making Problem Using Spiking SHESN with Reward-Modulated Reinforcement Learning

Match decision-making problem is one of the hot topics in the field of computational neuroscience. In this paper, we propose a spiking SHESN model with reward-modulated reinforcement learning so as to conduct computational modeling and prediction of such an open problem in a manner that has more neurophysiological characteristics. Neural coding of two sequentially-presented stimuli is read out from a collection of clustered neural populations in state reservoir through reward-modulated reinforcement learning. To evaluate match decision-making performance of our computational model, we set up three kinds of test datasets with different spike timing trains and present a criterion of maximum correlation coefficient for assessing whether match/nonmatch decision-making is successful or not. Finally, extensive experimental results show that the proposed model has strong robustness on interval of both spike timings and spike shift, which is consistent with monkey’s behavior records exhibited in match decision-making experiment [1].

[1]  Herbert Jaeger,et al.  Optimization and applications of echo state networks with leaky- integrator neurons , 2007, Neural Networks.

[2]  Wolfgang Maass,et al.  A Reward-Modulated Hebbian Learning Rule Can Explain Experimentally Observed Network Reorganization in a Brain Control Task , 2010, The Journal of Neuroscience.

[3]  Harald Haas,et al.  Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication , 2004, Science.

[4]  A. Prinz,et al.  Multi-objective evolutionary algorithms for model neuron parameter value selection matching biological behavior under different simulation scenarios , 2009, BMC Neuroscience.

[5]  Christos Constantinidis,et al.  Neural Correlates of a Decision Variable Before Learning to Perform a Match/Non-Match Task , 2012, The Journal of Neuroscience.

[6]  Peter Dayan,et al.  Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems , 2001 .

[7]  Neil Burgess,et al.  Attractor Dynamics in the Hippocampal Representation of the Local Environment , 2005, Science.

[8]  Zhidong Deng,et al.  Collective Behavior of a Small-World Recurrent Neural System With Scale-Free Distribution , 2007, IEEE Transactions on Neural Networks.

[9]  Wulfram Gerstner,et al.  SPIKING NEURON MODELS Single Neurons , Populations , Plasticity , 2002 .

[10]  R. Krimm,et al.  Neuron/target matching between chorda tympani neurons and taste buds during postnatal rat development. , 2000, Journal of neurobiology.

[11]  Y. Dan,et al.  Spike timing-dependent plasticity: a Hebbian learning rule. , 2008, Annual review of neuroscience.

[12]  Bruno A. Olshausen,et al.  Book Review , 2003, Journal of Cognitive Neuroscience.