A spiking neuron model of the cortico-basal ganglia circuits for goal-directed and habitual action learning.

Dual-system theories postulate that actions are supported either by a goal-directed or by a habit-driven response system. Neuroimaging and anatomo-functional studies have provided evidence that the prefrontal cortex plays a fundamental role in the first type of action control, while internal brain areas such as the basal ganglia are more active during habitual and overtrained responses. Additionally, it has been shown that areas of the cortex and the basal ganglia are connected through multiple parallel "channels", which are thought to function as an action selection mechanism resolving competitions between alternative options available in a given context. In this paper we propose a multi-layer network of spiking neurons that implements in detail the thalamo-cortical circuits that are believed to be involved in action learning and execution. A key feature of this model is that neurons are organized in small pools in the motor cortex and form independent loops with specific pools of the basal ganglia where inhibitory circuits implement a multistep selection mechanism. The described model has been validated utilizing it to control the actions of a virtual monkey that has to learn to turn on briefly flashing lights by pressing corresponding buttons on a board. When the animal is able to fluently execute the task the button-light associations are remapped so that it has to suppress its habitual behavior in order to execute goal-directed actions. The model nicely shows how sensory-motor associations for action sequences are formed at the cortico-basal ganglia level and how goal-directed decisions may override automatic motor responses.

[1]  K. Gurney,et al.  A Physiologically Plausible Model of Action Selection and Oscillatory Activity in the Basal Ganglia , 2006, The Journal of Neuroscience.

[2]  L. Abbott,et al.  Competitive Hebbian learning through spike-timing-dependent synaptic plasticity , 2000, Nature Neuroscience.

[3]  Bernhard Hommel,et al.  Acquisition and Control of Voluntary Action — 1 Acquisition and Control of Voluntary Action , 2001 .

[4]  A. Destexhe Kinetic Models of Synaptic Transmission , 1997 .

[5]  W. Schultz,et al.  Learning of sequential movements by neural network model with dopamine-like reinforcement signal , 1998, Experimental Brain Research.

[6]  T. Stanford,et al.  Subcortical loops through the basal ganglia , 2005, Trends in Neurosciences.

[7]  L. Abbott,et al.  Synaptic plasticity: taming the beast , 2000, Nature Neuroscience.

[8]  J. Mink THE BASAL GANGLIA: FOCUSED SELECTION AND INHIBITION OF COMPETING MOTOR PROGRAMS , 1996, Progress in Neurobiology.

[9]  C. Stevens,et al.  Voltage dependence of NMDA-activated macroscopic conductances predicted by single-channel kinetics , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[10]  R. Wise Dopamine, learning and motivation , 2004, Nature Reviews Neuroscience.

[11]  G. E. Alexander,et al.  Parallel organization of functionally segregated circuits linking basal ganglia and cortex. , 1986, Annual review of neuroscience.

[12]  A. Faure,et al.  Lesion to the Nigrostriatal Dopamine System Disrupts Stimulus-Response Habit Formation , 2005, The Journal of Neuroscience.

[13]  B. Balleine,et al.  Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning , 2004, The European journal of neuroscience.

[14]  H. Yin,et al.  The role of the basal ganglia in habit formation , 2006, Nature Reviews Neuroscience.

[15]  R. Poldrack,et al.  Cortical and Subcortical Contributions to Stop Signal Response Inhibition: Role of the Subthalamic Nucleus , 2006, The Journal of Neuroscience.

[16]  S. Killcross,et al.  Coordination of actions and habits in the medial prefrontal cortex of rats. , 2003, Cerebral cortex.

[17]  J W Donahoe,et al.  A selectionist approach to reinforcement. , 1993, Journal of the experimental analysis of behavior.

[18]  B. Balleine,et al.  Goal-directed instrumental action: contingency and incentive learning and their cortical substrates , 1998, Neuropharmacology.

[19]  V. Mountcastle The columnar organization of the neocortex. , 1997, Brain : a journal of neurology.

[20]  Peter Redgrave,et al.  A computational model of action selection in the basal ganglia. I. A new functional anatomy , 2001, Biological Cybernetics.

[21]  Roshan Cools,et al.  Habitual versus Goal-directed Action Control in Parkinson Disease , 2011, Journal of Cognitive Neuroscience.

[22]  David A Lewis,et al.  Synaptic efficacy during repetitive activation of excitatory inputs in primate dorsolateral prefrontal cortex. , 2004, Cerebral cortex.

[23]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[24]  Michael J. Frank,et al.  By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism , 2004, Science.

[25]  P. Strick,et al.  Basal ganglia and cerebellar loops: motor and cognitive circuits , 2000, Brain Research Reviews.

[26]  B. Balleine,et al.  The role of prelimbic cortex in instrumental conditioning , 2003, Behavioural Brain Research.

[27]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[28]  H. Bergman,et al.  Goal-directed and habitual control in the basal ganglia: implications for Parkinson's disease , 2010, Nature Reviews Neuroscience.

[29]  K. Johnston,et al.  Top-Down Control-Signal Dynamics in Anterior Cingulate and Prefrontal Cortex Neurons following Task Switching , 2007, Neuron.

[30]  C. Gerfen,et al.  Crossed connections of the substantia nigra in the rat , 1982, The Journal of comparative neurology.

[31]  G. Meinhardt Detection of sinusoidal gratings by pattern-specific detectors: further evidence for the correlation principle in human vision , 2001, Biological Cybernetics.

[32]  O. Hikosaka,et al.  Differential activation of monkey striatal neurons in the early and late stages of procedural learning , 2002, Experimental Brain Research.

[33]  L. Fogassi,et al.  Grasping Neurons of Monkey Parietal and Premotor Cortices Encode Action Goals at Distinct Levels of Abstraction during Complex Action Sequences , 2011, The Journal of Neuroscience.

[34]  Charles J. Wilson,et al.  The origins of two-state spontaneous membrane potential fluctuations of neostriatal spiny neurons , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[35]  G. Bi,et al.  Synaptic modification by correlated activity: Hebb's postulate revisited. , 2001, Annual review of neuroscience.

[36]  Richard S. Sutton,et al.  Reinforcement learning with replacing eligibility traces , 2004, Machine Learning.

[37]  Dale E. Zand,et al.  Parallel Organization , 2014 .

[38]  B. Balleine,et al.  Blockade of NMDA receptors in the dorsomedial striatum prevents action–outcome learning in instrumental conditioning , 2005, The European journal of neuroscience.

[39]  T. Robbins,et al.  Stop-signal reaction-time task performance: role of prefrontal cortex and subthalamic nucleus. , 2008, Cerebral cortex.

[40]  Sacha Jennifer van Albada,et al.  Mean-field modeling of the basal ganglia-thalamocortical system. I Firing rates in healthy and parkinsonian states. , 2009, Journal of theoretical biology.

[41]  Leonardo Fogassi,et al.  Neuronal Chains for Actions in the Parietal Lobe: A Computational Model , 2011, PloS one.

[42]  J. Houk,et al.  Model of cortical-basal ganglionic processing: encoding the serial order of sensory events. , 1998, Journal of neurophysiology.

[43]  A. Iriki,et al.  Acquisition and development of monkey tool-use: behavioral and kinematic analyses. , 2000, Canadian journal of physiology and pharmacology.

[44]  Benjamin O. Turner,et al.  Cortical and basal ganglia contributions to habit learning and automaticity , 2010, Trends in Cognitive Sciences.

[45]  Henning Schroll,et al.  Working memory and response selection: A computational account of interactions among cortico-basalganglio-thalamic loops , 2012, Neural Networks.

[46]  J. Bargas,et al.  D 1 Receptor Activation Enhances Evoked Discharge in Neostriatal Medium Spiny Neurons by Modulating an L-Type Ca 2 1 Conductance , 1997 .

[47]  W. Schultz,et al.  Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.

[48]  V. S. Chakravarthy,et al.  The Role of the Basal Ganglia in Exploration in a Neural Model Based on Reinforcement Learning , 2006, Int. J. Neural Syst..

[49]  W. Regehr,et al.  Short-term synaptic plasticity. , 2002, Annual review of physiology.

[50]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[51]  Jesper Tegnér,et al.  Spike-timing-dependent plasticity: common themes and divergent vistas , 2002, Biological Cybernetics.

[52]  G. Rizzolatti,et al.  Parietal Lobe: From Action Organization to Intention Understanding , 2005, Science.

[53]  Jörg Lücke,et al.  Rapid Processing and Unsupervised Learning in a Model of the Cortical Macrocolumn , 2004, Neural Computation.

[54]  E. Izhikevich Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.