Basal Ganglia Neurons Dynamically Facilitate Exploration during Associative Learning

The basal ganglia (BG) appear to play a prominent role in associative learning, the process of pairing external stimuli with rewarding responses. Accumulating evidence suggests that the contributions of various BG components may be described within a reinforcement learning model, in which a broad repertoire of possible responses to environmental stimuli are evaluated before the most profitable one is chosen. The striatum receives diverse cortical inputs, providing a rich source of contextual information about environmental cues. It also receives projections from midbrain dopaminergic neurons, whose phasic activity reflects a reward prediction error signal. These coincident information streams are well suited for evaluating responses and biasing future actions toward the most profitable response. Still lacking in this model is a mechanistic description of how initial response variability is generated. To investigate this question, we recorded the activity of single neurons in the globus pallidus internus (GPi), the primary BG output nucleus, in nonhuman primates (Macaca mulatta) performing a motor associative learning task. A subset (29%) of GPi neurons showed learning-related effects, decreasing firing during the early stages of learning, then returning to higher baseline rates as associations were mastered. On a trial-by-trial basis, lower firing rates predicted exploratory behavior, whereas higher rates predicted an exploitive response. These results suggest that, during associative learning, BG output is initially permissive, allowing exploration of a variety of responses. Once a profitable response is identified, increased GPi activity suppresses alternative responses, sharpening the response profile and encouraging exploitation of the profitable learned behavior.

[1]  Michale S Fee,et al.  A basal ganglia-forebrain circuit in the songbird biases motor output to avoid vocal errors , 2009, Proceedings of the National Academy of Sciences.

[2]  A. Graybiel,et al.  Neurons in the thalamic CM-Pf complex supply striatal neurons with information about behaviorally significant sensory events. , 2001, Journal of neurophysiology.

[3]  E. Miller,et al.  Different time courses of learning-related activity in the prefrontal cortex and striatum , 2005, Nature.

[4]  M. Inase,et al.  Thalamic distribution of projection neurons to the primary motor cortex relative to afferent terminal fields from the globus pallidus in the macaque monkey , 1995, The Journal of comparative neurology.

[5]  M. Inase,et al.  Pallidal activity is involved in visuomotor association learning in monkeys , 2001, The European journal of neuroscience.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[7]  O. Hikosaka,et al.  A neural correlate of response bias in monkey caudate nucleus , 2002, Nature.

[8]  Nikolaus R. McFarland,et al.  Convergent Inputs from Thalamic Motor Nuclei and Frontal Cortical Areas to the Dorsal Striatum in the Primate , 2000, The Journal of Neuroscience.

[9]  Aaron S. Andalman,et al.  Vocal Experimentation in the Juvenile Songbird Requires a Basal Ganglia Circuit , 2005, PLoS biology.

[10]  A. Graybiel,et al.  Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories , 2005, Nature.

[11]  V. S. Chakravarthy,et al.  The Role of the Basal Ganglia in Exploration in a Neural Model Based on Reinforcement Learning , 2006, Int. J. Neural Syst..

[12]  J. Yelnik Functional anatomy of the basal ganglia , 2002, Movement disorders : official journal of the Movement Disorder Society.

[13]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[14]  Adam Ponzi,et al.  Dynamical model of salience gated working memory, action selection and reinforcement based on basal ganglia and dopamine feedback , 2008, Neural Networks.

[15]  Ziv M. Williams,et al.  Selective enhancement of associative learning by microstimulation of the anterior caudate , 2006, Nature Neuroscience.

[16]  Samuel D. Gale,et al.  A Basal Ganglia Pathway Drives Selective Auditory Responses in Songbird Dopaminergic Neurons via Disinhibition , 2010, The Journal of Neuroscience.

[17]  K. Doya,et al.  Representation of Action-Specific Reward Values in the Striatum , 2005, Science.

[18]  Minmin Luo,et al.  An Avian Basal Ganglia Pathway Essential for Vocal Learning Forms a Closed Topographic Loop , 2001, The Journal of Neuroscience.

[19]  A. Doupe,et al.  Contributions of an avian basal ganglia–forebrain circuit to real-time modulation of song , 2005, Nature.

[20]  M. Merello,et al.  [Functional anatomy of the basal ganglia]. , 2000, Revista de neurologia.

[21]  Michael S Brainard,et al.  Lesions of an avian basal ganglia circuit prevent context-dependent changes to song variability. , 2006, Journal of neurophysiology.

[22]  A. Graybiel,et al.  Two input systems for body representations in the primate striatal matrix: experimental evidence in the squirrel monkey , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[23]  K. H. Britten,et al.  A relationship between behavioral choice and the visual responses of neurons in macaque MT , 1996, Visual Neuroscience.

[24]  S. Wise,et al.  Comparison of learning‐related neuronal activity in the dorsal premotor cortex and striatum , 2004, The European journal of neuroscience.

[25]  M. Delong,et al.  Activity of pallidal neurons during movement. , 1971, Journal of neurophysiology.

[26]  A. Parent,et al.  Functional anatomy of the basal ganglia. I. The cortico-basal ganglia-thalamo-cortical loop , 1995, Brain Research Reviews.

[27]  O. Hikosaka,et al.  Reward-predicting activity of dopamine and caudate neurons--a possible mechanism of motivational control of saccadic eye movement. , 2004, Journal of neurophysiology.

[28]  J. Hollerman,et al.  Changes in behavior-related neuronal activity in the striatum during learning , 2003, Trends in Neurosciences.

[29]  O. Hikosaka,et al.  Expectation of reward modulates cognitive signals in the basal ganglia , 1998, Nature Neuroscience.

[30]  A. Graybiel The basal ganglia: learning new tricks and loving it , 2005, Current Opinion in Neurobiology.

[31]  S. Haber,et al.  Reward-Related Cortical Inputs Define a Large Striatal Region in Primates That Interface with Associative Cortical Connections, Providing a Substrate for Incentive-Based Learning , 2006, The Journal of Neuroscience.

[32]  G. E. Alexander,et al.  Functional architecture of basal ganglia circuits: neural substrates of parallel processing , 1990, Trends in Neurosciences.

[33]  Emery N Brown,et al.  Dynamic Analysis of Learning in Behavioral Experiments , 2004, The Journal of Neuroscience.