Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories

Learning to perform a behavioural procedure as a well-ingrained habit requires extensive repetition of the behavioural sequence, and learning not to perform such behaviours is notoriously difficult. Yet regaining a habit can occur quickly, with even one or a few exposures to cues previously triggering the behaviour. To identify neural mechanisms that might underlie such learning dynamics, we made long-term recordings from multiple neurons in the sensorimotor striatum, a basal ganglia structure implicated in habit formation, in rats successively trained on a reward-based procedural task, given extinction training and then given reacquisition training. The spike activity of striatal output neurons, nodal points in cortico-basal ganglia circuits, changed markedly across multiple dimensions during each of these phases of learning. First, new patterns of task-related ensemble firing successively formed, reversed and then re-emerged. Second, task-irrelevant firing was suppressed, then rebounded, and then was suppressed again. These changing spike activity patterns were highly correlated with changes in behavioural performance. We propose that these changes in task representation in cortico-basal ganglia circuits represent neural equivalents of the explore–exploit behaviour characteristic of habit learning.
