Dissociating neural learning signals in human sign- and goal-trackers

Individuals differ in how they learn from experience. In Pavlovian conditioning models, where cues predict reinforcer delivery at a different goal location, some animals—called sign-trackers—come to approach the cue, whereas others, called goal-trackers, approach the goal. In sign-trackers, model-free phasic dopaminergic reward-prediction errors underlie learning, which renders stimuli ‘wanted’. Goal-trackers do not rely on dopamine for learning and are thought to use model-based learning. We demonstrate this double dissociation in 129 male humans using eye-tracking, pupillometry and functional magnetic resonance imaging informed by computational models of sign- and goal-tracking. We show that sign-trackers exhibit a neural reward prediction error signal that is not detectable in goal-trackers. Model-free value only guides gaze and pupil dilation in sign-trackers. Goal-trackers instead exhibit a stronger model-based neural state prediction error signal. This model-based construct determines gaze and pupil dilation more in goal-trackers. Schad et al. find that, during Pavlovian conditioning, model-free striatal reward prediction errors are present in a group of sign-tracking humans, while goal-tracking humans show learning signals from a model-based system instead.

[1]  S. S. Stevens,et al.  Handbook of experimental psychology , 1951 .

[2]  R. Leclerc,et al.  Sign-tracking in aversive conditioning , 1980 .

[3]  K. Berridge,et al.  The neural basis of drug craving: An incentive-sensitization theory of addiction , 1993, Brain Research Reviews.

[4]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[5]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[6]  Anthony C. Davison,et al.  Bootstrap Methods and their Application , 1997 .

[7]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[8]  Hans-Ulrich Wittchen,et al.  DIA-X-Interviews: Manual für Screening-Verfahren und Interview; Interviewheft , 1997 .

[9]  D G Pelli,et al.  The VideoToolbox software for visual psychophysics: transforming numbers into movies. , 1997, Spatial vision.

[10]  K. Berridge,et al.  What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience? , 1998, Brain Research Reviews.

[11]  Kenji Doya,et al.  What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? , 1999, Neural Networks.

[12]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[13]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[14]  N. Logothetis,et al.  Neurophysiological investigation of the basis of the fMRI signal , 2001, Nature.

[15]  B. Balleine,et al.  The Role of Learning in the Operation of Motivational Systems , 2002 .

[16]  K. Berridge,et al.  Erratum to: “Parsing reward” [Trends Neurosci. 26 (2003) 507–513] , 2003, Trends in Neurosciences.

[17]  Samuel M. McClure,et al.  A computational substrate for incentive salience , 2003, Trends in Neurosciences.

[18]  R. Hu Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) , 2003 .

[19]  K. Berridge,et al.  Parsing reward , 2003, Trends in Neurosciences.

[20]  Sasa Kenjeres,et al.  Visualization of turbulence structures reorganization in thermal convection subjected to external magnetic field , 2004, J. Vis..

[21]  Simon B. Eickhoff,et al.  A new SPM toolbox for combining probabilistic cytoarchitectonic maps and functional imaging data , 2005, NeuroImage.

[22]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[23]  M. Paulus,et al.  Neurobiology of Decision Making: A Selective Review from a Neurocognitive and Clinical Perspective , 2005, Biological Psychiatry.

[24]  K. Berridge The debate over dopamine’s role in reward: the case for incentive salience , 2007, Psychopharmacology.

[25]  Peter Dayan,et al.  Non-commercial Research and Educational Use including without Limitation Use in Instruction at Your Institution, Sending It to Specific Colleagues That You Know, and Providing a Copy to Your Institution's Administrator. All Other Uses, Reproduction and Distribution, including without Limitation Comm , 2022 .

[26]  G. Ruxton The unequal variance t-test is an underused alternative to Student's t-test and the Mann–Whitney U test , 2006 .

[27]  Preeti Verghese,et al.  Where to look next? Eye movements reduce local uncertainty. , 2007, Journal of vision.

[28]  R. Wightman,et al.  Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens , 2007, Nature Neuroscience.

[29]  P. Dayan,et al.  Differential Encoding of Losses and Gains in the Human Striatum , 2007, The Journal of Neuroscience.

[30]  Richard D. Morey,et al.  Confidence Intervals from Normalized Data: A correction to Cousineau (2005) , 2008 .

[31]  T. Robinson,et al.  Dissociating the Predictive and Incentive Motivational Properties of Reward-Related Cues Through the Study of Individual Differences , 2009, Biological Psychiatry.

[32]  Christopher J. Peck,et al.  Reward Modulates Attention Independently of Action Value in Posterior Parietal Cortex , 2009, The Journal of Neuroscience.

[33]  P. Dayan,et al.  States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.

[34]  Christopher J. Mitchell,et al.  Attention and Associative Learning: From Brain to Behaviour , 2010 .

[35]  A. Dickinson,et al.  Selective attention to conditioned stimuli in human discrimination learning: Untangling the effect of outcome prediction, valence, arousal and uncertainty , 2010 .

[36]  L. Chelazzi,et al.  Reward Changes Salience in Human Vision via the Anterior Cingulate , 2010, The Journal of Neuroscience.

[37]  P. Glimcher,et al.  Testing the Reward Prediction Error Hypothesis with an Axiomatic Model , 2010, The Journal of Neuroscience.

[38]  Raymond J. Dolan,et al.  Disentangling the Roles of Approach, Activation and Valence in Instrumental and Pavlovian Responding , 2011, PLoS Comput. Biol..

[39]  T. Robinson,et al.  A selective role for dopamine in reward learning , 2010, Nature.

[40]  Satrajit S. Ghosh,et al.  Nipype: A Flexible, Lightweight and Extensible Neuroimaging Data Processing Framework in Python , 2011, Front. Neuroinform..

[41]  T. Robinson,et al.  A food predictive cue must be attributed with incentive salience for it to induce c-fos mRNA expression in cortico-striatal-thalamic brain regions , 2011, Neuroscience.

[42]  J. Gottlieb Attention, Learning, and the Value of Information , 2012, Neuron.

[43]  W. Zoest,et al.  Reward creates oculomotor salience , 2012, Current Biology.

[44]  Robert C. Wilson,et al.  Rational regulation of learning dynamics by pupil–linked arousal systems , 2012, Nature Neuroscience.

[45]  Jane R. Garrison,et al.  Prediction error in reinforcement learning: A meta-analysis of neuroimaging studies , 2013, Neuroscience & Biobehavioral Reviews.

[46]  Roshan Cools,et al.  Serotonin and Aversive Pavlovian Control of Instrumental Behavior in Humans , 2013, The Journal of Neuroscience.

[47]  C. Mathys,et al.  Hierarchical Prediction Errors in Midbrain and Basal Forebrain during Sensory Learning , 2013, Neuron.

[48]  Pierre-Yves Oudeyer,et al.  Information-seeking, curiosity, and attention: computational and neural mechanisms , 2013, Trends in Cognitive Sciences.

[49]  M. Haruno,et al.  Reward Prediction Error Signal Enhanced by Striatum–Amygdala Interaction Explains the Acceleration of Probabilistic Reward Learning by Emotion , 2013, The Journal of Neuroscience.

[50]  Josiah R. Boivin,et al.  A Causal Link Between Prediction Errors, Dopamine Neurons and Learning , 2013, Nature Neuroscience.

[51]  T. Robinson,et al.  Individual variation in resisting temptation: Implications for addiction , 2013, Neuroscience & Biobehavioral Reviews.

[52]  P. Dayan,et al.  Model-based and model-free Pavlovian reward learning: Revaluation, revision, and revelation , 2014, Cognitive, affective & behavioral neuroscience.

[53]  A. Heinz,et al.  Pavlovian-to-Instrumental Transfer in Alcohol Dependence: A Pilot Study , 2014, Neuropsychobiology.

[54]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[55]  Mehdi Khamassi,et al.  Modelling Individual Differences in the Form of Pavlovian Conditioned Approach Responses: A Dual Learning Systems Approach with Factored Representations , 2014, PLoS Comput. Biol..

[56]  L. Deserno,et al.  Devaluation and sequential decisions: linking goal-directed and model-based behavior , 2014, Front. Hum. Neurosci..

[57]  P. Tobler,et al.  The role of learning-related dopamine signals in addiction vulnerability. , 2014, Progress in brain research.

[58]  S. Garofalo,et al.  Individual differences in the influence of task-irrelevant Pavlovian cues on human behavior , 2015, Front. Behav. Neurosci..

[59]  Sara E. Morrison,et al.  Sign Tracking, but Not Goal Tracking, is Resistant to Outcome Devaluation , 2015, Front. Neurosci..

[60]  A. Lahti,et al.  Contribution of substantia nigra glutamate to prediction error signals in schizophrenia: a combined magnetic resonance spectroscopy/functional imaging study , 2015, npj Schizophrenia.

[61]  T. Robinson,et al.  Individual Variation in the Motivational and Neurobiological Effects of an Opioid Cue , 2015, Neuropsychopharmacology.

[62]  M. Peelen,et al.  Neural Mechanisms of Incentive Salience in Naturalistic Human Vision , 2015, Neuron.

[63]  R. Dolan,et al.  Ventral striatal dopamine reflects behavioral and neural signatures of model-based control during sequential decision making , 2015, Proceedings of the National Academy of Sciences.

[64]  Robert C. Wilson,et al.  Is Model Fitting Necessary for Model-Based fMRI? , 2015, PLoS Comput. Biol..

[65]  M. Husain,et al.  Reduced pupillary reward sensitivity in Parkinson’s disease , 2015, npj Parkinson's Disease.

[66]  Norbert Kathmann,et al.  Don't Think, Just Feel the Music: Individuals with Strong Pavlovian-to-Instrumental Transfer Effects Rely Less on Model-based Reinforcement Learning , 2016, Journal of Cognitive Neuroscience.

[67]  H. Walter,et al.  Pavlovian‐to‐instrumental transfer effects in the nucleus accumbens relate to relapse in alcohol dependence , 2016, Addiction biology.

[68]  Jared M. Hotaling,et al.  The Attraction Effect Modulates Reward Prediction Errors and Intertemporal Choices. , 2016, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[69]  H. Walter,et al.  Neural correlates of instrumental responding in the context of alcohol-related cues index disorder severity and relapse risk , 2019, European Archives of Psychiatry and Clinical Neuroscience.

[70]  R. Vogels,et al.  Representation of Semantic Similarity in the Left Intraparietal Sulcus: Functional Magnetic Resonance Imaging Evidence , 2017, Front. Hum. Neurosci..

[71]  A. Heinz,et al.  No association of goal‐directed and habitual control with alcohol consumption in young adults , 2018, Addiction biology.

[72]  C. Mathys,et al.  Hierarchical Prediction Errors in Midbrain and Basal Forebrain during Sensory Learning , 2013, Neuron.