Chasing probabilities — Signaling negative and positive prediction errors across domains

Adaptive actions build on internal probabilistic models of possible outcomes that are tuned according to the errors of their predictions when experiencing an actual outcome. Prediction errors (PEs) inform choice behavior across a diversity of outcome domains and dimensions, yet neuroimaging studies have so far only investigated such signals in singular experimental contexts. It is thus unclear whether the neuroanatomical distribution of PE encoding reported previously pertains to computational features that are invariant with respect to outcome valence, sensory domain, or some combination of the two. We acquired functional MRI data while volunteers performed four probabilistic reversal learning tasks which differed in terms of outcome valence (reward-seeking versus punishment-avoidance) and domain (abstract symbols versus facial expressions) of outcomes. We found that ventral striatum and frontopolar cortex coded increasingly positive PEs, whereas dorsal anterior cingulate cortex (dACC) traced increasingly negative PEs, irrespectively of the outcome dimension. Individual reversal behavior was unaffected by context manipulations and was predicted by activity in dACC and right inferior frontal gyrus (IFG). The stronger the response to negative PEs in these areas, the lower was the tendency to reverse choice behavior in response to negative events, suggesting that these regions enforce a rule-based strategy across outcome dimensions. Outcome valence influenced PE-related activity in left amygdala, IFG, and dorsomedial prefrontal cortex, where activity selectively scaled with increasingly positive PEs in the reward-seeking but not punishment-avoidance context, irrespective of sensory domain. Left amygdala displayed an additional influence of sensory domain. In the context of avoiding punishment, amygdala activity increased with increasingly negative PEs, but only for facial stimuli, indicating an integration of outcome valence and sensory domain during probabilistic choices.

[1]  Karl J. Friston,et al.  Bayesian model selection for group studies — Revisited , 2014, NeuroImage.

[2]  William H. Alexander,et al.  Distinct regions of anterior cingulate cortex signal prediction and outcome evaluation , 2014, NeuroImage.

[3]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[4]  K. Stanovich,et al.  Is probability matching smart? Associations between probabilistic choices and cognitive ability , 2003, Memory & cognition.

[5]  Guillem R. Esber,et al.  All that glitters ... dissociating attention and outcome expectancy from prediction errors signals. , 2010, Journal of neurophysiology.

[6]  R. R. Bush,et al.  A Mathematical Model for Simple Learning , 1951 .

[7]  G H Glover,et al.  Image‐based method for retrospective correction of physiological motion effects in fMRI: RETROICOR , 2000, Magnetic resonance in medicine.

[8]  Joseph J. Paton,et al.  Expectation Modulates Neural Responses to Pleasant and Aversive Stimuli in Primate Amygdala , 2007, Neuron.

[9]  Vivian V. Valentin,et al.  Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain. , 2009, Journal of neurophysiology.

[10]  Robert C. Wilson,et al.  An Approximately Bayesian Delta-Rule Model Explains the Dynamics of Belief Updating in a Changing Environment , 2010, The Journal of Neuroscience.

[11]  Milena Dzhelyova,et al.  Don't look back in anger: the rewarding value of a female face is discounted by an angry expression. , 2014, Journal of experimental psychology. Human perception and performance.

[12]  Ryan K. Jessup,et al.  Error Effects in Anterior Cingulate Cortex Reverse when Error Likelihood Is High , 2010, The Journal of Neuroscience.

[13]  Ajay B. Satpute,et al.  The Brain Basis of Positive and Negative Affect: Evidence from a Meta-Analysis of the Human Neuroimaging Literature. , 2016, Cerebral cortex.

[14]  J. Neumann,et al.  Adaptive coding of action values in the human rostral cingulate zone , 2009, NeuroImage.

[15]  W. Fias,et al.  Dissociating contributions of ACC and vmPFC in reward prediction, outcome, and choice , 2014, Neuropsychologia.

[16]  Mark A. Gluck,et al.  Love to Win or Hate to Lose? Asymmetry of Dopamine D2 Receptor Binding Predicts Sensitivity to Reward versus Punishment , 2014, Journal of Cognitive Neuroscience.

[17]  Daniel Brandeis,et al.  Cognitive flexibility in adolescence: Neural and behavioral mechanisms of reward prediction error processing in adaptive decision making during development , 2015, NeuroImage.

[18]  Christopher K. Kovach,et al.  Anterior Prefrontal Cortex Contributes to Action Selection through Tracking of Recent Reward Trends , 2012, The Journal of Neuroscience.

[19]  Adam R. Aron,et al.  Fronto-Basal-Ganglia Circuits for Stopping Action , 2011 .

[20]  T. Robbins,et al.  Discrimination, reversal, and shift learning in Huntington’s disease: mechanisms of impaired response selection , 1999, Neuropsychologia.

[21]  N. Daw,et al.  The ubiquity of model-based reinforcement learning , 2012, Current Opinion in Neurobiology.

[22]  Massimo Silvetti,et al.  Value and prediction error estimation account for volatility effects in ACC: A model-based fMRI study , 2013, Cortex.

[23]  Timothy Edward John Behrens,et al.  How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action , 2009, Neuron.

[24]  R. Paz,et al.  Functional Connectivity between Amygdala and Cingulate Cortex for Adaptive Aversive Learning , 2013, Neuron.

[25]  E. Weber,et al.  A Domain-Specific Risk-Taking (DOSPERT) Scale for Adult Populations , 2006, Judgment and Decision Making.

[26]  Jonathan Westley Peirce,et al.  Neuroinformatics Original Research Article Generating Stimuli for Neuroscience Using Psychopy , 2022 .

[27]  Ajay B. Satpute,et al.  Involvement of Sensory Regions in Affective Experience: A Meta-Analysis , 2015, Front. Psychol..

[28]  K Richard Ridderinkhof,et al.  Adaptive Coding , 2005, Science.

[29]  J. Dreher,et al.  Cerebral correlates of salient prediction error for different rewards and punishments. , 2013, Cerebral cortex.

[30]  Marco K. Wittmann,et al.  Multiple Neural Mechanisms of Decision Making and Their Competition under Changing Risk Pressure , 2014, Neuron.

[31]  P. Gasquoine,et al.  Contributions of the Insula to Cognition and Emotion , 2014, Neuropsychology Review.

[32]  C. Mathys,et al.  Hierarchical Prediction Errors in Midbrain and Basal Forebrain during Sensory Learning , 2013, Neuron.

[33]  A. Aron From Reactive to Proactive and Selective Control: Developing a Richer Model for Stopping Inappropriate Responses , 2011, Biological Psychiatry.

[34]  J. Ashburner,et al.  Nonlinear spatial normalization using basis functions , 1999, Human brain mapping.

[35]  Karl J. Friston,et al.  A Bayesian Foundation for Individual Learning Under Uncertainty , 2011, Front. Hum. Neurosci..

[36]  B. Skinner,et al.  Science and human behavior , 1953 .

[37]  M. Rushworth,et al.  Model-based analyses: Promises, pitfalls, and example applications to the study of cognitive control , 2011, Quarterly journal of experimental psychology.

[38]  Timothy E. J. Behrens,et al.  Review Frontal Cortex and Reward-guided Learning and Decision-making Figure 1. Frontal Brain Regions in the Macaque Involved in Reward-guided Learning and Decision-making Finer Grained Anatomical Divisions with Frontal Cortical Systems for Reward-guided Behavior , 2022 .

[39]  Timothy E. J. Behrens,et al.  Perceptual Classification in a Rapidly Changing Environment , 2011, Neuron.

[40]  Karl J. Friston,et al.  Conjunction revisited , 2005, NeuroImage.

[41]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[42]  Andrea Bergmann,et al.  Statistical Parametric Mapping The Analysis Of Functional Brain Images , 2016 .

[43]  M. Mermillod,et al.  Is eye contact the key to the social brain? , 2010, Behavioral and Brain Sciences.

[44]  Joshua W. Brown,et al.  Medial prefrontal cortex as an action-outcome predictor , 2011, Nature Neuroscience.

[45]  R. Rescorla A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .

[46]  C. Fiorillo Two Dimensions of Value: Dopamine Neurons Represent Reward But Not Aversiveness , 2013, Science.

[47]  Jane R. Garrison,et al.  Prediction error in reinforcement learning: A meta-analysis of neuroimaging studies , 2013, Neuroscience & Biobehavioral Reviews.

[48]  Jan Gläscher,et al.  Visualization of Group Inference Data in Functional Neuroimaging , 2009, Neuroinformatics.

[49]  P. Dayan,et al.  Goals and Habits in the Brain , 2013, Neuron.

[50]  Robin M Heidemann,et al.  Generalized autocalibrating partially parallel acquisitions (GRAPPA) , 2002, Magnetic resonance in medicine.

[51]  P. Tobler,et al.  Segregated and integrated coding of reward and punishment in the cingulate cortex. , 2009, Journal of neurophysiology.

[52]  Mara Mather,et al.  Age-related similarities and differences in brain activity underlying reversal learning , 2013, Front. Integr. Neurosci..

[53]  J. Patton,et al.  Factor structure of the Barratt impulsiveness scale. , 1995, Journal of clinical psychology.

[54]  W. Gaissmaier,et al.  The smart potential behind probability matching , 2008, Cognition.

[55]  S. Gunnery,et al.  Perceptions of Duchenne and non-Duchenne smiles: A meta-analysis , 2016, Cognition & emotion.

[56]  Jin Fan,et al.  Common and distinct networks underlying reward valence and processing stages: A meta-analysis of functional neuroimaging studies , 2011, Neuroscience & Biobehavioral Reviews.

[57]  Karl J. Friston,et al.  Movement‐Related effects in fMRI time‐series , 1996, Magnetic resonance in medicine.

[58]  Daphna Shohamy,et al.  Representation of aversive prediction errors in the human periaqueductal gray , 2014, Nature Neuroscience.

[59]  Natalie C. Ebner,et al.  FACES—A database of facial expressions in young, middle-aged, and older women and men: Development and validation , 2010, Behavior research methods.

[60]  Mark W Woolrich,et al.  Associative learning of social value , 2008, Nature.

[61]  R Turner,et al.  Optimized EPI for fMRI studies of the orbitofrontal cortex , 2003, NeuroImage.

[62]  Angela J. Yu,et al.  Uncertainty, Neuromodulation, and Attention , 2005, Neuron.

[63]  Karl J. Friston,et al.  Behavioral / Systems / Cognitive Striatal Prediction Error Modulates Cortical Coupling , 2010 .

[64]  Mara Mather,et al.  Differential Brain Activity during Emotional versus Nonemotional Reversal Learning , 2012, Journal of Cognitive Neuroscience.

[65]  H. Breiter,et al.  Reward Circuitry Activation by Noxious Thermal Stimuli , 2001, Neuron.

[66]  J. Morrens,et al.  Dopamine neurons coding prediction errors in reward space, but not in aversive space: a matter of location? , 2014, Journal of neurophysiology.