Surprise Signals in Anterior Cingulate Cortex: Neuronal Encoding of Unsigned Reward Prediction Errors Driving Adjustment in Behavior

In attentional models of learning, associations between actions and subsequent rewards are stronger when outcomes are surprising, regardless of their valence. Despite behavioral evidence that surprising outcomes drive learning, neural correlates of unsigned reward prediction errors remain elusive. Here we show that in a probabilistic choice task, trial-to-trial variations in preference track outcome surprisingness. Concordant with this behavioral pattern, responses of neurons in macaque (Macaca mulatta) dorsal anterior cingulate cortex (dACC) to both large and small rewards were enhanced when the outcome was surprising. Moreover, on trials in which reward probabilities were hidden, neuronal responses to rewards were reduced, consistent with the idea that the absence of clear expectations diminishes surprise. These patterns are inconsistent with the idea that dACC neurons track signed errors in reward prediction, as dopamine neurons do. Our results also indicate that dACC neurons do not signal conflict. In the context of other studies of dACC function, these results suggest a link between reward-related modulations in dACC activity and the attention and motor control processes involved in behavioral adjustment. More speculatively, these data point to a harmonious integration between reward and learning accounts of ACC function on the one hand, and attention and cognitive control accounts on the other.
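
The distinction between signed and unsigned reward prediction errors can be illustrated with a minimal sketch. It assumes the common formalization in which the signed error is the obtained reward minus the expected reward and the unsigned error is its absolute value. The function names, reward sizes, and probabilities below are hypothetical illustrations, not the task parameters or the analysis used in the study.

```python
def expected_value(p_large, large, small):
    """Expected reward of a gamble paying `large` with probability p_large, else `small`."""
    return p_large * large + (1.0 - p_large) * small


def signed_rpe(reward, expected):
    """Signed prediction error: positive when the outcome is better than expected."""
    return reward - expected


def unsigned_rpe(reward, expected):
    """Unsigned prediction error (surprise): magnitude of the error, ignoring valence."""
    return abs(reward - expected)


# Hypothetical gamble: large reward = 1.0, small reward = 0.0 (arbitrary units).
LARGE, SMALL = 1.0, 0.0

for p_large in (0.25, 0.75):
    ev = expected_value(p_large, LARGE, SMALL)
    for label, outcome in (("large", LARGE), ("small", SMALL)):
        print(
            f"p(large)={p_large:.2f} outcome={label:5s} "
            f"signed={signed_rpe(outcome, ev):+.2f} "
            f"unsigned={unsigned_rpe(outcome, ev):.2f}"
        )
```

In this toy case an improbable large reward and an improbable small reward have signed errors of opposite sign but the same unsigned error, so a surprise signal of this kind would be enhanced for both, whereas a signed signal would move in opposite directions.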
