The Good, the Bad, and the Irrelevant: Neural Mechanisms of Learning Real and Hypothetical Rewards and Effort

Natural environments are complex, and a single choice can lead to multiple outcomes. Agents should learn which outcomes are due to their choices and therefore relevant for future decisions and which are stochastic in ways common to all choices and therefore irrelevant for future decisions between options. We designed an experiment in which human participants learned the varying reward and effort magnitudes of two options and repeatedly chose between them. The reward associated with a choice was randomly real or hypothetical (i.e., participants only sometimes received the reward magnitude associated with the chosen option). The real/hypothetical nature of the reward on any one trial was, however, irrelevant for learning the longer-term values of the choices, and participants ought to have only focused on the informational content of the outcome and disregarded whether it was a real or hypothetical reward. However, we found that participants showed an irrational choice bias, preferring choices that had previously led, by chance, to a real reward in the last trial. Amygdala and ventromedial prefrontal activity was related to the way in which participants' choices were biased by real reward receipt. By contrast, activity in dorsal anterior cingulate cortex, frontal operculum/anterior insula, and especially lateral anterior prefrontal cortex was related to the degree to which participants resisted this bias and chose effectively in a manner guided by aspects of outcomes that had real and more sustained relationships with particular choices, suppressing irrelevant reward information for more optimal learning and decision making. SIGNIFICANCE STATEMENT In complex natural environments, a single choice can lead to multiple outcomes. Human agents should only learn from outcomes that are due to their choices, not from outcomes without such a relationship. We designed an experiment to measure learning about reward and effort magnitudes in an environment in which other features of the outcome were random and had no relationship with choice. We found that, although people could learn about reward magnitudes, they nevertheless were irrationally biased toward repeating certain choices as a function of the presence or absence of random reward features. Activity in different brain regions in the prefrontal cortex either reflected the bias or reflected resistance to the bias.

[1]  M. Rushworth,et al.  Connectivity reveals relationship of brain areas for reward-guided learning and decision making in human and monkey frontal cortex , 2015, Proceedings of the National Academy of Sciences.

[2]  M. Pessiglione,et al.  Learning To Minimize Efforts versus Maximizing Rewards: Computational Principles and Neural Correlates , 2014, The Journal of Neuroscience.

[3]  Etienne Koechlin,et al.  Foundations of human reasoning in the prefrontal cortex , 2014, Science.

[4]  M. Lee,et al.  Bayesian Cognitive Modeling: A Practical Course , 2014 .

[5]  Marco K. Wittmann,et al.  Multiple Neural Mechanisms of Decision Making and Their Competition under Changing Risk Pressure , 2014, Neuron.

[6]  Nils Kolling,et al.  A neural mechanism underlying failure of optimal choice with multiple alternatives , 2014, Nature Neuroscience.

[7]  Adam G. Thomas,et al.  Comparison of Human Ventral Frontal Cortex Areas for Cognitive Control and Language with Areas in Monkey Frontal Cortex , 2014, Neuron.

[8]  M. Rushworth,et al.  A Role Beyond Learning for NMDA Receptors in Reward-Based Decision-Making—a Pharmacological Study Using d-Cycloserine , 2014, Neuropsychopharmacology.

[9]  Roshan Cools,et al.  Serotonin and Aversive Pavlovian Control of Instrumental Behavior in Humans , 2013, The Journal of Neuroscience.

[10]  Adam G. Thomas,et al.  The Organization of Dorsal Frontal Cortex in Humans and Macaques , 2013, The Journal of Neuroscience.

[11]  P. Dayan,et al.  Effort and Valuation in the Brain: The Effects of Anticipation and Execution , 2013, The Journal of Neuroscience.

[12]  Timothy Edward John Behrens,et al.  Ventromedial Prefrontal and Anterior Cingulate Cortex Adopt Choice and Default Reference Frames during Sequential Multi-Alternative Choice , 2013, The Journal of Neuroscience.

[13]  Soyoung Q. Park,et al.  Neural Integration of Risk and Effort Costs by the Frontal Pole: Only upon Request , 2013, The Journal of Neuroscience.

[14]  M. Pessiglione,et al.  Critical Roles for Anterior Insula and Dorsal Striatum in Punishment-Based Avoidance Learning , 2012, Neuron.

[15]  M. Rushworth,et al.  Valuation and decision-making in frontal cortex: one or many serial or parallel systems? , 2012, Current Opinion in Neurobiology.

[16]  Mimi Liljeholm,et al.  Neural Correlates of Specific and General Pavlovian-to-Instrumental Transfer within Human Amygdalar Subregions: A High-Resolution fMRI Study , 2012, The Journal of Neuroscience.

[17]  Timothy E. J. Behrens,et al.  Tools of the trade: psychophysiological interactions and functional connectivity. , 2012, Social cognitive and affective neuroscience.

[18]  Timothy E. J. Behrens,et al.  Neural Mechanisms of Foraging , 2012, Science.

[19]  M. Walton,et al.  Re‐evaluating the role of the orbitofrontal cortex in reward and reinforcement , 2012, The European journal of neuroscience.

[20]  P. Dayan,et al.  Mapping value based planning and extensively trained choice in the human brain , 2012, Nature Neuroscience.

[21]  Woojae Kim,et al.  A Model-Based fMRI Analysis with Hierarchical Bayesian Parameter Estimation. , 2013, Journal of neuroscience, psychology, and economics.

[22]  Daeyeol Lee,et al.  Ubiquity and Specificity of Reinforcement Signals throughout the Human Brain , 2011, Neuron.

[23]  Timothy E. J. Behrens,et al.  Counterfactual Choice and Learning in a Neural Network Centered on Human Lateral Frontopolar Cortex , 2011, PLoS biology.

[24]  Timothy Edward John Behrens,et al.  Diffusion-Weighted Imaging Tractography-Based Parcellation of the Human Parietal Cortex and Comparison with Human and Macaque Resting-State Functional Connectivity , 2011, The Journal of Neuroscience.

[25]  P. Dayan,et al.  Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.

[26]  Mathias Pessiglione,et al.  Separate Valuation Subsystems for Delay and Effort Decision Costs , 2010, The Journal of Neuroscience.

[27]  Zhishun Wang,et al.  Visual inspection of independent components: Defining a procedure for artifact removal from fMRI data , 2010, Journal of Neuroscience Methods.

[28]  Antonio Rangel,et al.  Neural computations associated with goal-directed choice , 2010, Current Opinion in Neurobiology.

[29]  Warren K Bickel,et al.  Congruence of BOLD Response across Intertemporal Choice Conditions: Fictive and Real Money Gains and Losses , 2009, The Journal of Neuroscience.

[30]  Timothy Edward John Behrens,et al.  How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action , 2009, Neuron.

[31]  D. Kumaran,et al.  A Genetically Mediated Bias in Decision Making Driven by Failure of Amygdala Control , 2009, The Journal of Neuroscience.

[32]  Timothy Edward John Behrens,et al.  Effort-Based Cost–Benefit Valuation and the Human Brain , 2009, The Journal of Neuroscience.

[33]  M. Rushworth,et al.  Behavioral / Systems / Cognitive Connectivity-Based Parcellation of Human Cingulate Cortex and Its Relation to Functional Specialization , 2008 .

[34]  Michael D. Lee,et al.  A Survey of Model Evaluation Approaches With a Tutorial on Hierarchical Bayesian Methods , 2008, Cogn. Sci..

[35]  B. Balleine,et al.  Calculating Consequences: Brain Systems That Encode the Causal Effects of Actions , 2008, The Journal of Neuroscience.

[36]  B. Balleine,et al.  The Neural Mechanisms Underlying the Influence of Pavlovian Cues on Human Decision Making , 2008, The Journal of Neuroscience.

[37]  Pearl H. Chiu,et al.  Smokers' brains compute, but ignore, a fictive error signal in a sequential investment task , 2008, Nature Neuroscience.

[38]  P. Dayan,et al.  Human Pavlovian–Instrumental Transfer , 2008, The Journal of Neuroscience.

[39]  John C Gore,et al.  Assessing functional connectivity in the human brain by fMRI. , 2007, Magnetic resonance imaging.

[40]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[41]  Kevin McCabe,et al.  Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.

[42]  R. Dolan,et al.  How the Brain Translates Money into Force: A Neuroimaging Study of Subliminal Motivation , 2007, Science.

[43]  Matthew F. S. Rushworth,et al.  Weighing up the benefits of work: Behavioral and neural analyses of effort-related decision making , 2006, Neural Networks.

[44]  M. Walton,et al.  Separate neural pathways process different decision costs , 2006, Nature Neuroscience.

[45]  D. Kumaran,et al.  Frames, Biases, and Rational Decision-Making in the Human Brain , 2006, Science.

[46]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.

[47]  Mark W. Woolrich,et al.  Advances in functional and structural MR image analysis and implementation as FSL , 2004, NeuroImage.

[48]  Kristopher J Preacher,et al.  SPSS and SAS procedures for estimating indirect effects in simple mediation models , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[49]  Stephen M. Smith,et al.  Probabilistic independent component analysis for functional magnetic resonance imaging , 2004, IEEE Transactions on Medical Imaging.

[50]  Stephen M. Smith,et al.  General multilevel linear modeling for group analysis in FMRI , 2003, NeuroImage.

[51]  Matthew F S Rushworth,et al.  Functional Specialization within Medial Frontal Cortex of the Anterior Cingulate for Evaluating Effort-Related Decisions , 2003, The Journal of Neuroscience.

[52]  R Turner,et al.  Optimized EPI for fMRI studies of the orbitofrontal cortex , 2003, NeuroImage.

[53]  J. Salamone,et al.  Nucleus Accumbens Dopamine and the Regulation of Effort in Food-Seeking Behavior: Implications for Studies of Natural Motivation, Psychiatry, and Drug Abuse , 2003, Journal of Pharmacology and Experimental Therapeutics.

[54]  Martyn Plummer,et al.  JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling , 2003 .

[55]  M. Walton,et al.  The Role of Rat Medial Frontal Cortex in Effort-Based Decision Making , 2002, The Journal of Neuroscience.

[56]  Stephen M Smith,et al.  Fast robust automated brain extraction , 2002, Human brain mapping.

[57]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[58]  Stephen M. Smith,et al.  Temporal Autocorrelation in Univariate Linear Modeling of FMRI Data , 2001, NeuroImage.

[59]  J. Price,et al.  Limbic connections of the orbital and medial prefrontal cortex in macaque monkeys , 1995, The Journal of comparative neurology.

[60]  J. Salamone,et al.  Nucleus accumbens dopamine release increases during instrumental lever pressing for food but not free food consumption , 1994, Pharmacology Biochemistry and Behavior.

[61]  Karl J. Friston,et al.  The Relationship between Global and Local Changes in PET Scans , 1990, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.