Counterfactual Choice and Learning in a Neural Network Centered on Human Lateral Frontopolar Cortex

Decision making and learning in a real-world context require organisms to track not only the choices they make and the outcomes that follow but also other untaken, or counterfactual, choices and their outcomes. Although the neural system responsible for tracking the value of choices actually taken is increasingly well understood, whether a neural system tracks counterfactual information is currently unclear. Using a three-alternative decision-making task, a Bayesian reinforcement-learning algorithm, and fMRI, we investigated the coding of counterfactual choices and prediction errors in the human brain. Rather than representing evidence favoring multiple counterfactual choices, lateral frontal polar cortex (lFPC), dorsomedial frontal cortex (DMFC), and posteromedial cortex (PMC) encode the reward-based evidence favoring the best counterfactual option at future decisions. In addition to encoding counterfactual reward expectations, the network carries a signal for learning about counterfactual options when feedback is available—a counterfactual prediction error. Unlike other brain regions that have been associated with the processing of counterfactual outcomes, counterfactual prediction errors within the identified network cannot be related to regret theory. Furthermore, individual variation in counterfactual choice-related activity and prediction error-related activity, respectively, predicts variation in the propensity to switch to profitable choices in the future and the ability to learn from hypothetical feedback. Taken together, these data provide both neural and behavioral evidence to support the existence of a previously unidentified neural system responsible for tracking both counterfactual choice options and their outcomes.

[1]  P. Goldman-Rakic,et al.  Prefrontal connections of medial motor areas in the rhesus monkey , 1993, The Journal of comparative neurology.

[2]  Etienne Koechlin The cognitive architecture of the human lateral prefrontal cortex , 1993 .

[3]  P. Haggard,et al.  Sensorimotor foundations of higher cognition , 1993 .

[4]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[5]  E. Koechlin,et al.  The role of the anterior prefrontal cortex in human cognition , 1999, Nature.

[6]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[7]  E. Koechlin,et al.  Dissociating the role of the medial and lateral anterior prefrontal cortex in human planning. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Stephen M. Smith,et al.  A global optimisation method for robust affine registration of brain images , 2001, Medical Image Anal..

[9]  Stephen M. Smith,et al.  Temporal Autocorrelation in Univariate Linear Modeling of FMRI Data , 2001, NeuroImage.

[10]  R. Passingham,et al.  Active maintenance in prefrontal area 46 creates distractor-resistant memory , 2002, Nature Neuroscience.

[11]  R Turner,et al.  Optimized EPI for fMRI studies of the orbitofrontal cortex , 2003, NeuroImage.

[12]  Stephen M. Smith,et al.  General multilevel linear modeling for group analysis in FMRI , 2003, NeuroImage.

[13]  S. Scott,et al.  The role of the rostral frontal cortex (area 10) in prospective memory: a lateral versus medial dissociation , 2003, Neuropsychologia.

[14]  Mark W. Woolrich,et al.  Multilevel linear modelling for FMRI group analysis using Bayesian inference , 2004, NeuroImage.

[15]  A. Owen,et al.  Anterior prefrontal cortex: insights into function from anatomy and neuroimaging , 2004, Nature Reviews Neuroscience.

[16]  W. Schultz,et al.  Adaptive Coding of Reward Value by Dopamine Neurons , 2005, Science.

[17]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[18]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[19]  Giorgio Coricelli,et al.  Response to Comment on "The Involvement of the Orbitofrontal Cortex in the Experience of Regret" , 2005, Science.

[20]  J. O'Doherty,et al.  Regret and its avoidance: a neuroimaging study of choice behavior , 2005, Nature Neuroscience.

[21]  R. Deichmann,et al.  Optimized EPI for fMRI studies of the orbitofrontal cortex: compensation of susceptibility-induced gradients in the readout direction , 2007, Magnetic Resonance Materials in Physics, Biology and Medicine.

[22]  M. Schölvinck,et al.  Differential components of prospective memory? Evidence from fMRI , 2006, Neuropsychologia.

[23]  C. Frith,et al.  Meeting of minds: the medial frontal cortex and social cognition , 2006, Nature Reviews Neuroscience.

[24]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.

[25]  J. O'Doherty,et al.  The Role of the Ventromedial Prefrontal Cortex in Abstract State-Based Inference during Decision Making in Humans , 2006, The Journal of Neuroscience.

[26]  G. V. Van Hoesen,et al.  Neural connections of the posteromedial cortex in the macaque , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R. Saxe Uniquely human social cognition , 2006, Current Opinion in Neurobiology.

[28]  Katsuyuki Sakai,et al.  Prefrontal Set Activity Predicts Rule-Specific Neural Processing during Subsequent Cognitive Performance , 2006, The Journal of Neuroscience.

[29]  E. Koechlin,et al.  Serial Organization of Human Behavior in the Inferior Parietal Cortex , 2007, The Journal of Neuroscience.

[30]  Ventrolateral and Medial Frontal Contributions to Decision-Making and Action Selection , 2007 .

[31]  E. Koechlin,et al.  Anterior Prefrontal Function and the Limits of Human Decision-Making , 2007, Science.

[32]  Katsuyuki Sakai,et al.  Is the Prefrontal Cortex Necessary for Establishing Cognitive Sets? , 2007, The Journal of Neuroscience.

[33]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[34]  J. Wallis,et al.  Neuroscience of Rule-Guided Behavior , 2007 .

[35]  Kevin McCabe,et al.  Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.

[36]  D. Pandya,et al.  Efferent Association Pathways from the Rostral Prefrontal Cortex in the Macaque Monkey , 2007, The Journal of Neuroscience.

[37]  Colin Camerer,et al.  A framework for studying the neurobiology of value-based decision making , 2008, Nature Reviews Neuroscience.

[38]  Jim M. Monti,et al.  Neural repetition suppression reflects fulfilled perceptual expectations , 2008, Nature Neuroscience.

[39]  Samuel M. McClure,et al.  Anchors, scales and the relative coding of value in the brain , 2008, Current Opinion in Neurobiology.

[40]  Peter Bossaerts,et al.  Neural correlates of mentalizing-related computations during strategic interactions in humans , 2008, Proceedings of the National Academy of Sciences.

[41]  Mark W Woolrich,et al.  Associative learning of social value , 2008, Nature.

[42]  M. Brass,et al.  Unconscious determinants of free decisions in the human brain , 2008, Nature Neuroscience.

[43]  C. Summerfield,et al.  A Neural Representation of Prior Information during Perceptual Inference , 2008, Neuron.

[44]  Timothy E. J. Behrens,et al.  Choice, uncertainty and value in prefrontal and cingulate cortex , 2008, Nature Neuroscience.

[45]  Pearl H. Chiu,et al.  Smokers' brains compute, but ignore, a fictive error signal in a sequential investment task , 2008, Nature Neuroscience.

[46]  H. Seo,et al.  Cortical mechanisms for reinforcement learning in competitive games , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[47]  J. Gläscher,et al.  Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making. , 2009, Cerebral cortex.

[48]  Timothy Edward John Behrens,et al.  How Green Is the Grass on the Other Side? Frontopolar Cortex and the Evidence in Favor of Alternative Courses of Action , 2009, Neuron.

[49]  John M. Pearson,et al.  Fictive Reward Signals in the Anterior Cingulate Cortex , 2009, Science.

[50]  M. Rushworth,et al.  Behavioral / Systems / Cognitive Connectivity-Based Parcellation of Human Cingulate Cortex and Its Relation to Functional Specialization , 2008 .

[51]  P. Glimcher,et al.  The Neurobiology of Decision: Consensus and Controversy , 2009, Neuron.

[52]  Matthew F S Rushworth,et al.  The Computation of Social Behavior , 2009, Science.

[53]  Thomas H. B. FitzGerald,et al.  The Role of Human Orbitofrontal Cortex in Value Comparison for Incommensurable Objects , 2009, The Journal of Neuroscience.

[54]  John M. Pearson,et al.  Neurons in Posterior Cingulate Cortex Signal Exploratory Decisions in a Dynamic Multioption Choice Task , 2009, Current Biology.

[55]  Striatal Prediction Error Activity Drives Cortical Connectivity Changes During Associative Learning , 2009, NeuroImage.

[56]  M. Rushworth,et al.  General Mechanisms for Making Decisions? This Review Comes from a Themed Issue on Cognitive Neuroscience Edited the Representation of Value and Reward Expectations in Frontal Cortex Reward Prediction Errors and Learning Rates Other Types of Prediction Error , 2022 .

[57]  Karl J. Friston,et al.  A Dual Role for Prediction Error in Associative Learning , 2008, Cerebral cortex.

[58]  M. Roesch,et al.  A new perspective on the role of the orbitofrontal cortex in adaptive behaviour , 2009, Nature Reviews Neuroscience.

[59]  Etienne Koechlin,et al.  Divided Representation of Concurrent Goals in the Human Frontal Lobes , 2010, Science.

[60]  Antonio Rangel,et al.  Neural computations associated with goal-directed choice , 2010, Current Opinion in Neurobiology.

[61]  Aldo Genovesio,et al.  Evaluating self-generated decisions in frontal pole cortex of monkeys , 2009, Nature Neuroscience.