Adaptive learning under expected and unexpected uncertainty

The outcome of a decision is often uncertain, and outcomes can vary over repeated decisions. Whether decision outcomes should substantially affect behaviour and learning depends on whether they are representative of a typically experienced range of outcomes or signal a change in the reward environment. Successful learning and decision-making therefore require the ability to estimate expected uncertainty (related to the variability of outcomes) and unexpected uncertainty (related to the variability of the environment). Understanding the bases and effects of these two types of uncertainty and the interactions between them, at both the computational and the neural level, is crucial for understanding adaptive learning. Here, we examine computational models and experimental findings to distil computational principles and neural mechanisms for adaptive learning under uncertainty.

Successful learning and decision-making require estimates of expected uncertainty and unexpected uncertainty. Soltani and Izquierdo define these concepts, describe proposed models of how they may be computed and discuss their neural substrates.
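
The distinction between the two kinds of uncertainty can be made concrete with a toy learner. The sketch below is an illustration only and is not a model proposed by the authors: it is a simple delta rule that keeps a running estimate of typical outcome variability (standing in for expected uncertainty) and raises its learning rate when a prediction error is large relative to that estimate (standing in for unexpected uncertainty). The function name, the parameters `base_rate`, `max_rate`, `spread_rate` and `threshold`, and all numerical values are assumptions chosen for illustration.

```python
def adaptive_delta_rule(outcomes, base_rate=0.1, max_rate=0.8,
                        spread_rate=0.1, threshold=3.0):
    """Toy delta-rule learner with a surprise-gated learning rate.

    Hypothetical illustration: 'spread' tracks the typical size of
    prediction errors (a stand-in for expected uncertainty); an error much
    larger than 'spread' is treated as a sign that the environment changed
    (a stand-in for unexpected uncertainty) and triggers faster learning.
    """
    estimate = float(outcomes[0])  # start from the first observed outcome
    spread = 1.0                   # assumed initial outcome variability
    history = []
    for outcome in outcomes[1:]:
        error = outcome - estimate
        surprise = abs(error) / max(spread, 1e-6)  # error in units of spread
        if surprise > threshold:
            # Outcome falls outside the usual range: update quickly and
            # leave the variability estimate untouched.
            rate = max_rate
        else:
            # Outcome is within the usual range: update slowly and refine
            # the variability estimate.
            rate = base_rate
            spread += spread_rate * (abs(error) - spread)
        estimate += rate * error
        history.append((estimate, rate))
    return history


# Outcomes fluctuate around 10, then the environment jumps to around 30.
outcomes = [10, 11, 9, 10, 11, 9, 10, 30, 31, 29, 30]
history = adaptive_delta_rule(outcomes)
for step, (estimate, rate) in enumerate(history, start=1):
    print(f"step {step:2d}: estimate={estimate:6.2f}  learning rate={rate}")
```

In this example the learner keeps the low learning rate while outcomes stay within their usual range, switches to the high rate when the mean jumps, and returns to the low rate once its estimate has caught up. A fixed threshold is a crude choice; the Bayesian and metaplasticity-based models discussed in this literature handle the trade-off in more principled ways.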


[58]  Nathaniel J. Smith,et al.  Methylphenidate modifies the motion of the circadian clock Lamotrigine in mood disorders and cocaine dependence Cortical glutamate in postpartum depression Chronic Exposure to Methamphetamine Disrupts Reinforcement-Based Decision-Making in Rats , 2017 .

[59]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[60]  W. Abraham Metaplasticity: tuning synapses and networks for plasticity , 2008, Nature Reviews Neuroscience.

[61]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[62]  Elisabeth A. Murray,et al.  Specialized Representations of Value in the Orbital and Ventrolateral Prefrontal Cortex: Desirability versus Availability of Outcomes , 2017, Neuron.

[63]  Samuel Gershman,et al.  Dopamine, Inference, and Uncertainty , 2017, bioRxiv.

[64]  A. Izquierdo Functional Heterogeneity within Rat Orbitofrontal Cortex in Reward Learning and Decision Making , 2017, The Journal of Neuroscience.

[65]  Ilya E. Monosov,et al.  Anterior cingulate is a source of valence-specific information about value and uncertainty , 2016, Nature Communications.

[66]  Marc A Sommer,et al.  Advances in Understanding Mechanisms of Thalamic Relays in Cognition and Behavior , 2016 .

[67]  Timothy E. J. Behrens,et al.  Review Frontal Cortex and Reward-guided Learning and Decision-making Figure 1. Frontal Brain Regions in the Macaque Involved in Reward-guided Learning and Decision-making Finer Grained Anatomical Divisions with Frontal Cortical Systems for Reward-guided Behavior , 2022 .

[68]  K. Preuschoff,et al.  Adding Prediction Risk to the Theory of Reward Learning , 2007, Annals of the New York Academy of Sciences.

[69]  Erin L. Rich,et al.  Decoding subjective decisions from orbitofrontal cortex , 2016, Nature Neuroscience.

[70]  Bruno B. Averbeck,et al.  Theory of Choice in Bandit, Information Sampling and Foraging Tasks , 2015, PLoS Comput. Biol..

[71]  W. Schultz,et al.  Coding of Reward Risk by Orbitofrontal Neurons Is Mostly Distinct from Coding of Reward Value , 2010, Neuron.

[72]  Geoffrey Schoenbaum,et al.  Neural Estimates of Imagined Outcomes in Basolateral Amygdala Depend on Orbitofrontal Cortex , 2015, The Journal of Neuroscience.

[73]  Sara E. Morrison,et al.  Different Time Courses for Learning-Related Changes in Amygdala and Orbitofrontal Cortex , 2011, Neuron.

[74]  Kerstin Preuschoff,et al.  Balancing New against Old Information: The Role of Puzzlement Surprise in Learning , 2018, Neural Computation.

[75]  Rudolf N Cardinal,et al.  Effects of lesions of the nucleus accumbens core on choice between small certain rewards and large uncertain rewards in rats , 2005, BMC Neuroscience.

[76]  Xiao-Jing Wang,et al.  A Biophysically Based Neural Model of Matching Law Behavior: Melioration by Stochastic Synapses , 2006, The Journal of Neuroscience.

[77]  Ilya E Monosov,et al.  Neurons in the primate dorsal striatum signal the uncertainty of object–reward associations , 2016, Nature Communications.

[78]  C. H. Donahue,et al.  Dynamic Routing of Task-relevant Signals for Decision Making in Dorsolateral Prefrontal Cortex , 2015, Nature Neuroscience.

[79]  P. Dayan,et al.  Dopamine, uncertainty and TD learning , 2005, Behavioral and Brain Functions.

[80]  D. Amaral,et al.  Amygdalo‐cortical projections in the monkey (Macaca fascicularis) , 1984, The Journal of comparative neurology.

[81]  Joseph T. McGuire,et al.  Functionally Dissociable Influences on Learning Rate in a Dynamic Environment , 2014, Neuron.

[82]  Peter Bossaerts,et al.  The Neural Representation of Unexpected Uncertainty during Value-Based Decision Making , 2013, Neuron.

[83]  Timothy E. J. Behrens,et al.  Optimal decision making and the anterior cingulate cortex , 2006, Nature Neuroscience.

[84]  Timothy Edward John Behrens,et al.  Separable Learning Systems in the Macaque Brain and the Role of Orbitofrontal Cortex in Contingent Learning , 2010, Neuron.

[85]  Vincent D Costa,et al.  Motivational neural circuits underlying reinforcement learning , 2017, Nature Neuroscience.

[86]  Ian Krajbich,et al.  Computational modeling of epiphany learning , 2017, Proceedings of the National Academy of Sciences.

[87]  Vincent D Costa,et al.  The Role of Frontal Cortical and Medial-Temporal Lobe Brain Areas in Learning a Bayesian Prior Belief on Reversals , 2015, The Journal of Neuroscience.

[88]  Andrew M. Wikenheiser,et al.  Over the river, through the woods: cognitive maps in the hippocampus and orbitofrontal cortex , 2016, Nature Reviews Neuroscience.

[89]  J. Wallis Cross-species studies of orbitofrontal cortex and value-based decision-making , 2011, Nature Neuroscience.

[90]  Anna S. Mitchell,et al.  Critical role for the mediodorsal thalamus in permitting rapid reward-guided updating in stochastic reward environments , 2016, eLife.

[91]  R. Dolan,et al.  Knowing how much you don't know: a neural organization of uncertainty estimates , 2012, Nature Reviews Neuroscience.

[92]  S. Floresco,et al.  Multifaceted Contributions by Different Regions of the Orbitofrontal and Medial Prefrontal Cortex to Probabilistic Reversal Learning , 2016, The Journal of Neuroscience.

[93]  K. Doya Modulators of decision making , 2008, Nature Neuroscience.

[94]  Anna S. Mitchell,et al.  Dissociable Performance on Scene Learning and Strategy Implementation after Lesions to Magnocellular Mediodorsal Thalamic Nucleus , 2007, The Journal of Neuroscience.

[95]  S. Floresco,et al.  Separate Prefrontal-Subcortical Circuits Mediate Different Components of Risk-Based Decision Making , 2012, The Journal of Neuroscience.

[96]  Vincent D Costa,et al.  Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning , 2016, Neuron.

[97]  P. Rudebeck,et al.  The neural basis of reversal learning: An updated perspective , 2017, Neuroscience.

[98]  G. Schoenbaum,et al.  Model‐based learning and the contribution of the orbitofrontal cortex to the model‐free world , 2012, The European journal of neuroscience.

[99]  K. Doya,et al.  Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats , 2012, The European journal of neuroscience.

[100]  D. Kumaran,et al.  An Unexpected Sequence of Events: Mismatch Detection in the Human Hippocampus , 2006, PLoS biology.

[101]  Guillem R. Esber,et al.  Surprise! Neural correlates of Pearce–Hall and Rescorla–Wagner coexist within the brain , 2012, The European journal of neuroscience.

[102]  Jung Hoon Sul,et al.  Distinct Roles of Rodent Orbitofrontal and Medial Prefrontal Cortex in Decision Making , 2010, Neuron.

[103]  S. Floresco,et al.  Fundamental Contribution by the Basolateral Amygdala to Different Forms of Decision Making , 2009, The Journal of Neuroscience.

[104]  Jill X. O'Reilly,et al.  Making predictions in a changing world—inference, uncertainty, and learning , 2013, Front. Neurosci..

[105]  G. Schoenbaum,et al.  Orbitofrontal neurons signal reward predictions, not reward prediction errors , 2018, Neurobiology of Learning and Memory.

[106]  Vincent D Costa,et al.  Reversal Learning and Dopamine: A Bayesian Perspective , 2015, The Journal of Neuroscience.

[107]  Joshua I. Gold,et al.  A Mixture of Delta-Rules Approximation to Bayesian Inference in Change-Point Problems , 2013, PLoS Comput. Biol..

[108]  J. Pearce,et al.  A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980, Psychological review.

[109]  Peter Bossaerts,et al.  Risk, Unexpected Uncertainty, and Estimation Uncertainty: Bayesian Learning in Unstable Settings , 2011, PLoS Comput. Biol..

[110]  Robert C. Wilson,et al.  An Approximately Bayesian Delta-Rule Model Explains the Dynamics of Belief Updating in a Changing Environment , 2010, The Journal of Neuroscience.

[111]  G. Schoenbaum,et al.  Back to basics: Making predictions in the orbitofrontal–amygdala circuit , 2016, Neurobiology of Learning and Memory.

[112]  John M. Pearson,et al.  Surprise Signals in Anterior Cingulate Cortex: Neuronal Encoding of Unsigned Reward Prediction Errors Driving Adjustment in Behavior , 2011, The Journal of Neuroscience.

[113]  M. Jung,et al.  Differential coding of uncertain reward in rat insular and orbitofrontal cortex , 2016, Scientific Reports.

[114]  Brent A. Vogt,et al.  Cytoarchitecture of mouse and rat cingulate cortex with human homologies , 2012, Brain Structure and Function.

[115]  Jacqueline Scholl,et al.  Simultaneous representation of a spectrum of dynamically changing value estimates during decision making , 2017, Nature Communications.

[116]  Peter N. C. Mohr,et al.  Genetic variation in dopaminergic neuromodulation influences the ability to rapidly and flexibly adapt decisions , 2009, Proceedings of the National Academy of Sciences.

[117]  Laurence T. Hunt,et al.  Triple Dissociation of Attention and Decision Computations across Prefrontal Cortex , 2017, Nature Neuroscience.