Learning optimal decisions with confidence

Diffusion decision models are immensely successful models for human and animal decisions under uncertainty. The decisions they model require the temporal accumulation of evidence to improve choice accuracy, and thus balance the trade-off between the decisions’ speed and their accuracy. Commonly, diffusion models have a one-dimensional abstract input that represents noisy momentary decision-related evidence. However, the nervous system typically uses population codes to represent sensory variables, which implies that the momentary evidence is distributed across multiple inputs. It is currently unknown how decision makers could learn how to combine these multiple inputs to obtain a one-dimensional abstract decision variable. We present here a Bayesian learning rule for learning a near-optimal linear combination of these inputs based on trial-by-trial feedback. The rule is Bayesian in the sense that it learns not only the mean of the weights but also the uncertainty around this mean in the form of a covariance matrix. This yields a rule whose learning rate is strongly modulated by decision confidence, providing a computational role of the latter for every-day decisions. Furthermore, we show that, in volatile environments, the rule predicts a bias towards repeating the same choice after correct decisions, with a bias strength that is modulated by the previous choice’ difficulty. Last, we extend our learning rule to cases for which one of the choices is more likely a-priori, which provides new insights into how such biases modulate the mechanisms leading to optimal decisions in diffusion models.

[1]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[2]  Alexandre Pouget,et al.  Optimal policy for value-based decision-making , 2016, Nature Communications.

[3]  Andrew D. Zaharia,et al.  The Detection of Visual Contrast in the Behaving Mouse , 2011, The Journal of Neuroscience.

[4]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[5]  Rajesh P. N. Rao,et al.  Bayesian brain : probabilistic approaches to neural coding , 2006 .

[6]  Roger Ratcliff,et al.  A Theory of Memory Retrieval. , 1978 .

[7]  J. Han,et al.  The transcription factor Pou3f1 promotes neural fate commitment via activation of neural lineage genes and inhibition of external signaling pathways , 2014, eLife.

[8]  Alexandre Pouget,et al.  Relation between Belief and Performance in Perceptual Decision Making , 2014, PloS one.

[9]  Alexandre Pouget,et al.  Tuning the speed-accuracy trade-off to maximize reward rate in multisensory decision-making , 2015, eLife.

[10]  Thomas M. Cover,et al.  Elements of Information Theory: Cover/Elements of Information Theory, Second Edition , 2005 .

[11]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[12]  A. Pouget,et al.  The Cost of Accumulating Evidence in Perceptual Decision Making , 2012, The Journal of Neuroscience.

[13]  M. Shadlen,et al.  Representation of Confidence Associated with a Decision by Neurons in the Parietal Cortex , 2009, Science.

[14]  Alexandre Pouget,et al.  Confidence and certainty: distinct probabilistic quantities for different goals , 2016, Nature Neuroscience.

[15]  Felix A. Wichmann,et al.  Rapid animal detection in natural scenes: Critical features are local , 2010 .

[16]  A. Pouget,et al.  Neural correlations, population coding and computation , 2006, Nature Reviews Neuroscience.

[17]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[18]  Tom Minka,et al.  Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[19]  Roger Ratcliff,et al.  Modeling confidence judgments, response times, and multiple choices in decision making: recognition memory and motion discrimination. , 2013, Psychological review.

[20]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[21]  Manfred Opper,et al.  A Bayesian approach to on-line learning , 1999 .

[22]  S. Dehaene,et al.  Brain networks for confidence weighting and hierarchical inference during probabilistic learning , 2017, Proceedings of the National Academy of Sciences.

[23]  Aaron C. Courville,et al.  Bayesian theories of conditioning in a changing world , 2006, Trends in Cognitive Sciences.

[24]  Rubén Moreno-Bote,et al.  Decision Confidence and Uncertainty in Diffusion Models with Partially Correlated Neuronal Integrators , 2010, Neural Computation.

[25]  Lihong Li,et al.  Unbiased online active learning in data streams , 2011, KDD.

[26]  G. DeAngelis,et al.  Neural Correlates of Prior Expectations of Motion in the Lateral Intraparietal and Middle Temporal Areas , 2012, The Journal of Neuroscience.

[27]  Peter I. Frazier,et al.  Sequential Hypothesis Testing under Stochastic Deadlines , 2007, NIPS.

[28]  Timothy D. Hanks,et al.  Probabilistic Population Codes for Bayesian Decision Making , 2008, Neuron.

[29]  Roger Ratcliff,et al.  The Diffusion Decision Model: Theory and Data for Two-Choice Decision Tasks , 2008, Neural Computation.

[30]  Philip L. Smith,et al.  A comparison of sequential sampling models for two-choice reaction time. , 2004, Psychological review.

[31]  Alexandre Pouget,et al.  The impact of learning on perceptual decisions and its implication for speed-accuracy tradeoffs , 2018, bioRxiv.

[32]  Jonathan D. Cohen,et al.  The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. , 2006, Psychological review.

[33]  Timothy D. Hanks,et al.  Elapsed Decision Time Affects the Weighting of Prior Probability in a Perceptual Decision Task , 2011, The Journal of Neuroscience.

[34]  M. Shadlen,et al.  The effect of stimulus strength on the speed and accuracy of a perceptual decision. , 2005, Journal of vision.

[35]  Anne E. Urai,et al.  Pupil-linked arousal is driven by decision uncertainty and alters serial choice bias , 2017, Nature Communications.

[36]  David J. Anderson,et al.  Ventromedial hypothalamic neurons control a defensive emotion state , 2015, eLife.

[37]  H. D. Miller,et al.  The Theory Of Stochastic Processes , 1977, The Mathematical Gazette.

[38]  Joaquin Quiñonero Candela,et al.  Web-Scale Bayesian Click-Through rate Prediction for Sponsored Search Advertising in Microsoft's Bing Search Engine , 2010, ICML.

[39]  Jan Drugowitsch,et al.  Fast and accurate Monte Carlo sampling of first-passage times from Wiener diffusion models , 2016, Scientific Reports.

[40]  Daniel M. Wolpert,et al.  Counterfactual Reasoning Underlies the Learning of Priors in Decision Making , 2017, Neuron.

[41]  Alexandre Pouget,et al.  Optimal multisensory decision-making in a reaction-time task , 2014, eLife.

[42]  Peter Dayan,et al.  Uncertainty and Learning , 2003 .

[43]  J. Gold,et al.  Banburismus and the Brain Decoding the Relationship between Sensory Stimuli, Decisions, and Reward , 2002, Neuron.

[44]  Jonathan D. Cohen,et al.  Sequential effects: Superstition or rational behavior? , 2008, NIPS.

[45]  A. Pouget,et al.  Information-limiting correlations , 2014, Nature Neuroscience.

[46]  Konrad Paul Kording,et al.  Bayesian integration in sensorimotor learning , 2004, Nature.

[47]  Manuel Perea,et al.  The overlap model: a model of letter position coding. , 2008, Psychological review.

[48]  Roger Ratcliff,et al.  A Theory of Order Relations in Perceptual Matching , 2005 .