Optimization of Decision Making in Multilayer Networks: The Role of Locus Coeruleus

Previous theoretical work has shown that a single-layer neural network can implement the optimal decision process for simple, two-alternative forced-choice (2AFC) tasks. However, it is likely that the mammalian brain comprises multilayer networks, raising the question of whether and how optimal performance can be approximated in such an architecture. Here, we present theoretical work suggesting that the noradrenergic nucleus locus coeruleus (LC) may help optimize 2AFC decision making in the brain. This is based on the observations that neurons of the LC selectively fire following the presentation of salient stimuli in decision tasks and that the corresponding release of norepinephrine can transiently increase the responsivity, or gain, of cortical processing units. We describe computational simulations that investigate the role of such gain changes in optimizing performance of 2AFC decision making. In the tasks we model, no prior cueing or knowledge of stimulus onset time is assumed. Performance is assessed in terms of the rate of correct responses over time (the reward rate). We first present the results of a single-layer model that accumulates (integrates) sensory input and implements the decision process as a threshold crossing. Gain transients, representing the modulatory effect of the LC, are driven by separate threshold crossings in this layer. We optimize over all free parameters to determine the maximum reward rate achievable by this model and compare it to the maximum reward rate when gain is held fixed. We find that the dynamic gain mechanism yields no improvement in reward for this single-layer model. We then examine a two-layer model, in which competing sensory accumulators in the first layer (capable of implementing the task-relevant decision) pass activity to response accumulators in a second layer.
Again, we compare a version in which threshold crossing in the first (decision) layer elicits an LC response (and a concomitant increase in gain) with a fixed-gain version of the model. Here, we find that gain transients modeling the LC phasic response yield an improvement in reward rate of 12% to 24%. Furthermore, we show that the timing characteristics of these gain transients agree with observations concerning LC firing patterns reported in recent experimental studies. This provides converging evidence for the hypothesis that the LC optimizes processes underlying 2AFC decision making in multilayer networks.
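
To make the notion of reward rate concrete, the following is a minimal sketch of a single accumulator with fixed gain that integrates noisy evidence until it crosses a decision threshold. It is not the model used in this work: the function names, the parameter values (`drift`, `noise`, `threshold`, `dt`) and the fixed inter-trial `delay` are all illustrative assumptions, and no gain transient is included. Reward rate is taken here as the fraction of correct responses divided by the mean time per trial.

```python
import numpy as np


def simulate_trial(rng, drift=1.0, noise=1.0, threshold=1.0, dt=0.005, max_time=10.0):
    """Integrate noisy evidence until it reaches +threshold or -threshold.

    Returns (correct, decision_time). Crossing +threshold counts as correct;
    crossing -threshold, or timing out at max_time, counts as an error.
    All parameter values are hypothetical.
    """
    x, t = 0.0, 0.0
    step_sd = noise * np.sqrt(dt)  # standard deviation of the noise added per time step
    while abs(x) < threshold and t < max_time:
        x += drift * dt + step_sd * rng.standard_normal()
        t += dt
    return x >= threshold, t


def reward_rate(n_trials=500, delay=1.0, seed=0, **params):
    """Estimate reward rate as accuracy / (mean decision time + delay).

    `delay` is an assumed fixed interval between a response and the next stimulus.
    Extra keyword arguments are forwarded to simulate_trial.
    """
    rng = np.random.default_rng(seed)
    outcomes = [simulate_trial(rng, **params) for _ in range(n_trials)]
    accuracy = float(np.mean([correct for correct, _ in outcomes]))
    mean_rt = float(np.mean([t for _, t in outcomes]))
    return accuracy / (mean_rt + delay), accuracy, mean_rt


if __name__ == "__main__":
    # Scan a few thresholds to see the speed-accuracy trade-off in reward rate.
    for z in (0.5, 1.0, 2.0):
        rr, acc, rt = reward_rate(threshold=z)
        print(f"threshold={z:.1f}  accuracy={acc:.3f}  mean RT={rt:.3f}  reward rate={rr:.3f}")
```

Scanning the threshold in this way illustrates why an optimization over free parameters is needed: a low threshold gives fast but error-prone responses, a high threshold gives accurate but slow ones, and the reward rate depends on the balance between the two.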
