Pupil Diameter Predicts Changes in the Exploration–Exploitation Trade-off: Evidence for the Adaptive Gain Theory

The adaptive regulation of the balance between exploitation and exploration is critical for the optimization of behavioral performance. Animal research and computational modeling have suggested that changes in exploitative versus exploratory control state in response to changes in task utility are mediated by the neuromodulatory locus coeruleus–norepinephrine (LC–NE) system. Recent studies have suggested that utility-driven changes in control state correlate with pupil diameter, and that pupil diameter can be used as an indirect marker of LC activity. We measured participants' pupil diameter while they performed a gambling task with a gradually changing payoff structure. Each choice in this task can be classified as exploitative or exploratory using a computational model of reinforcement learning. We examined the relationship between pupil diameter, task utility, and choice strategy (exploitation vs. exploration), and found that (i) exploratory choices were preceded by a larger baseline pupil diameter than exploitative choices; (ii) individual differences in baseline pupil diameter were predictive of an individual's tendency to explore; and (iii) changes in pupil diameter surrounding the transition between exploitative and exploratory choices correlated with changes in task utility. These findings provide novel evidence that pupil diameter correlates closely with control state, and are consistent with a role for the LC–NE system in the regulation of the exploration–exploitation trade-off in humans.
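As an illustration of the model-based classification mentioned above, the following is a minimal sketch and not the authors' model. It assumes a simple delta-rule learner with a fixed learning rate, and it labels a choice as exploitative when the chosen option has the highest current value estimate and as exploratory otherwise. The function name `classify_choices`, the parameters `alpha` and `initial_value`, and the four-option default are hypothetical choices made only for this example.

```python
from typing import List, Sequence


def classify_choices(
    choices: Sequence[int],
    rewards: Sequence[float],
    n_arms: int = 4,             # assumed number of options (hypothetical)
    alpha: float = 0.3,          # assumed fixed learning rate (hypothetical)
    initial_value: float = 0.0,  # assumed starting value estimate (hypothetical)
) -> List[str]:
    """Label each choice 'exploit' or 'explore' under a delta-rule learner."""
    values = [initial_value] * n_arms
    labels = []
    for choice, reward in zip(choices, rewards):
        # Classify BEFORE learning from this trial's outcome: the choice is
        # exploitative if it targets an option with the highest current
        # estimate (ties count as exploitation in this sketch).
        best = max(values)
        labels.append("exploit" if values[choice] == best else "explore")
        # Delta-rule update of the chosen option's value estimate.
        values[choice] += alpha * (reward - values[choice])
    return labels


if __name__ == "__main__":
    print(classify_choices([0, 1, 1, 0], [10, 5, 5, 0]))
```

With trial-wise labels of this kind, baseline pupil diameter can be compared between trials labeled as exploratory and trials labeled as exploitative.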
