Risk prediction error signaling: A two-component response?

Organisms use rewards to navigate and adapt to (uncertain) environments. Error-based learning about rewards is supported by the dopaminergic system, which is thought to signal reward prediction errors that are used to adjust past predictions. More recently, the phasic dopamine response was suggested to have two components: the first, rapid component is thought to signal the detection of a potentially rewarding stimulus; the second, slightly later component characterizes the stimulus by its reward prediction error. Error-based learning signals have also been found for risk. However, it is unknown whether the neural generators of these signals employ a two-component coding scheme like that of the dopaminergic system. Here, using human high-density EEG, we ask whether risk learning, or more generally surprise-based learning under uncertainty, similarly comprises two temporally dissociable components. Using a simple card game, we show that the risk prediction error is reflected in the amplitude of the P3b component. This P3b modulation is preceded by an earlier component that is modulated by stimulus salience. Source analyses are compatible with the idea that both the early salience signal and the later risk prediction error signal are generated in insular, frontal, and temporal cortex. The identified sources are part of the risk processing network that receives input from noradrenergic cells in the locus coeruleus. Finally, the P3b amplitude modulation is mirrored by an analogous modulation of pupil size, which is consistent with the idea that both the P3b and pupil size indirectly reflect locus coeruleus activity.
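To make the two error signals concrete, the following is a minimal sketch under stated assumptions and not the task or model used in the study. It assumes a hypothetical two-outcome gamble (win +1 with probability `p_win`, lose 1 otherwise), takes risk to be the variance of the reward, and takes the risk prediction error to be the squared reward prediction error minus the expected risk. The function name and parameters are illustrative.

```python
def prediction_errors(p_win, won, win=1.0, loss=-1.0):
    """Return (reward_pe, risk_pe) for a hypothetical two-outcome gamble.

    Assumptions (not from the study): reward is `win` with probability
    `p_win` and `loss` otherwise; risk is defined as reward variance.
    """
    # Expected reward before the outcome is revealed.
    expected_reward = p_win * win + (1.0 - p_win) * loss
    # Expected risk: the variance of the reward around its expectation.
    expected_risk = (p_win * (win - expected_reward) ** 2
                     + (1.0 - p_win) * (loss - expected_reward) ** 2)
    outcome = win if won else loss
    # Reward prediction error: realized reward minus expected reward.
    reward_pe = outcome - expected_reward
    # Risk prediction error: realized squared deviation minus expected risk.
    risk_pe = reward_pe ** 2 - expected_risk
    return reward_pe, risk_pe


if __name__ == "__main__":
    # An unlikely loss yields a large positive risk prediction error,
    # a likely win a small negative one.
    print(prediction_errors(0.9, won=False))
    print(prediction_errors(0.9, won=True))
```

Under these assumptions the risk prediction error averages to zero over outcomes, just as the reward prediction error does, so it can serve as an error signal for learning about risk.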
