Coding of Predicted Reward Omission by Dopamine Neurons in a Conditioned Inhibition Paradigm

Animals learn not only about stimuli that predict reward but also about those that signal the omission of an expected reward. We used a conditioned inhibition paradigm derived from animal learning theory to train a discrimination between a visual stimulus that predicted reward (conditioned excitor) and a second stimulus that predicted the omission of reward (conditioned inhibitor). Performing the discrimination required attention to both the conditioned excitor and the inhibitor; however, dopamine neurons showed very different responses to the two classes of stimuli. Conditioned inhibitors elicited considerable depressions in 48 of 69 neurons (median of 35% below baseline) and minor activations in 29 of 69 neurons (69% above baseline), whereas reward-predicting excitors induced pure activations in all 69 neurons tested (242% above baseline), thereby demonstrating that the neurons discriminated between conditioned stimuli predicting reward versus nonreward. The discriminative responses to stimuli with differential reward-predicting but common attentional functions indicate differential neural coding of reward prediction and attention. The neuronal responses appear to reflect reward prediction errors, thus suggesting an extension of the correspondence between learning theory and activity of single dopamine neurons to the prediction of nonreward.

[1]  R. Rescorla Pavlovian conditioned inhibition , 1969 .

[2]  N. Humphrey ‘Interest’ and ‘Pleasure’: Two Determinants of a Monkey's Visual Preferences , 1972, Perception.

[3]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[4]  R. Eisenberger Explanation of rewards that do not reduce tissue needs. , 1972, Psychological bulletin.

[5]  Robert A. Rescorla,et al.  Second-order conditioning: Implications for theories of learning. , 1973 .

[6]  R. Rescorla,et al.  Extinction of Pavlovian conditioned inhibition. , 1974, Journal of comparative and physiological psychology.

[7]  M Mishkin,et al.  An analysis of short-term visual memory in the monkey. , 1975, Journal of experimental psychology. Animal behavior processes.

[8]  R. Rescorla Second-order conditioning of Pavlovian conditioned inhibition , 1976 .

[9]  R. Rescorla Simultaneous second-order conditioning produces S-S learning in conditioned suppression. , 1982, Journal of experimental psychology. Animal behavior processes.

[10]  C. Bruce,et al.  Cerebral cortical activity associated with the orientation of visual attention in the rhesus monkey , 1985, Vision Research.

[11]  T. Robbins,et al.  Depletion of unilateral striatal dopamine impairs initiation of contralateral actions and not sensory attention , 1985, Nature.

[12]  R. Desimone,et al.  Selective attention gates visual processing in the extrastriate cortex. , 1985, Science.

[13]  W. Schultz,et al.  Responses of nigrostriatal dopamine neurons to high-intensity somatosensory stimulation in the anesthetized monkey. , 1987, Journal of neurophysiology.

[14]  C. Marsden,et al.  Internal versus external cues and the control of attention in Parkinson's disease. , 1988, Brain : a journal of neurology.

[15]  D A Washburn,et al.  Perceived control in rhesus monkeys (Macaca mulatta): enhanced video-task performance. , 1991, Journal of experimental psychology. Animal behavior processes.

[16]  S. Rose Selective attention , 1992, Nature.

[17]  R. Wise,et al.  Localization of drug reward mechanisms by intracranial injections , 1992, Synapse.

[18]  W. Schultz,et al.  Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.

[19]  W. Schultz Activity of dopamine neurons in the behaving primate , 1992 .

[20]  K. Berridge,et al.  The neural basis of drug craving: An incentive-sensitization theory of addiction , 1993, Brain Research Reviews.

[21]  Supernormal conditioning. , 1995, Journal of experimental psychology. Animal behavior processes.

[22]  W. Schultz,et al.  Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[23]  John H. R. Maunsell,et al.  Attentional modulation of visual motion processing in cortical areas MT and MST , 1996, Nature.

[24]  V. Brown,et al.  Covert Orienting of Attention in the Rat and the Role of Striatal Dopamine , 1996, The Journal of Neuroscience.

[25]  T. Robbins,et al.  Neurobehavioural mechanisms of reward and motivation , 1996, Current Opinion in Neurobiology.

[26]  J. Horvitz,et al.  Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat , 1997, Brain Research.

[27]  S. Yantis,et al.  Visual attention: control, representation, and time course. , 1997, Annual review of psychology.

[28]  S. Yamaguchi,et al.  Contributions of the Dopaminergic System to Voluntary and Automatic Orienting of Visuospatial Attention , 1998, The Journal of Neuroscience.

[29]  J. Hollerman,et al.  Dopamine neurons report an error in the temporal prediction of reward during learning , 1998, Nature Neuroscience.

[30]  P. Redgrave,et al.  Is the short-latency dopamine response too short to signal reward error? , 1999, Trends in Neurosciences.

[31]  Trevor W. Robbins,et al.  Enhanced and Impaired Attentional Performance After Infusion of D1 Dopaminergic Receptor Agents into Rat Prefrontal Cortex , 2000, The Journal of Neuroscience.

[32]  J. Horvitz Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events , 2000, Neuroscience.

[33]  H. Poizner,et al.  Automatic orienting of visuospatial attention in Parkinson's disease , 2001, Neuropsychologia.

[34]  W. Schultz,et al.  Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.

[35]  Peter Dayan,et al.  Dopamine: generalization and bonuses , 2002, Neural Networks.

[36]  W. Schultz,et al.  Discrete Coding of Reward Probability and Uncertainty by Dopamine Neurons , 2003, Science.