Behavioral and Neural Changes after Gains and Losses of Conditioned Reinforcers

Human behaviors can be more powerfully influenced by conditioned reinforcers, such as money, than by primary reinforcers. Moreover, people often change their behaviors to avoid monetary losses. However, the effect of removing conditioned reinforcers on choices has not been explored in animals, and the neural mechanisms mediating the behavioral effects of gains and losses are not well understood. To investigate the behavioral and neural effects of gaining and losing a conditioned reinforcer, we trained rhesus monkeys to perform a matching pennies task in which the positive and negative values of the payoff matrix were realized by the delivery and removal of a conditioned reinforcer. Consistent with findings previously obtained with non-negative payoffs and primary rewards, the animals' choice behavior during this task was nearly optimal. Nevertheless, the gain and loss of a conditioned reinforcer significantly increased and decreased, respectively, the animals' tendency to choose the same target in subsequent trials. We also found that neurons in the dorsomedial frontal cortex, dorsal anterior cingulate cortex, and dorsolateral prefrontal cortex often changed their activity according to whether the animal earned or lost a conditioned reinforcer in the current or previous trial. Moreover, many neurons in the dorsomedial frontal cortex also signaled the gain or loss resulting from the choice of a particular action, as well as the changes in the animal's behavior that followed such gains or losses. Thus, the primate medial frontal cortex might mediate the behavioral effects of conditioned reinforcers and their losses.
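The behavioral measure described above, the tendency to repeat a choice after a gain versus after a loss, can be illustrated with a minimal Python sketch. This is not the authors' analysis: the function name `stay_probability`, the outcome labels, and the toy trial sequence are all hypothetical and serve only to show how such a conditional "stay" probability might be tabulated.

```python
from collections import defaultdict


def stay_probability(choices, outcomes):
    """Estimate P(repeat the previous choice), grouped by the previous outcome.

    choices  -- sequence of chosen targets, one per trial (hypothetical labels)
    outcomes -- sequence of outcomes, one per trial (e.g. "gain" or "loss")
    """
    stays = defaultdict(int)   # times the choice was repeated after each outcome
    totals = defaultdict(int)  # trials that followed each outcome
    for t in range(1, len(choices)):
        prev_outcome = outcomes[t - 1]
        totals[prev_outcome] += 1
        if choices[t] == choices[t - 1]:
            stays[prev_outcome] += 1
    return {outcome: stays[outcome] / totals[outcome] for outcome in totals}


# Hypothetical toy session, not data from the study.
choices = ["L", "L", "R", "R", "L", "L", "R"]
outcomes = ["gain", "loss", "gain", "loss", "gain", "loss", "gain"]
result = stay_probability(choices, outcomes)
```

In this toy sequence every gain is followed by a repeated choice and every loss by a switch, so the two conditional probabilities sit at opposite extremes. Real choice data near the optimal mixed strategy would be expected to show a much smaller difference between the two.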