Temporal Filtering of Reward Signals in the Dorsal Anterior Cingulate Cortex during a Mixed-Strategy Game

Decision making in humans and other animals is adaptive and can be tuned through experience to optimize the outcomes of choices in a dynamic environment. Previous studies have demonstrated that the anterior cingulate cortex plays an important role in updating the animal's behavioral strategies when action-outcome contingencies change. Moreover, neurons in the anterior cingulate cortex often encode signals related to expected or actual reward. We investigated whether reward-related activity in the anterior cingulate cortex is affected by the animal's previous reward history. This was tested in rhesus monkeys trained to make binary choices in a computer-simulated competitive zero-sum game. The animal's choice behavior was relatively close to the optimal strategy but also revealed small systematic biases that are consistent with the use of a reinforcement learning algorithm. In addition, the activity of neurons in the dorsal anterior cingulate cortex that was related to the reward received by the animal in a given trial was often modulated by the rewards in previous trials. Some of these neurons encoded the rate of rewards in previous trials, whereas others displayed activity modulations more closely related to reward prediction errors. In contrast, signals related to the animal's choices were represented only weakly in this cortical area. These results suggest that neurons in the dorsal anterior cingulate cortex might be involved in the subjective evaluation of choice outcomes based on the animal's reward history.
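
The two quantities named in the abstract, a reward rate over previous trials and a reward prediction error, can be illustrated with a minimal sketch. This is not the authors' model: the function names, the learning rate `alpha`, the inverse temperature `beta`, and the starting values of 0.5 are all assumptions chosen for illustration, using a generic delta-rule update for a two-choice task.

```python
import math

def choice_prob_right(q_left, q_right, beta=1.0):
    """Probability of choosing 'right' under a logistic (softmax) rule.
    beta is a hypothetical inverse-temperature parameter."""
    return 1.0 / (1.0 + math.exp(-beta * (q_right - q_left)))

def run_session(choices, rewards, alpha=0.2):
    """Track value estimates, reward prediction errors, and a filtered reward rate.

    choices: sequence of 0 (left) or 1 (right)
    rewards: sequence of 0 (no reward) or 1 (reward)
    alpha:   hypothetical learning rate, also used as the filter constant
    """
    q = [0.5, 0.5]   # assumed initial value estimate for each target
    rate = 0.5       # assumed initial reward-rate estimate
    rpes, rates = [], []
    for c, r in zip(choices, rewards):
        rates.append(rate)         # rate based on previous trials only
        rpe = r - q[c]             # reward prediction error for the chosen target
        rpes.append(rpe)
        q[c] += alpha * rpe        # delta-rule update of the chosen value
        rate += alpha * (r - rate) # exponential filter over the reward history
    return q, rpes, rates

if __name__ == "__main__":
    q, rpes, rates = run_session([0, 0, 1, 0], [1, 1, 0, 1])
    print("values:", q)
    print("prediction errors:", rpes)
    print("reward rate before each trial:", rates)
    print("P(right):", choice_prob_right(q[0], q[1]))
```

In this sketch the reward rate weights recent outcomes more heavily than older ones, whereas the prediction error is the difference between the current outcome and the value expected from that history.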
