Temporal Difference Models and Reward-Related Learning in the Human Brain

[1]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[2]  E. Hirsch,et al.  Three‐dimensional cartography of functional territories in the human striatopallidal complex by using calbindin immunoreactivity , 2002, The Journal of comparative neurology.

[3]  J. O'Doherty,et al.  Neural Responses during Anticipation of a Primary Taste Reward , 2002, Neuron.

[4]  P. Montague,et al.  Activity in human ventral striatum locked to errors of reward prediction , 2002, Nature Neuroscience.

[5]  Brian Knutson,et al.  Dissociation of reward anticipation and outcome with event-related fMRI , 2001, Neuroreport.

[6]  J. M. Anderson,et al.  Responses of human frontal cortex to surprising events are predicted by formal associative learning theory , 2001, Nature Neuroscience.

[7]  Brian Knutson,et al.  Anticipation of Increasing Monetary Reward Selectively Recruits Nucleus Accumbens , 2001, The Journal of Neuroscience.

[8]  D. Kahneman,et al.  Functional Imaging of Neural Responses to Expectancy and Experience of Monetary Gains and Losses tasks with monetary payoffs , 2001 .

[9]  Samuel M. McClure,et al.  Predictability Modulates Human Brain Response to Reward , 2001, The Journal of Neuroscience.

[10]  E. Rolls,et al.  Representation of pleasant and aversive taste in the human brain. , 2001, Journal of neurophysiology.

[11]  L. Nystrom,et al.  Tracking the hemodynamic responses to reward and punishment in the striatum. , 2000, Journal of neurophysiology.

[12]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[13]  Karl J. Friston,et al.  Dissociable Neural Responses in Human Reward Systems , 2000, The Journal of Neuroscience.

[14]  P. Matthews,et al.  Learning about pain: the neural substrate of the prediction error for aversive events. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[15]  K. Hikosaka,et al.  Delay activity of orbital and lateral prefrontal neurons of the monkey varying with different rewards. , 2000, Cerebral cortex.

[16]  E. Rolls,et al.  The representation of pleasant touch in the brain and its relationship with taste and olfactory areas. , 1999, Neuroreport.

[17]  C. Frith,et al.  Orbitofrontal cortex is activated during breaches of expectation in tasks of visual attention , 1999, Nature Neuroscience.

[18]  H. Duvernoy The Human Brain , 1999, Springer Vienna.

[19]  G. Schoenbaum,et al.  Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning , 1998, Nature Neuroscience.

[20]  J. Hollerman,et al.  Reward prediction in primate basal ganglia and frontal cortex , 1998, Neuropharmacology.

[21]  A. Graybiel,et al.  Neurochemical architecture of the human striatum , 1997, The Journal of comparative neurology.

[22]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[23]  W. Schultz,et al.  Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[24]  H. Duvernoy The Human Brain Stem and Cerebellum , 1995, Springer Vienna.

[25]  Ternary Structures ON REPRESENTATION OF , 1995 .

[26]  Karl J. Friston,et al.  Spatial registration and normalization of images , 1995 .

[27]  W. Schultz,et al.  Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.

[28]  Karl J. Friston,et al.  Value-dependent selection in the brain: Simulation in a synthetic neural model , 1994, Neuroscience.

[29]  W. Schultz,et al.  Neuronal activity in monkey striatum related to the expectation of predictable environmental events. , 1992, Journal of neurophysiology.

[30]  W. Schultz,et al.  Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.

[31]  M. Gabriel,et al.  Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .

[32]  Richard S. Sutton,et al.  Time-Derivative Models of Pavlovian Reinforcement , 1990 .

[33]  R. Oades,et al.  Ventral tegmental (A10) system: neurobiology. 1. Anatomy and connectivity , 1987, Brain Research Reviews.

[34]  J. Pearce,et al.  A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980, Psychological review.

[35]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .