Temporal Difference Models and Reward-Related Learning in the Human Brain
暂无分享,去创建一个
[1] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[2] E. Hirsch,et al. Three‐dimensional cartography of functional territories in the human striatopallidal complex by using calbindin immunoreactivity , 2002, The Journal of comparative neurology.
[3] J. O'Doherty,et al. Neural Responses during Anticipation of a Primary Taste Reward , 2002, Neuron.
[4] P. Montague,et al. Activity in human ventral striatum locked to errors of reward prediction , 2002, Nature Neuroscience.
[5] Brian Knutson,et al. Dissociation of reward anticipation and outcome with event-related fMRI , 2001, Neuroreport.
[6] J. M. Anderson,et al. Responses of human frontal cortex to surprising events are predicted by formal associative learning theory , 2001, Nature Neuroscience.
[7] Brian Knutson,et al. Anticipation of Increasing Monetary Reward Selectively Recruits Nucleus Accumbens , 2001, The Journal of Neuroscience.
[8] D. Kahneman,et al. Functional Imaging of Neural Responses to Expectancy and Experience of Monetary Gains and Losses tasks with monetary payoffs , 2001 .
[9] Samuel M. McClure,et al. Predictability Modulates Human Brain Response to Reward , 2001, The Journal of Neuroscience.
[10] E. Rolls,et al. Representation of pleasant and aversive taste in the human brain. , 2001, Journal of neurophysiology.
[11] L. Nystrom,et al. Tracking the hemodynamic responses to reward and punishment in the striatum. , 2000, Journal of neurophysiology.
[12] S. Kakade,et al. Learning and selective attention , 2000, Nature Neuroscience.
[13] Karl J. Friston,et al. Dissociable Neural Responses in Human Reward Systems , 2000, The Journal of Neuroscience.
[14] P. Matthews,et al. Learning about pain: the neural substrate of the prediction error for aversive events. , 2000, Proceedings of the National Academy of Sciences of the United States of America.
[15] K. Hikosaka,et al. Delay activity of orbital and lateral prefrontal neurons of the monkey varying with different rewards. , 2000, Cerebral cortex.
[16] E. Rolls,et al. The representation of pleasant touch in the brain and its relationship with taste and olfactory areas. , 1999, Neuroreport.
[17] C. Frith,et al. Orbitofrontal cortex is activated during breaches of expectation in tasks of visual attention , 1999, Nature Neuroscience.
[18] H. Duvernoy. The Human Brain , 1999, Springer Vienna.
[19] G. Schoenbaum,et al. Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning , 1998, Nature Neuroscience.
[20] J. Hollerman,et al. Reward prediction in primate basal ganglia and frontal cortex , 1998, Neuropharmacology.
[21] A. Graybiel,et al. Neurochemical architecture of the human striatum , 1997, The Journal of comparative neurology.
[22] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.
[23] W. Schultz,et al. Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.
[24] H. Duvernoy. The Human Brain Stem and Cerebellum , 1995, Springer Vienna.
[25] Ternary Structures. ON REPRESENTATION OF , 1995 .
[26] Karl J. Friston,et al. Spatial registration and normalization of images , 1995 .
[27] W. Schultz,et al. Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.
[28] Karl J. Friston,et al. Value-dependent selection in the brain: Simulation in a synthetic neural model , 1994, Neuroscience.
[29] W. Schultz,et al. Neuronal activity in monkey striatum related to the expectation of predictable environmental events. , 1992, Journal of neurophysiology.
[30] W. Schultz,et al. Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.
[31] M. Gabriel,et al. Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .
[32] Richard S. Sutton,et al. Time-Derivative Models of Pavlovian Reinforcement , 1990 .
[33] R. Oades,et al. Ventral tegmental (A10) system: neurobiology. 1. Anatomy and connectivity , 1987, Brain Research Reviews.
[34] J. Pearce,et al. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980, Psychological review.
[35] R. Rescorla,et al. A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .