Dopamine, uncertainty and TD learning
暂无分享,去创建一个
[1] J. Pearce,et al. A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980 .
[2] Richard S. Sutton,et al. Learning and Sequential Decision Making , 1989 .
[3] M. Gabriel,et al. Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .
[4] W. Schultz,et al. Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.
[5] W. Schultz,et al. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[6] J. Wickens,et al. Cellular models of reinforcement. , 1995 .
[7] P. Dayan,et al. A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.
[8] Peter Dayan,et al. A Neural Substrate of Prediction and Reward , 1997, Science.
[9] J. Hollerman,et al. Dopamine neurons report an error in the temporal prediction of reward during learning , 1998, Nature Neuroscience.
[10] W. Schultz,et al. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.
[11] S. Kakade,et al. Learning and selective attention , 2000, Nature Neuroscience.
[12] C. Gallistel,et al. Time, rate, and conditioning. , 2000, Psychological review.
[13] Peter Dayan,et al. Expected and Unexpected Uncertainty: ACh and NE in the Neocortex , 2002, NIPS.
[14] Sham M. Kakade,et al. Opponent interactions between serotonin and dopamine , 2002, Neural Networks.
[15] W. Schultz,et al. Coding of Predicted Reward Omission by Dopamine Neurons in a Conditioned Inhibition Paradigm , 2003, The Journal of Neuroscience.
[16] Karl J. Friston,et al. Temporal Difference Models and Reward-Related Learning in the Human Brain , 2003, Neuron.
[17] W. Schultz,et al. Discrete Coding of Reward Probability and Uncertainty by Dopamine Neurons , 2003, Science.
[18] Jonathan D. Cohen,et al. Computational roles for dopamine in behavioural control , 2004, Nature.
[19] O. Hikosaka,et al. Dopamine Neurons Can Represent Context-Dependent Prediction Error , 2004, Neuron.
[20] O. Hikosaka,et al. A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping. , 2004, Journal of neurophysiology.
[21] Peter Dayan,et al. Temporal difference models describe higher-order learning in humans , 2004, Nature.
[22] E. Vaadia,et al. Coincident but Distinct Messages of Midbrain Dopamine and Striatal Tonically Active Neurons , 2004, Neuron.
[23] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.
[24] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.