A Neural Substrate of Prediction and Reward

The capacity to predict future events permits a creature to detect, model, and manipulate the causal structure of its interactions with its environment. Behavioral experiments suggest that learning is driven by changes in the expectations about future salient events such as rewards and punishments. Physiological work has recently complemented these studies by identifying dopaminergic neurons in the primate whose fluctuating output apparently signals changes or errors in the predictions of future salient and rewarding events. Taken together, these findings can be understood through quantitative theories of adaptive optimizing control.

[1]  M. Konishi The role of auditory feedback in the control of vocalization in the white-crowned sparrow. , 1965, Zeitschrift fur Tierpsychologie.

[2]  E. Fischer Conditioned Reflexes , 1942, American journal of physical medicine.

[3]  B. Campbell,et al.  Punishment and aversive behavior , 1969 .

[4]  R. Solomon,et al.  An opponent-process theory of motivation. I. Temporal dynamics of affect. , 1974, Psychological review.

[5]  N. Mackintosh A Theory of Attention: Variations in the Associability of Stimuli with Reinforcement , 1975 .

[6]  A. Phillips,et al.  Effects of amphetamine isomers and neuroleptics on self-stimulation from the nucleus accumbens and dorsal nor-adrenergenic bundle , 1975, Brain Research.

[7]  F. Mora,et al.  Brain self-stimulation: direct evidence for the involvement of dopamine in the prefrontal cortex. , 1977, Science.

[8]  R. Wise,et al.  Intracranial self-stimulation in relation to the ascending dopaminergic systems of the midbrain: A moveable electrode mapping study , 1980, Brain Research.

[9]  A. Dickinson Contemporary Animal Learning Theory , 1981 .

[10]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[11]  F. Nottebohm,et al.  Connections of vocal control nuclei in the canary telencephalon , 1982, The Journal of comparative neurology.

[12]  R. Wise Neuroleptics and operant behavior: The anhedonia hypothesis , 1982, Behavioral and Brain Sciences.

[13]  R. Beninger,et al.  Pimozide blocks establishment but not expression of amphetamine-produced environment-specific conditioning. , 1983, Science.

[14]  R. Beninger The role of dopamine in locomotor activity and learning , 1983, Brain Research Reviews.

[15]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[16]  A. Arnold,et al.  Forebrain lesions disrupt development but not maintenance of song in passerine birds. , 1984, Science.

[17]  T. F. Freund,et al.  Tyrosine hydroxylase-immunoreactive boutons in synaptic contact with identified striatonigral neurons, with particular reference to dendritic spines , 1984, Neuroscience.

[18]  W. Schultz Responses of midbrain dopamine neurons to behavioral trigger stimuli in the monkey. , 1986, Journal of neurophysiology.

[19]  R. Wise,et al.  Brain dopamine and reward. , 1989, Annual review of psychology.

[20]  S. Cooper,et al.  The Neuropharmacological basis of reward , 1989 .

[21]  Stephen Grossberg,et al.  Neural dynamics of adaptive timing and temporal discrimination during associative learning , 1989, Neural Networks.

[22]  R. Beninger Dissociating the effects of altered dopaminergic function on performance and learning , 1989, Brain Research Bulletin.

[23]  S. Klein,et al.  Instrumental conditioning theory and the impact of biological constraints on learning , 2014 .

[24]  P. Goldman-Rakic,et al.  Dopamine synaptic complex with pyramidal neurons in primate cerebral cortex. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[26]  W. Schultz,et al.  Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. , 1990, Journal of neurophysiology.

[27]  Book Review: Contemporary Learning Theories: Instrumental Conditioning Theory and the Impact of Biological Constraints on Learning , 1990 .

[28]  F. Nottebohm,et al.  A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[29]  M. Le Moal,et al.  Mesocorticolimbic dopaminergic network: functional and regulatory roles. , 1991, Physiological reviews.

[30]  R. Mooney,et al.  Two distinct inputs to an avian song nucleus activate different glutamate receptor subtypes on individual neurons. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[31]  永福 智志 The Organization of Learning , 2005, Journal of Cognitive Neuroscience.

[32]  P. Goldman-Rakic,et al.  D1 dopamine receptors in prefrontal cortex: involvement in working memory , 1991, Science.

[33]  A. Arnold,et al.  The development of afferent projections to the robust archistriatal nucleus in male zebra finches: a quantitative electron microscopic study , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[34]  T. Robbins,et al.  Functions of dopamine in the dorsal and ventral striatum , 1992 .

[35]  S. Grossberg,et al.  A neural network model of adaptively timed reinforcement learning and hippocampal dynamics. , 1992, Brain research. Cognitive brain research.

[36]  R. Wise,et al.  Localization of drug reward mechanisms by intracranial injections , 1992, Synapse.

[37]  G. Koob Dopamine, addiction and reward , 1992 .

[38]  M. Mauk,et al.  Cerebellar cortex lesions disrupt learning-dependent timing of conditioned eyelid responses , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[39]  W. Schultz,et al.  Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[40]  C. Cepeda,et al.  Neuromodulatory actions of dopamine in the neostriatum are dependent upon the excitatory amino acid receptor subtypes activated. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[41]  A. Parent,et al.  Synaptic relationships between dopaminergic afferents and cortical or thalamic input in the sensorimotor territory of the striatum in monkey , 1994, The Journal of comparative neurology.

[42]  Michael I. Jordan,et al.  MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .

[43]  M. J. Friedlander,et al.  Role of NO production in NMDA receptor-mediated neurotransmitter release in cerebral cortex. , 1994, Science.

[44]  Terrence J. Sejnowski,et al.  A Novel Reinforcement Model of Birdsong Vocalization Learning , 1994, NIPS.

[45]  W. Schultz,et al.  Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.

[46]  T. Sejnowski,et al.  The predictive brain: temporal coincidence and temporal order in synaptic learning mechanisms. , 1994, Learning & memory.

[47]  E.C.L. Vu,et al.  Identification of a forebrain motor programming network for the learned song of zebra finches , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[48]  G. Chiara The role of dopamine in drug abuse viewed from the perspective of its role in motivation , 1995 .

[49]  A. Graybiel Building action repertoires: memory and learning functions of the basal ganglia , 1995, Current Opinion in Neurobiology.

[50]  P. Goldman-Rakic,et al.  Modulation of memory fields by dopamine Dl receptors in prefrontal cortex , 1995, Nature.

[51]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[52]  Karl J. Friston,et al.  Dopaminergic modulation of impaired cognitive activation in the anterior cingulate cortex in schizophrenia , 1995, Nature.

[53]  A. Graybiel The basal ganglia , 1995, Trends in Neurosciences.

[54]  Joel L. Davis,et al.  Adaptive Critics and the Basal Ganglia , 1995 .

[55]  Peter Dayan,et al.  Bee foraging in uncertain environments using predictive hebbian learning , 1995, Nature.

[56]  W. Schultz,et al.  Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[57]  J. Wickens,et al.  Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex In vitro , 1996, Neuroscience.

[58]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[59]  A. Damasio,et al.  Neurobiology of Decision-Making , 2012, Research and Perspectives in Neurosciences.

[60]  A. C. Yu,et al.  Temporal Hierarchical Control of Singing in Birds , 1996, Science.

[61]  S. Lisberger,et al.  The Cerebellum: A Neuronal Learning Machine? , 1996, Science.

[62]  W. Newsome,et al.  The Variable Discharge of Cortical Neurons: Implications for Connectivity, Computation, and Information Coding , 1998, The Journal of Neuroscience.

[63]  A. Doupe,et al.  Social context modulates singing-related neural activity in the songbird forebrain , 1999, Nature Neuroscience.

[64]  R. Mooney,et al.  Lesions of an Avian Forebrain Nucleus That Disrupt Song Development Alter Synaptic Connectivity and Transmission in the Vocal Premotor Pathway , 1999, The Journal of Neuroscience.

[65]  W. Bechtel,et al.  A companion to cognitive science , 1999 .

[66]  A. Doupe,et al.  Singing-Related Neural Activity in a Dorsal Forebrain–Basal Ganglia Circuit of Adult Zebra Finches , 1999, The Journal of Neuroscience.

[67]  D. Perkel,et al.  Two-Stage, Input-Specific Synaptic Maturation in a Nucleus Essential for Vocal Production in the Zebra Finch , 1999, The Journal of Neuroscience.

[68]  C. Ghez,et al.  Pharmacological inactivation in the analysis of the central control of movement , 1999, Journal of Neuroscience Methods.

[69]  R. Morris,et al.  Delay‐dependent impairment of a matching‐to‐place task with chronic and intrahippocampal infusion of the NMDA‐antagonist D‐AP5 , 1999, Hippocampus.