A Predictive Reinforcement Model of Dopamine Neurons for Learning Approach Behavior

A neural network model of how dopamine and prefrontal cortex activity guides short- and long-term information processing within the cortico-striatal circuits during reward-related learning of approach behavior is proposed. The model predicts two types of reward-related neuronal responses generated during learning: (1) cell activity signaling errors in the prediction of the expected time of reward delivery and (2) neural activations coding for errors in the prediction of the amount and type of reward or stimulus expectancies. The former type of signal is consistent with the responses of dopaminergic neurons, while the latter signal is consistent with reward expectancy responses reported in the prefrontal cortex. It is shown that a neural network architecture that satisfies the design principles of the adaptive resonance theory of Carpenter and Grossberg (1987) can account for the dopamine responses to novelty, generalization, and discrimination of appetitive and aversive stimuli. These hypotheses are scrutinized via simulations of the model in relation to the delivery of free food outside a task, the timed contingent delivery of appetitive and aversive stimuli, and an asymmetric, instructed delay response task.

[1]  A. Hodgkin,et al.  A quantitative description of membrane current and its application to conduction and excitation in nerve , 1952, The Journal of physiology.

[2]  B. Milner Effects of Different Brain Lesions on Card Sorting: The Role of the Frontal Lobes , 1963 .

[3]  M. Cole The Frontal Granular Cortex and Behavior. , 1964 .

[4]  H. Künzle An autoradiographic analysis of the efferent connections from premotor and adjacent prefrontal regions (areas 6 and 9) in macaca fascicularis. , 1978, Brain, behavior and evolution.

[5]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[6]  T. Ono,et al.  Single unit activity in monkey caudate nucleus during operant bar pressing feeding behavior , 1981, Neuroscience Letters.

[7]  P. Goldman-Rakic,et al.  Brainstem innervation of prefrontal and anterior cingulate cortex in the rhesus monkey revealed by retrograde transport of HRP , 1982, The Journal of comparative neurology.

[8]  L. Heimer,et al.  The ventral pallidal projection to the mediodorsal thalamus: a study with fluorescent retrograde tracers and immunohistofluorescence , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[9]  H. Kornhuber,et al.  The cortico-nigral projection: reduced glutamate content in the substantia nigra following frontal cortex ablation in the rat , 1984, Brain Research.

[10]  A. Grace,et al.  Opposing effects of striatonigral feedback pathways on midbrain dopamine cell activity , 1985, Brain Research.

[11]  P. Goldman-Rakic,et al.  The primate mediodorsal (MD) nucleus and its projection to the frontal lobe , 1985, The Journal of comparative neurology.

[12]  D. Amaral,et al.  The amygdalostriatal projections in the monkey. An anterograde tracing study , 1985, Brain Research.

[13]  G. Graveland,et al.  The frequency and distribution of medium-sized neurons with indented nuclei in the primate and rodent neostriatum , 1985, Brain Research.

[14]  G. E. Alexander,et al.  Parallel organization of functionally segregated circuits linking basal ganglia and cortex. , 1986, Annual review of neuroscience.

[15]  W. Schultz Activity of pars reticulata neurons of monkey substantia nigra in relation to motor, sensory, and complex events. , 1986, Journal of neurophysiology.

[16]  A. Graybiel,et al.  Subdivisions of the dopamine-containing A8-A9-A10 complex identified by their differential mesostriatal innervation of striosomes and extrastriosomal matrix , 1987, Neuroscience.

[17]  S. Grossberg,et al.  ART 2: self-organization of stable category recognition codes for analog input patterns. , 1987, Applied optics.

[18]  C. Gerfen,et al.  The neostriatal mosaic: II. Patch- and matrix-directed mesostriatal dopaminergic and non-dopaminergic systems , 1987, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[19]  T. Robbins,et al.  A comparative study of visuospatial memory and learning in Alzheimer-type dementia and Parkinson's disease. , 1988, Brain : a journal of neurology.

[20]  P. Groves,et al.  Burst firing induced in midbrain dopamine neurons by stimulation of the medial prefrontal and anterior cingulate cortices , 1988, Brain Research.

[21]  H. Groenewegen Organization of the afferent connections of the mediodorsal thalamic nucleus in the rat, related to the mediodorsal-prefrontal topography , 1988, Neuroscience.

[22]  G. Rebec,et al.  Reciprocal zones of excitation and inhibition in the neostriatum , 1988, Synapse.

[23]  C. Marsden,et al.  'Frontal' cognitive function in patients with Parkinson's disease 'on' and 'off' levodopa. , 1988, Brain : a journal of neurology.

[24]  Daniel S. Levine,et al.  Modeling some effects of frontal lobe damage--Novelty and perseveration , 1989, Neural Networks.

[25]  C. Marsden,et al.  The performance on learning tasks of patients in the early stages of Parkinson's disease , 1989, Neuropsychologia.

[26]  C. Gerfen The neostriatal mosaic: striatal patch-matrix organization is related to cortical lamination. , 1989, Science.

[27]  N. Birbaumer,et al.  Slow cortical potentials in Parkinsonian patients during the course of an associative learning task , 1990 .

[28]  S. Haber,et al.  Topographic organization of the ventral striatal efferent projections in the rhesus monkey: An anterograde tracing study , 1990, The Journal of comparative neurology.

[29]  W. Schultz,et al.  Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-initiated arm movements. , 1990, Journal of neurophysiology.

[30]  W. Schultz,et al.  Dopamine neurons of the monkey midbrain: contingencies of responses to stimuli eliciting immediate behavioral reactions. , 1990, Journal of neurophysiology.

[31]  A. Graybiel Neurotransmitters and neuromodulators in the basal ganglia , 1990, Trends in Neurosciences.

[32]  C. Marsden,et al.  Cognitive function in Parkinson's disease: From description to theory , 1990, Trends in Neurosciences.

[33]  Stephen Grossberg,et al.  ART 3: Hierarchical search using chemical transmitters in self-organizing pattern recognition architectures , 1990, Neural Networks.

[34]  J. Wickens,et al.  Two dynamic modes of striatal function under dopaminergic‐cholinergic control: Simulation and analysis of a model , 1991, Synapse.

[35]  S. Grossberg,et al.  A neural network model of adaptively timed reinforcement learning and hippocampal dynamics. , 1992, Brain research. Cognitive brain research.

[36]  W. Schultz,et al.  Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.

[37]  C. Gerfen The neostriatal mosaic: multiple levels of compartmental organization in the basal ganglia. , 1992, Annual review of neuroscience.

[38]  P. Calabresi,et al.  Long-term synaptic depression in the striatum: physiological and pharmacological characterization , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[39]  M. Jahanshahi,et al.  Time estimation and reproduction is abnormal in Parkinson's disease. , 1992, Brain : a journal of neurology.

[40]  W. Schultz,et al.  Neuronal activity in monkey ventral striatum related to the expectation of reward , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[41]  W. Schultz,et al.  Neuronal activity in monkey striatum related to the expectation of predictable environmental events. , 1992, Journal of neurophysiology.

[42]  J. Kaas,et al.  Topography and collateralization of the dopaminergic projections to motor and lateral prefrontal cortex in owl monkeys , 1992, The Journal of comparative neurology.

[43]  P. Strick,et al.  Multiple output channels in the basal ganglia. , 1993, Science.

[44]  M. Mauk,et al.  Cerebellar cortex lesions disrupt learning-dependent timing of conditioned eyelid responses , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[45]  F. Gonon,et al.  Prefrontal cortex regulates burst firing and transmitter release in rat mesolimbic dopamine neurons studied in vivo , 1993, Neuroscience Letters.

[46]  W. Schultz,et al.  Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[47]  M. Fabre-Thorpe,et al.  Interaction of the Amygdala with the Frontal Lobe in Reward Memory , 1993, The European journal of neuroscience.

[48]  D. Robinson,et al.  Dopamine dependent reaction time deficits in patients with parkinson's disease are task specific , 1993, Neuropsychologia.

[49]  Joel L. Davis,et al.  A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .

[50]  W. Schultz,et al.  Importance of unpredictability for reward responses in primate dopamine neurons. , 1994, Journal of neurophysiology.

[51]  Charles J. Wilson,et al.  Surround inhibition among projection neurons is weak or nonexistent in the rat neostriatum. , 1994, Journal of neurophysiology.

[52]  Dean V. Buonomano,et al.  Neural Network Model of the Cerebellum: Temporal Discrimination and the Timing of Motor Responses , 1999, Neural Computation.

[53]  M. L. Pucak,et al.  Regulation of substantia nigra dopamine neurons. , 1994, Critical reviews in neurobiology.

[54]  A. Graybiel,et al.  Highly restricted origin of prefrontal cortical inputs to striosomes in the macaque monkey , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[55]  M. Delong,et al.  Parkinson's Disease and the Basal Ganglia: Lessons from the Laboratory and from Neurosurgery , 1995 .

[56]  A. Parent,et al.  Functional anatomy of the basal ganglia. I. The cortico-basal ganglia-thalamo-cortical loop , 1995, Brain Research Reviews.

[57]  A. Canavan,et al.  Associative learning in degenerative neostriatal disorders: Contrasts in explicit and implicit remembering between Parkinson's and huntington's diseases , 1995, Movement disorders : official journal of the Movement Disorder Society.

[58]  M. Jüptner,et al.  Localization of a cerebellar timing process using PET , 1995, Neurology.

[59]  P. Strick,et al.  Basal Ganglia ‘Loops’ with the Cerebral Cortex , 1995 .

[60]  W. Schultz,et al.  Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[61]  A. Charara,et al.  Synaptic innervation of midbrain dopaminergic neurons by glutamate‐enriched terminals in the squirrel monkey , 1996, The Journal of comparative neurology.

[62]  J. Wickens,et al.  Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex In vitro , 1996, Neuroscience.

[63]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[64]  Jennifer A. Mangels,et al.  A Neostriatal Habit Learning System in Humans , 1996, Science.

[65]  B. Richmond,et al.  Neural signals in the monkey ventral striatum related to motivation for juice and cocaine rewards. , 1996, Journal of neurophysiology.

[66]  R. Ivry The representation of temporal information in perception and motor control , 1996, Current Opinion in Neurobiology.

[67]  W. Meck Neuropharmacology of timing and time perception. , 1996, Brain research. Cognitive brain research.

[68]  R. Guillery,et al.  Functional organization of thalamocortical relays. , 1996, Journal of neurophysiology.

[69]  P. Overton,et al.  Stimulation of the prefrontal cortex in the rat induces patterns of activity in midbrain dopaminergic neurons which resemble natural burst events , 1996, Synapse.

[70]  S. Grossberg,et al.  Metabotropic Glutamate Receptor Activation in Cerebellar Purkinje Cells as Substrate for Adaptive Timing of the Classically Conditioned Eye-Blink Response , 1996, The Journal of Neuroscience.

[71]  Masataka Watanabe Reward expectancy in primate prefrental neurons , 1996, Nature.

[72]  D. Plenz,et al.  Neural dynamics in cortex-striatum co-cultures—II. Spatiotemporal characteristics of neuronal activity , 1996, Neuroscience.

[73]  Peter C. M. Molenaar,et al.  Numerical bifurcation analysis of distance-dependent on-center off-surround shunting neural networks , 1996, Biological Cybernetics.

[74]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[75]  Gail A. Carpenter,et al.  Distributed Learning, Recognition, and Prediction by ART and ARTMAP Neural Networks , 1997, Neural Networks.

[76]  T. Aird Functional Anatomy of the Basal Ganglia , 2000, The Journal of neuroscience nursing : journal of the American Association of Neuroscience Nurses.

[77]  José Luis Contreras-Vidal,et al.  A neural model of basal ganglia-thalamocortical relations in normal and parkinsonian movement , 1995, Biological Cybernetics.

[78]  Rolf Kötter,et al.  Interactions of glutamate and dopamine in a computational model of the striatum , 1995, Journal of Computational Neuroscience.

[79]  M. Watanabe Prefrontal unit activity during associative learning in the monkey , 2004, Experimental Brain Research.

[80]  W. Schultz,et al.  Responses to reward in monkey dorsal and ventral striatum , 2004, Experimental Brain Research.

[81]  A. Graybiel,et al.  Evidence that histochemically distinct zones of the primate substantia nigra pars compacta are related to patterned distributions of nigrostriatal projection neurons and striatonigral fibers , 2004, Experimental Brain Research.

[82]  R. Church,et al.  The differential effects of haloperidol and methamphetamine on time estimation in the rat , 2004, Psychopharmacology.

[83]  Houeto Jean-Luc [Parkinson's disease]. , 2022, La Revue du praticien.

[84]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.