Using TD learning to simulate working memory performance in a model of the prefrontal cortex and basal ganglia

Delayed-response tasks (DRTs) have been used to assess working memory (WM) processes in human and nonhuman animals. Experiments have shown that the basal ganglia (BG) and dorsolateral prefrontal cortex (DLPFC) subserve DRT performance. Here, we report the results of simulation studies of a systems-level model of DRT performance. The model was trained using the temporal difference (TD) algorithm and uses an actor-critic architecture. The matrisomes of the BG represent the actor and the striosomes represent the critic. Unlike existing models, we hypothesize that the BG subserve the selection of both motor- and cognitive-related information in these tasks. We also assume that the learning of both processes is based on reward presentation. A novel feature of the model is the incorporation of delay-active neurons in the matrisomes, in addition to DLPFC. Another novel feature of the model is the subdivision of the matrisomal neurons into segregated winner-take-all (WTA) networks consisting of delay- versus transiently-active units. Our simulation model proposes a new neural mechanism to account for the occurrence of perseverative responses in WM tasks in striatal-, as well as in prefrontal damaged subjects. Simulation results also show that the model both accounts for the phenomenon of time shifting of dopamine phasic signals and the effects of partial reinforcement and reward magnitude on WM performance at both behavioral and neural levels. Our simulation results also found that the TD algorithm can subserve learning in delayed-reversal tasks.

[1]  P. Strick,et al.  Basal-ganglia 'projections' to the prefrontal cortex of the primate. , 2002, Cerebral cortex.

[2]  Richard S. Sutton,et al.  Time-Derivative Models of Pavlovian Reinforcement , 1990 .

[3]  John R Anderson,et al.  An integrated theory of the mind. , 2004, Psychological review.

[4]  W. Schultz,et al.  Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[5]  J. Changeux,et al.  A Simple Model of Prefrontal Cortex Function in Delayed-Response Tasks , 1989, Journal of Cognitive Neuroscience.

[6]  O. Hikosaka,et al.  Expectation of reward modulates cognitive signals in the basal ganglia , 1998, Nature Neuroscience.

[7]  R. O’Reilly Six principles for biologically based computational models of cortical cognition , 1998, Trends in Cognitive Sciences.

[8]  David J. Foster,et al.  A model of hippocampally dependent navigation, using the temporal difference learning rule , 2000, Hippocampus.

[9]  P. Strick,et al.  Basal Ganglia Output and Cognition: Evidence from Anatomical, Behavioral, and Clinical Studies , 2000, Brain and Cognition.

[10]  A. Lees,et al.  Cognitive deficits in the early stages of Parkinson's disease. , 1983, Brain : a journal of neurology.

[11]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[12]  E. Miller,et al.  Different time courses of learning-related activity in the prefrontal cortex and striatum , 2005, Nature.

[13]  J. Saint-Cyr,et al.  Frontal lobe dysfunction in Parkinson's disease. The cortical focus of neostriatal outflow. , 1986, Brain : a journal of neurology.

[14]  W. Precht The synaptic organization of the brain G.M. Shepherd, Oxford University Press (1975). 364 pp., £3.80 (paperback) , 1976, Neuroscience.

[15]  W. F. Prokasy,et al.  Classical conditioning II: Current research and theory. , 1972 .

[16]  V. Fung,et al.  Clonic perseveration following thalamofrontal disconnection: A distinctive movement disorder , 1997, Movement disorders : official journal of the Movement Disorder Society.

[17]  A. Amos A Computational Model of Information Processing in the Frontal Cortex and Basal Ganglia , 2000, Journal of Cognitive Neuroscience.

[18]  John N. J. Reynolds,et al.  Dopamine-dependent plasticity of corticostriatal synapses , 2002, Neural Networks.

[19]  T. Robbins,et al.  Cognitive Impairments in Early Parkinson's Disease Are Accompanied by Reductions in Activity in Frontostriatal Neural Circuitry , 2003, The Journal of Neuroscience.

[20]  M. Mishkin,et al.  Comparison of the effects of frontal and caudate lesions on delayed response and alternation in monkeys. , 1960, Journal of comparative and physiological psychology.

[21]  C. I. Connolly,et al.  Building neural representations of habits. , 1999, Science.

[22]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[23]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[24]  R. Sun,et al.  The interaction of the explicit and the implicit in skill learning: a dual-process approach. , 2005, Psychological review.

[25]  O. Hikosaka,et al.  Functional properties of monkey caudate neurons. III. Activities related to expectation of target and reward. , 1989, Journal of neurophysiology.

[26]  M. Piccirilli,et al.  Frontal lobe dysfunction in Parkinson's disease: prognostic value for dementia? , 1989, European neurology.

[27]  W. Schultz,et al.  Responses of monkey dopamine neurons during learning of behavioral reactions. , 1992, Journal of neurophysiology.

[28]  M. Gabriel,et al.  Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .

[29]  W. Schultz The Reward Signal of Midbrain Dopamine Neurons. , 1999, News in physiological sciences : an international journal of physiology produced jointly by the International Union of Physiological Sciences and the American Physiological Society.

[30]  P. Goldman-Rakic,et al.  Differential Activation of the Caudate Nucleus in Primates Performing Spatial and Nonspatial Working Memory Tasks , 1997, The Journal of Neuroscience.

[31]  Joel L. Davis,et al.  A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .

[32]  Peter Redgrave,et al.  A computational model of action selection in the basal ganglia. I. A new functional anatomy , 2001, Biological Cybernetics.

[33]  Jordan Grafman,et al.  Handbook of Neuropsychology , 1991 .

[34]  B M Gaymard,et al.  Memory guided saccade deficit after caudate nucleus lesion , 1999, Journal of neurology, neurosurgery, and psychiatry.

[35]  Joel L. Davis,et al.  Adaptive Critics and the Basal Ganglia , 1995 .

[36]  James C. Houk,et al.  Information Processing in Modular Circuits Linking Basal Ganglia and Cerebral Cortex , 1994 .

[37]  P. Redgrave,et al.  The basal ganglia: a vertebrate solution to the selection problem? , 1999, Neuroscience.

[38]  P. Goldman-Rakic Cellular basis of working memory , 1995, Neuron.

[39]  Peter Redgrave,et al.  Basal Ganglia , 2020, Encyclopedia of Autism Spectrum Disorders.

[40]  P. Goldman-Rakic,et al.  D1 receptors in prefrontal cells and circuits , 2000, Brain Research Reviews.

[41]  T. Sejnowski,et al.  How the Basal Ganglia Make Decisions , 1996 .

[42]  A. Owen Cognitive Dysfunction in Parkinson’s Disease: The Role of Frontostriatal Circuitry , 2004, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[43]  Ron Sun,et al.  From implicit skills to explicit knowledge: a bottom-up model of skill learning , 2001, Cogn. Sci..

[44]  W. Schultz,et al.  A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task , 1999, Neuroscience.

[45]  Philip T. Quinlan,et al.  The Role of Prefrontal Cortex in Perseveration: Developmental and Computational Explorations , 2004 .

[46]  Philip Lieberman,et al.  Selective speech motor, syntax and cognitive deficits associated with bilateral damage to the putamen and the head of the caudate nucleus: a case study , 1998, Neuropsychologia.

[47]  W. Schultz,et al.  Learning of sequential movements by neural network model with dopamine-like reinforcement signal , 1998, Experimental Brain Research.

[48]  G. Shepherd The Synaptic Organization of the Brain , 1979 .

[49]  P. J. Brasted,et al.  Medial prefrontal and neostriatal lesions disrupt performance in an operant delayed alternation task in rats , 1999, Behavioural Brain Research.

[50]  P. Goldman-Rakic,et al.  Comparison of human infants and rhesus monkeys on Piaget's AB task: evidence for dependence on dorsolateral prefrontal cortex , 2004, Experimental Brain Research.

[51]  W. Schultz,et al.  Neuronal activity in monkey striatum related to the expectation of predictable environmental events. , 1992, Journal of neurophysiology.

[52]  R. Kikinis,et al.  MRI study of caudate nucleus volume and its cognitive correlates in neuroleptic-naive patients with schizotypal personality disorder. , 2002, The American journal of psychiatry.

[53]  P. Goldman-Rakic,et al.  Mnemonic coding of visual space in the monkey's dorsolateral prefrontal cortex. , 1989, Journal of neurophysiology.

[54]  J. Wickens,et al.  Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex In vitro , 1996, Neuroscience.

[55]  Ian H. Witten,et al.  An Adaptive Optimal Controller for Discrete-Time Markov Environments , 1977, Inf. Control..

[56]  Michael J. Frank,et al.  Interactions between frontal cortex and basal ganglia in working memory: A computational model , 2001, Cognitive, affective & behavioral neuroscience.

[57]  Yuko Munakata,et al.  The role of prefrontal cortex in perseveration : Developmental and computational explorations , 2022 .

[58]  Michael J. Frank,et al.  Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism , 2005, Journal of Cognitive Neuroscience.

[59]  T. Sawaguchi,et al.  Prefrontal cortical representation of visuospatial working memory in monkeys examined by local inactivation with muscimol. , 2001, Journal of neurophysiology.

[60]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 1988, Machine Learning.

[61]  Michael J. Frank,et al.  Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia , 2006, Neural Computation.

[62]  Pearce Animal learning and cognition , 1997 .

[63]  Marvin Minsky,et al.  Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.

[64]  E. Sullivan,et al.  Cognitive impairment in early, untreated Parkinson's disease and its relationship to motor disability. , 1991, Brain : a journal of neurology.

[65]  J. Gabrieli Contribution of the basal ganglia to skill learning and working memory in humans , 1995 .

[66]  J. Cavanaugh,et al.  Differential Metabolic Activity in the Striosome and Matrix Compartments of the Rat Striatum during Natural Behaviors , 2002, The Journal of Neuroscience.

[67]  A. Graybiel,et al.  Highly restricted origin of prefrontal cortical inputs to striosomes in the macaque monkey , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[68]  A. Barto,et al.  Adaptive Critics and the Basal Ganglia , 1994 .

[69]  W. Schultz,et al.  Discrete Coding of Reward Probability and Uncertainty by Dopamine Neurons , 2003, Science.

[70]  Jonathan D. Cohen,et al.  On the Control of Control: The Role of Dopamine in Regulating Prefrontal Function and Working Memory , 2007 .

[71]  James L. McClelland,et al.  Connectionist models of development , 2003 .

[72]  J. Mink THE BASAL GANGLIA: FOCUSED SELECTION AND INHIBITION OF COMPETING MOTOR PROGRAMS , 1996, Progress in Neurobiology.

[73]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[74]  Garrett E. Alexander Basal ganglia , 1998 .

[75]  J. Wickens Basal ganglia: structure and computations. , 1997 .

[76]  Edward E. Smith,et al.  Temporal dynamics of brain activation during a working memory task , 1997, Nature.

[77]  R E Gross,et al.  Wisconsin Card Sorting Test performance following head injury: dorsolateral fronto-striatal circuit activity predicts perseveration. , 1999, Journal of clinical and experimental neuropsychology.

[78]  James C. Houk,et al.  Contribution of the Basal Ganglia to Skill Learning and Working Memory in Humans , 1994 .

[79]  T. Robbins,et al.  The effect of dopamine depletion from the caudate nucleus of the common marmoset (Callithrix jacchus) on tests of prefrontal cognitive function. , 2000, Behavioral neuroscience.

[80]  H. E. Rosvold,et al.  Behavioral effects of selective ablation of the caudate nucleus. , 1967, Journal of comparative and physiological psychology.

[81]  J. Houk,et al.  Model of cortical-basal ganglionic processing: encoding the serial order of sensory events. , 1998, Journal of neurophysiology.

[82]  G. E. Alexander,et al.  Microstimulation of the primate neostriatum. II. Somatotopic organization of striatal microexcitable zones and their relation to neuronal response properties. , 1985, Journal of neurophysiology.

[83]  W. Schultz,et al.  Dopamine responses comply with basic assumptions of formal learning theory , 2001, Nature.

[84]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[85]  W. Schultz,et al.  Adaptive Coding of Reward Value by Dopamine Neurons , 2005, Science.

[86]  R. Rescorla A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement , 1972 .