Dissociating hippocampal and striatal contributions to sequential prediction learning

Behavior may be generated on the basis of many different kinds of learned contingencies. For instance, responses could be guided by the direct association between a stimulus and response, or by sequential stimulus–stimulus relationships (as in model‐based reinforcement learning or goal‐directed actions). However, the neural architecture underlying sequential predictive learning is not well understood, in part because it is difficult to isolate its effect on choice behavior. To track such learning more directly, we examined reaction times (RTs) in a probabilistic sequential picture identification task in healthy individuals. We used computational learning models to isolate trial‐by‐trial effects of two distinct learning processes in behavior, and used these as signatures to analyse the separate neural substrates of each process. RTs were best explained via the combination of two delta rule learning processes with different learning rates. To examine neural manifestations of these learning processes, we used functional magnetic resonance imaging to seek correlates of time‐series related to expectancy or surprise. We observed such correlates in two regions, hippocampus and striatum. By estimating the learning rates best explaining each signal, we verified that they were uniquely associated with one of the two distinct processes identified behaviorally. These differential correlates suggest that complementary anticipatory functions drive each region’s effect on behavior. Our results provide novel insights as to the quantitative computational distinctions between medial temporal and basal ganglia learning networks and enable experiments that exploit trial‐by‐trial measurement of the unique contributions of both hippocampus and striatum to response behavior.

[1]  Frequency Interpretations in Probability , 1939, Nature.

[2]  H P BAHRICK,et al.  Incidental learning under two incentive conditions. , 1954, Journal of experimental psychology.

[3]  D Marr,et al.  Simple memory: a theory for archicortex. , 1971, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[4]  E. Tulving,et al.  Retrieval processes in recognition memory: Effects of associative context , 1971 .

[5]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[6]  W. F. Prokasy,et al.  Classical conditioning II: Current research and theory. , 1972 .

[7]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[8]  R. F. Thompson,et al.  Hippocampus and trace conditioning of the rabbit's classically conditioned nictitating membrane response. , 1986, Behavioral neuroscience.

[9]  M. Packard,et al.  Differential effects of fornix and caudate nucleus lesions on two radial maze tasks: evidence for multiple memory systems , 1989, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[10]  Marilyn Hartman,et al.  Explicit and implicit remembering: When is learning preserved in amnesia? , 1989, Neuropsychologia.

[11]  D. G. Davis,et al.  Memory for Reward in Probabilistic Choice: Markovian and Non-Markovian Properties , 1990 .

[12]  Alistair Sinclair,et al.  Algorithms for Random Generation and Counting: A Markov Chain Approach , 1993, Progress in Theoretical Computer Science.

[13]  M. Gluck,et al.  Hippocampal mediation of stimulus representation: A computational theory , 1993, Hippocampus.

[14]  Joel L. Davis,et al.  A Model of How the Basal Ganglia Generate and Use Neural Signals That Predict Reinforcement , 1994 .

[15]  M. Gluck,et al.  Probabilistic classification learning in amnesia. , 1994, Learning & memory.

[16]  J. Hodges Memory, Amnesia and the Hippocampal System , 1995 .

[17]  James L. McClelland,et al.  Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. , 1995, Psychological review.

[18]  Joel L. Davis,et al.  Adaptive Critics and the Basal Ganglia , 1995 .

[19]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[20]  Jane S. Paulsen,et al.  Dissociations within nondeclarative memory in Huntington's disease. , 1996 .

[21]  F. Craik,et al.  Novelty and familiarity activations in PET studies of memory encoding and retrieval. , 1996, Cerebral cortex.

[22]  Jennifer A. Mangels,et al.  A Neostriatal Habit Learning System in Humans , 1996, Science.

[23]  D. Rubin,et al.  One Hundred Years of Forgetting : A Quantitative Description of Retention , 1996 .

[24]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[25]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[26]  Tim Curran,et al.  Higher-Order Associative Learning in Amnesia: Evidence from the Serial Reaction Time Task , 1997, Journal of Cognitive Neuroscience.

[27]  R. Gerlai A new continuous alternation task in T-maze detects hippocampal dysfunction in mice A strain comparison and lesion study , 1998, Behavioural Brain Research.

[28]  Nancy Kanwisher,et al.  A cortical representation of the local visual environment , 1998, Nature.

[29]  Karl J. Friston,et al.  Nonlinear event‐related responses in fMRI , 1998, Magnetic resonance in medicine.

[30]  Karl J. Friston,et al.  Generalisability, Random Effects & Population Inference , 1998, NeuroImage.

[31]  B. Balleine,et al.  Goal-directed instrumental action: contingency and incentive learning and their cortical substrates , 1998, Neuropharmacology.

[32]  Kenji Doya,et al.  What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? , 1999, Neural Networks.

[33]  D. Rubin,et al.  The Precise Time Course of Retention , 1999 .

[34]  B. Balleine,et al.  The Role of the Hippocampus in Instrumental Conditioning , 2000, The Journal of Neuroscience.

[35]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[36]  L. Nystrom,et al.  Tracking the hemodynamic responses to reward and punishment in the striatum. , 2000, Journal of neurophysiology.

[37]  M. Gluck,et al.  Interactive memory systems in the human brain , 2001, Nature.

[38]  D. Kupfer,et al.  Amphetamine-induced dopamine release in human ventral striatum correlates with euphoria , 2001, Biological Psychiatry.

[39]  Brian Knutson,et al.  Anticipation of Increasing Monetary Reward Selectively Recruits Nucleus Accumbens , 2001, The Journal of Neuroscience.

[40]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[41]  S. Corkin What's new with the amnesic patient H.M.? , 2002, Nature Reviews Neuroscience.

[42]  B. Knowlton,et al.  Learning and memory functions of the Basal Ganglia. , 2002, Annual review of neuroscience.

[43]  H. Pashler STEVENS' HANDBOOK OF EXPERIMENTAL PSYCHOLOGY , 2002 .

[44]  G. McCarthy,et al.  Perceiving patterns in random series: dynamic processing of sequence in prefrontal cortex , 2002, Nature Neuroscience.

[45]  B. Everitt,et al.  Emotion and motivation: the role of the amygdala, ventral striatum, and prefrontal cortex , 2002, Neuroscience & Biobehavioral Reviews.

[46]  B. Balleine,et al.  The Role of Learning in the Operation of Motivational Systems , 2002 .

[47]  R. Poldrack,et al.  Competition among multiple memory systems: converging evidence from animal and human brain studies , 2003, Neuropsychologia.

[48]  Samuel M. McClure,et al.  Temporal Prediction Errors in a Passive Learning Task Activate Human Striatum , 2003, Neuron.

[49]  Karl J. Friston,et al.  Temporal Difference Models and Reward-Related Learning in the Human Brain , 2003, Neuron.

[50]  David S. Touretzky,et al.  Model Uncertainty in Classical Conditioning , 2003, NIPS.

[51]  M. Gluck,et al.  Dissociating Hippocampal versus Basal Ganglia Contributions to Learning and Transfer , 2003, Journal of Cognitive Neuroscience.

[52]  S. Keele,et al.  The cognitive and neural architecture of sequence representation. , 2003, Psychological review.

[53]  C. Stern,et al.  An fMRI Study of the Role of the Medial Temporal Lobe in Implicit and Explicit Sequence Learning , 2003, Neuron.

[54]  M. Shapiro,et al.  Prospective and Retrospective Memory Coding in the Hippocampus , 2003, Neuron.

[55]  Saori C. Tanaka,et al.  Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops , 2004, Nature Neuroscience.

[56]  Samuel M. McClure,et al.  Separate Neural Systems Value Immediate and Delayed Monetary Rewards , 2004, Science.

[57]  C. Gallistel,et al.  The learning curve: implications of a quantitative analysis. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[58]  D. Barraclough,et al.  Prefrontal cortex and decision making in a mixed-strategy game , 2004, Nature Neuroscience.

[59]  H. Eichenbaum Hippocampus Cognitive Processes and Neural Representations that Underlie Declarative Memory , 2004, Neuron.

[60]  M. Bar Visual objects in context , 2004, Nature Reviews Neuroscience.

[61]  W. T. Maddox,et al.  Dissociating explicit and procedural-learning based systems of perceptual category learning , 2004, Behavioural Processes.

[62]  R. Henson A Mini-Review of fMRI Studies of Human Medial Temporal Lobe Activity Associated with Recognition Memory , 2005, The Quarterly journal of experimental psychology. B, Comparative and physiological psychology.

[63]  K. Doya,et al.  Representation of Action-Specific Reward Values in the Striatum , 2005, Science.

[64]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[65]  Emery N Brown,et al.  Behavioral / Systems / Cognitive Functional Magnetic Resonance Imaging Activity during the Gradual Acquisition and Expression of Paired-Associate Memory , 2005 .

[66]  Raymond J. Dolan,et al.  Information theory, novelty and hippocampal responses: unpredicted or unpredictable? , 2005, Neural Networks.

[67]  A. David Redish,et al.  Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model , 2005, Neural Networks.

[68]  P. Glimcher,et al.  Midbrain Dopamine Neurons Encode a Quantitative Reward Prediction Error Signal , 2005, Neuron.

[69]  B. Balleine,et al.  The role of the dorsomedial striatum in instrumental conditioning , 2005, The European journal of neuroscience.

[70]  H. Seung,et al.  JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 581–617 NUMBER 3(NOVEMBER) LINEAR-NONLINEAR-POISSON MODELS OF PRIMATE CHOICE DYNAMICS , 2022 .

[71]  J. Lisman,et al.  The Hippocampal-VTA Loop: Controlling the Entry of Information into Long-Term Memory , 2005, Neuron.

[72]  P. Glimcher,et al.  JOURNAL OF THE EXPERIMENTAL ANALYSIS OF BEHAVIOR 2005, 84, 555–579 NUMBER 3(NOVEMBER) DYNAMIC RESPONSE-BY-RESPONSE MODELS OF MATCHING BEHAVIOR IN RHESUS MONKEYS , 2022 .

[73]  J. Gläscher,et al.  Formal Learning Theory Dissociates Brain Regions with Different Temporal Integration , 2005, Neuron.

[74]  Y. Lacasse,et al.  From the authors , 2005, European Respiratory Journal.

[75]  Caroline F. Zink,et al.  Human striatal activation reflects degree of stimulus saliency , 2006, NeuroImage.

[76]  P. Dayan,et al.  Cortical substrates for exploratory decisions in humans , 2006, Nature.

[77]  Karl J. Friston,et al.  Encoding uncertainty in the hippocampus , 2006, Neural Networks.

[78]  P. Dayan,et al.  Opinion TRENDS in Cognitive Sciences Vol.10 No.8 Full text provided by www.sciencedirect.com A normative perspective on motivation , 2022 .

[79]  J. O'Doherty,et al.  The Role of the Ventromedial Prefrontal Cortex in Abstract State-Based Inference during Decision Making in Humans , 2006, The Journal of Neuroscience.

[80]  Russell A Poldrack,et al.  Modulation of competing memory systems by distraction. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[81]  Aaron C. Courville,et al.  Bayesian theories of conditioning in a changing world , 2006, Trends in Cognitive Sciences.

[82]  J. O'Doherty,et al.  Model‐Based fMRI and Its Application to Reward Learning and Decision Making , 2007, Annals of the New York Academy of Sciences.

[83]  K. Preuschoff,et al.  Adding Prediction Risk to the Theory of Reward Learning , 2007, Annals of the New York Academy of Sciences.

[84]  N. Daw,et al.  Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making , 2007, The Journal of Neuroscience.

[85]  G. Buzsáki,et al.  Forward and reverse hippocampal place-cell sequences during ripples , 2007, Nature Neuroscience.

[86]  Vivian V. Valentin,et al.  Determining the Neural Substrates of Goal-Directed Learning in the Human Brain , 2007, The Journal of Neuroscience.

[87]  W. T. Maddox,et al.  Neural correlates of rule-based and information-integration visual category learning. , 2006, Cerebral cortex.

[88]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[89]  Adam Johnson,et al.  Neural Ensembles in CA3 Transiently Encode Paths Forward of the Animal at a Decision Point , 2007, The Journal of Neuroscience.

[90]  Kevin McCabe,et al.  Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.

[91]  Jeffrey M. Zacks,et al.  A Computational Model of Event Segmentation From Perceptual Prediction , 2007, Cogn. Sci..

[92]  D. Schacter,et al.  The cognitive neuroscience of constructive memory: remembering the past and imagining the future , 2007, Philosophical Transactions of the Royal Society B: Biological Sciences.

[93]  D. Kumaran,et al.  Match–Mismatch Processes Underlie Human Hippocampal Responses to Associative Novelty , 2007, The Journal of Neuroscience.

[94]  P. Glimcher,et al.  The neural correlates of subjective value during intertemporal choice , 2007, Nature Neuroscience.

[95]  Colin Camerer,et al.  A framework for studying the neurobiology of value-based decision making , 2008, Nature Reviews Neuroscience.

[96]  Karl J. Friston,et al.  Influence of Uncertainty and Surprise on Human Corticospinal Excitability during Preparation for Action , 2008, Current Biology.

[97]  D. Shohamy,et al.  Integrating Memories in the Human Brain: Hippocampal-Midbrain Encoding of Overlapping Events , 2008, Neuron.

[98]  Colin Camerer,et al.  Dissociating the Role of the Orbitofrontal Cortex and the Striatum in the Computation of Goal Values and Prediction Errors , 2008, The Journal of Neuroscience.

[99]  N. Daw,et al.  Striatal Activity Underlies Novelty-Based Choice in Humans , 2008, Neuron.

[100]  C. Stark,et al.  Pattern Separation in the Human Hippocampal CA3 and Dentate Gyrus , 2008, Science.

[101]  D. Heeger,et al.  A Hierarchy of Temporal Receptive Windows in Human Cortex , 2008, The Journal of Neuroscience.

[102]  Emery N. Brown,et al.  A mixed filter algorithm for cognitive state estimation from simultaneously recorded continuous and binary measures of performance , 2008, Biological Cybernetics.

[103]  Christian F. Doeller,et al.  Parallel striatal and hippocampal systems for landmarks and boundaries in spatial memory , 2008, Proceedings of the National Academy of Sciences.

[104]  Adam Johnson,et al.  Computing motivation: Incentive salience boosts of drug or appetite states , 2008, Behavioral and Brain Sciences.

[105]  Colin Camerer,et al.  Neuroeconomics: decision making and the brain , 2008 .

[106]  B. Balleine,et al.  A specific role for posterior dorsolateral striatum in human habit learning , 2009, The European journal of neuroscience.

[107]  W. K. Simmons,et al.  Circular analysis in systems neuroscience: the dangers of double dipping , 2009, Nature Neuroscience.

[108]  B. Balleine,et al.  Multiple Forms of Value Learning and the Function of Dopamine , 2009 .

[109]  M. Frank,et al.  Instructional control of reinforcement learning: A behavioral and neurocomputational investigation , 2009, Brain Research.

[110]  H. Eichenbaum,et al.  Robust Conjunctive Item–Place Coding by Hippocampal Neurons Parallels Learning What Happens Where , 2009, The Journal of Neuroscience.

[111]  Daphna Shohamy,et al.  Distinct Hippocampal and Basal Ganglia Contributions to Probabilistic Learning and Reversal , 2009, Journal of Cognitive Neuroscience.

[112]  Lila Davachi,et al.  Distinct Memory Signatures in the Hippocampus: Intentional States Distinguish Match and Mismatch Enhancement Signals , 2009, The Journal of Neuroscience.

[113]  N. Daw,et al.  Human Reinforcement Learning Subdivides Structured Action Spaces by Learning Effector-Specific Values , 2009, The Journal of Neuroscience.

[114]  Emery Brown,et al.  Trial Outcome and Associative Learning Signals in the Monkey Hippocampus , 2009, Neuron.

[115]  M. Frank,et al.  Prefrontal and striatal dopaminergic genes predict individual differences in exploration and exploitation. , 2009, Nature neuroscience.

[116]  Karl J. Friston,et al.  A Dual Role for Prediction Error in Associative Learning , 2008, Cerebral cortex.

[117]  Y. Niv,et al.  Learning latent structure: carving nature at its joints , 2010, Current Opinion in Neurobiology.

[118]  P. Dayan,et al.  States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.

[119]  Joshua I. Gold,et al.  Bayesian Online Learning of the Hazard Rate in Change-Point Problems , 2010, Neural Computation.

[120]  D. Shanks,et al.  Learning in a changing environment. , 2010, Journal of experimental psychology. General.

[121]  Makoto Ito,et al.  Evidence for Model-Based Action Planning in a Sequential Finger Movement Task , 2010, Journal of motor behavior.

[122]  John P O'Doherty,et al.  Model-based approaches to neuroimaging: combining reinforcement learning theory with fMRI data. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[123]  Nathaniel D. Daw,et al.  Selective impairment of prediction error signaling in human dorsolateral but not ventral striatum in Parkinson's disease patients: evidence from a model-based fMRI study , 2010, NeuroImage.

[124]  Robert C. Wilson,et al.  An Approximately Bayesian Delta-Rule Model Explains the Dynamics of Belief Updating in a Changing Environment , 2010, The Journal of Neuroscience.

[125]  J. Brewer,et al.  Activity in the hippocampus and neocortical working memory regions predicts successful associative memory for temporally discontiguous events , 2010, Neuropsychologia.

[126]  Karl J. Friston,et al.  Behavioral / Systems / Cognitive Striatal Prediction Error Modulates Cortical Coupling , 2010 .

[127]  Nathaniel D. Daw,et al.  Trial-by-trial data analysis using computational models , 2011 .

[128]  N. Daw,et al.  Multiplicity of control in the basal ganglia: computational roles of striatal subregions , 2011, Current Opinion in Neurobiology.

[129]  H. Seo,et al.  A reservoir of time constants for memory traces in cortical neurons , 2011, Nature Neuroscience.

[130]  Dylan A. Simon,et al.  Neural Correlates of Forward Planning in a Spatial Decision Task in Humans , 2011, The Journal of Neuroscience.

[131]  P. Dayan,et al.  Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.

[132]  Y. Niv,et al.  Ventral Striatum and Orbitofrontal Cortex Are Both Required for Model-Based, But Not Model-Free, Reinforcement Learning , 2011, The Journal of Neuroscience.

[133]  R. Schwarting,et al.  Dorsal hippocampal lesions boost performance in the rat sequential reaction time task , 2012, Hippocampus.