Temporal difference models describe higher-order learning in humans

The ability to use environmental stimuli to predict impending harm is critical for survival. Such predictions should be available as early as they are reliable. In pavlovian conditioning, chains of successively earlier predictors are studied in terms of higher-order relationships, and have inspired computational theories such as temporal difference learning. However, there is at present no adequate neurobiological account of how this learning occurs. Here, in a functional magnetic resonance imaging (fMRI) study of higher-order aversive conditioning, we describe a key computational strategy that humans use to learn predictions about pain. We show that neural activity in the ventral striatum and the anterior insula displays a marked correspondence to the signals for sequential learning predicted by temporal difference models. This result reveals a flexible aversive learning process ideally suited to the changing and uncertain nature of real-world environments. Taken with existing data on reward learning, our results suggest a critical role for the ventral striatum in integrating complex appetitive and aversive predictions to coordinate behaviour.

[1]  R. Solomon,et al.  An opponent-process theory of motivation. I. Temporal dynamics of affect. , 1974, Psychological review.

[2]  E. Azmitia,et al.  An autoradiographic analysis of the differential ascending projections of the dorsal and median raphe nuclei in the rat , 1978, The Journal of comparative neurology.

[3]  R. Solomon,et al.  An Opponent-Process Theory of Motivation , 1978 .

[4]  A. Dickinson Contemporary Animal Learning Theory , 1981 .

[5]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[6]  Richard S. Sutton,et al.  Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[7]  M. Gabriel,et al.  Learning and Computational Neuroscience: Foundations of Adaptive Networks , 1990 .

[8]  Karl J. Friston,et al.  Value-dependent selection in the brain: Simulation in a synthetic neural model , 1994, Neuroscience.

[9]  E. Chudler,et al.  The role of the basal ganglia in nociception and pain , 1995, Pain.

[10]  Joel L. Davis,et al.  In : Models of Information Processing in the Basal Ganglia , 2008 .

[11]  W. Schultz,et al.  Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli , 1996, Nature.

[12]  P. Dayan,et al.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[13]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[14]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[15]  G. Wagner,et al.  A POPULATION GENETIC THEORY OF CANALIZATION , 1997, Evolution; international journal of organic evolution.

[16]  W. J. Dickinson,et al.  Marginal fitness contributions of nonessential genes in yeast. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Joseph E LeDoux Fear and the brain: where have we been, and where are we going? , 1998, Biological Psychiatry.

[18]  Nakao,et al.  Genome-scale Gene Expression Analysis and Pathway Reconstruction in KEGG. , 1999, Genome informatics. Workshop on Genome Informatics.

[19]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[20]  O. White,et al.  Global transposon mutagenesis and a minimal Mycoplasma genome. , 1999, Science.

[21]  K. H. Wolfe,et al.  Yeast genome evolution in the post-genome era. , 1999, Current opinion in microbiology.

[22]  Karl J. Friston,et al.  Amygdala–Hippocampal Involvement in Human Aversive Trace Conditioning Revealed through Event-Related Functional Magnetic Resonance Imaging , 1999, The Journal of Neuroscience.

[23]  T. Robbins,et al.  Associative Processes in Addiction and Reward The Role of Amygdala‐Ventral Striatal Subsystems , 1999, Annals of the New York Academy of Sciences.

[24]  Ravi S. Menon,et al.  Dissociating pain from its anticipation in the human brain. , 1999, Science.

[25]  P. Matthews,et al.  Learning about pain: the neural substrate of the prediction error for aversive events. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[26]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R. Dolan,et al.  Classical fear conditioning in functional neuroimaging , 2000, Current Opinion in Neurobiology.

[28]  J. Horvitz Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events , 2000, Neuroscience.

[29]  B. Palsson,et al.  Combining pathway analysis with flux balance analysis for the comprehensive study of metabolic systems. , 2000, Biotechnology and bioengineering.

[30]  A. Wagner Robustness against mutations in genetic networks of yeast , 2000, Nature Genetics.

[31]  Roland E. Suri,et al.  Temporal Difference Model Reproduces Anticipatory Neural Activity , 2001, Neural Computation.

[32]  T. Kitami,et al.  Biochemical networking contributes more to genetic buffering in human and mouse metabolic pathways than does gene duplication , 2002, Nature Genetics.

[33]  U. Sauer,et al.  Metabolic Flux Responses to Pyruvate Kinase Knockout in Escherichia coli , 2002, Journal of bacteriology.

[34]  Sham M. Kakade,et al.  Opponent interactions between serotonin and dopamine , 2002, Neural Networks.

[35]  Ronald W. Davis,et al.  Systematic screen for human disease genes in yeast , 2002, Nature Genetics.

[36]  J. Horvitz Dopamine gating of glutamatergic sensorimotor and incentive motivational input signals to the striatum , 2002, Behavioural Brain Research.

[37]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[38]  S. Schuster,et al.  Metabolic network structure determines key aspects of functionality and regulation , 2002, Nature.

[39]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[40]  R Turner,et al.  Optimized EPI for fMRI studies of the orbitofrontal cortex , 2003, NeuroImage.

[41]  Y. Dong,et al.  Systematic functional analysis of the Caenorhabditis elegans genome using RNAi , 2003, Nature.

[42]  B. Palsson,et al.  Large-scale evaluation of in silico gene deletions in Saccharomyces cerevisiae. , 2003, Omics : a journal of integrative biology.

[43]  Ronald W. Davis,et al.  Role of duplicate genes in genetic robustness against null mutations , 2003, Nature.

[44]  X. Gu Evolution of duplicate genes versus genetic robustness against null mutations. , 2003, Trends in genetics : TIG.

[45]  J. W. Campbell,et al.  Experimental Determination and System Level Analysis of Essential Genes in Escherichia coli MG1655 , 2003, Journal of bacteriology.

[46]  S. Ehrlich,et al.  Essential Bacillus subtilis genes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Samuel M. McClure,et al.  Temporal Prediction Errors in a Passive Learning Task Activate Human Striatum , 2003, Neuron.

[48]  Edgar H Vogel,et al.  Stimulus representation in SOP: I Theoretical rationalization and some implications , 2003, Behavioural Processes.

[49]  A. E. Hirsh,et al.  Genomic function (communication arising): Rate of evolution and gene dispensability , 2003, Nature.

[50]  Karl J. Friston,et al.  Temporal Difference Models and Reward-Related Learning in the Human Brain , 2003, Neuron.

[51]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[52]  B. Palsson,et al.  Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. , 2003, Genome research.

[53]  B. Palsson,et al.  Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[54]  L. Becerra,et al.  Neural circuitry underlying pain modulation: expectation, hypnosis, placebo , 2003, Trends in Cognitive Sciences.

[55]  Jason A. Papin,et al.  Metabolic pathways in the post-genome era. , 2003, Trends in biochemical sciences.

[56]  Michael K. Gilson,et al.  ASAP, a systematic annotation package for community analysis of genomes , 2003, Nucleic Acids Res..

[57]  J. Pronk,et al.  Role of Transcriptional Regulation in Controlling Fluxes in Central Carbon Metabolism of Saccharomyces cerevisiae , 2004, Journal of Biological Chemistry.

[58]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.