Neural mechanisms of observational learning

Individuals can learn by interacting with the environment and experiencing a difference between predicted and obtained outcomes (prediction error). However, many species also learn by observing the actions and outcomes of others. In contrast to individual learning, observational learning cannot be based on directly experienced outcome prediction errors. Accordingly, the behavioral and neural mechanisms of learning through observation remain elusive. Here we propose that human observational learning can be explained by two previously uncharacterized forms of prediction error, observational action prediction errors (the actual minus the predicted choice of others) and observational outcome prediction errors (the actual minus predicted outcome received by others). In a functional MRI experiment, we found that brain activity in the dorsolateral prefrontal cortex and the ventromedial prefrontal cortex respectively corresponded to these two distinct observational learning signals.

[1]  J. Pearce,et al.  A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. , 1980 .

[2]  A G Barto,et al.  Toward a modern theory of adaptive networks: expectation and prediction. , 1981, Psychological review.

[3]  N. Mackintosh,et al.  Conditioning And Associative Learning , 1983 .

[4]  S. Mineka,et al.  Observational conditioning of snake fear in rhesus monkeys. , 1984, Journal of abnormal psychology.

[5]  Bernard Widrow,et al.  Adaptive switching circuits , 1988 .

[6]  A G Barto,et al.  Prediction of complex two-dimensional trajectories by a cerebellar model of smooth pursuit eye movement. , 1997, Journal of neurophysiology.

[7]  B. Balleine,et al.  Goal-directed instrumental action: contingency and incentive learning and their cortical substrates , 1998, Neuropharmacology.

[8]  Nick Feltovich,et al.  Reinforcement-based vs. Belief-based Learning Models in Experimental Asymmetric-information Games , 2000 .

[9]  A. Dickinson,et al.  Neuronal coding of prediction errors. , 2000, Annual review of neuroscience.

[10]  C. Frith,et al.  The role of dorsolateral prefrontal cortex in the selection of action as revealed by functional imaging , 2000 .

[11]  E. Crone,et al.  Dissociation of response conflict, attentional selection, and expectancy with functional magnetic resonance imaging. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R. Passingham,et al.  The prefrontal cortex: response selection or maintenance within working memory? , 2000, 5th IEEE EMBS International Summer School on Biomedical Imaging, 2002..

[13]  J. M. Anderson,et al.  Responses of human frontal cortex to surprising events are predicted by formal associative learning theory , 2001, Nature Neuroscience.

[14]  G. Glover,et al.  Error‐related brain activation during a Go/NoGo response inhibition task , 2001, Human brain mapping.

[15]  Sham M. Kakade,et al.  Opponent interactions between serotonin and dopamine , 2002, Neural Networks.

[16]  P. Montague,et al.  Activity in human ventral striatum locked to errors of reward prediction , 2002, Nature Neuroscience.

[17]  C. Chamley Rational Herds: Economic Models of Social Learning , 2003 .

[18]  D. Perrett,et al.  Beauty in a smile: the role of medial orbitofrontal cortex in facial attractiveness , 2003, Neuropsychologia.

[19]  Karl J. Friston,et al.  Dissociable Roles of Ventral and Dorsal Striatum in Instrumental Conditioning , 2004, Science.

[20]  K. Laland Social learning strategies , 2004, Learning & behavior.

[21]  Andreas Olsson,et al.  Learned Fear of “Unseen” Faces after Pavlovian, Observational, and Instructed Fear , 2004, Psychological science.

[22]  J. O'Doherty,et al.  Reward representations and reward-related learning in the human brain: insights from neuroimaging , 2004, Current Opinion in Neurobiology.

[23]  H. Terrace,et al.  Cognitive Imitation in Rhesus Macaques , 2004, Science.

[24]  Michael J. Frank,et al.  Error-Related Negativity Predicts Reinforcement Learning and Conflict Biases , 2005, Neuron.

[25]  M. Delgado,et al.  Perceptions of moral character modulate the neural systems of reward during the trust game , 2005, Nature Neuroscience.

[26]  Karl J. Friston,et al.  A free energy principle for the brain , 2006, Journal of Physiology-Paris.

[27]  R. Poldrack,et al.  Ventral–striatal/nucleus–accumbens sensitivity to prediction errors during classification learning , 2006, Human brain mapping.

[28]  J. O'Doherty,et al.  Model‐Based fMRI and Its Application to Reward Learning and Decision Making , 2007, Annals of the New York Academy of Sciences.

[29]  K. Fliessbach,et al.  Social Comparison Affects Reward-Related Brain Activity in the Human Ventral Striatum , 2007, Science.

[30]  J. O'Doherty,et al.  Neural coding of reward-prediction error signals during classical conditioning with attractive faces. , 2007, Journal of neurophysiology.

[31]  Caroline Catmur,et al.  Sensorimotor Learning Configures the Human Mirror System , 2007, Current Biology.

[32]  M. Bar The proactive brain: using analogies and associations to generate predictions , 2007, Trends in Cognitive Sciences.

[33]  Kevin McCabe,et al.  Neural signature of fictive learning signals in a sequential investment task , 2007, Proceedings of the National Academy of Sciences.

[34]  Peter Bossaerts,et al.  Neural correlates of mentalizing-related computations during strategic interactions in humans , 2008, Proceedings of the National Academy of Sciences.

[35]  Mark W Woolrich,et al.  Associative learning of social value , 2008, Nature.

[36]  C. Summerfield,et al.  A Neural Representation of Prior Information during Perceptual Inference , 2008, Neuron.

[37]  John M. Pearson,et al.  Fictive Reward Signals in the Anterior Cingulate Cortex , 2009, Science.

[38]  Matthew F S Rushworth,et al.  The Computation of Social Behavior , 2009, Science.

[39]  Luca Passamonti,et al.  A Key Role for Similarity in Vicarious Reward , 2009, Science.

[40]  M. Rushworth,et al.  General Mechanisms for Making Decisions? This Review Comes from a Themed Issue on Cognitive Neuroscience Edited the Representation of Value and Reward Expectations in Frontal Cortex Reward Prediction Errors and Learning Rates Other Types of Prediction Error , 2022 .

[41]  Karl J. Friston,et al.  A Dual Role for Prediction Error in Associative Learning , 2008, Cerebral cortex.