Neural signatures of model-free learning when avoiding harm to self and other

Moral behaviour requires learning how our actions help or harm others. Theoretical accounts of learning propose a key division between ‘model-free’ algorithms that efficiently cache outcome values in actions and ‘model-based’ algorithms that prospectively map actions to outcomes, a distinction that may be critical for moral learning. Here, we tested the engagement of these learning mechanisms and their neural basis as participants learned to avoid painful electric shocks for themselves and a stranger. We found that model-free learning was prioritized when avoiding harm to others compared to oneself. Model-free prediction errors for others relative to self were tracked in the thalamus/caudate at the time of the outcome. At the time of choice, a signature of model-free moral learning was associated with responses in subgenual anterior cingulate cortex (sgACC), and resisting this model-free influence was predicted by stronger connectivity between sgACC and dorsolateral prefrontal cortex. Finally, multiple behavioural and neural correlates of model-free moral learning varied with individual differences in moral judgment. Our findings suggest moral learning favours efficiency over flexibility and is underpinned by specific neural mechanisms.

[1]  Wouter Kool,et al.  Reasoning supports utilitarian resolutions to moral dilemmas across diverse measures. , 2020, Journal of personality and social psychology.

[2]  B. Seymour Pain: A Precision Signal for Reinforcement Learning and Control , 2019, Neuron.

[3]  Jo Cutler,et al.  A comparative fMRI meta-analysis of altruistic and strategic decisions to give , 2019, NeuroImage.

[4]  Marco K. Wittmann,et al.  Neural mechanisms for learning self and other ownership , 2018, Nature Communications.

[5]  M. Crockett,et al.  The lateral prefrontal cortex and moral goal pursuit. , 2018, Current opinion in psychology.

[6]  Marco K. Wittmann,et al.  Ventral anterior cingulate cortex and social decision-making , 2018, Neuroscience & Biobehavioral Reviews.

[7]  Sang Wan Lee,et al.  Model-based and model-free pain avoidance learning , 2018, Brain and neuroscience advances.

[8]  Matthew F S Rushworth,et al.  Neural Mechanisms of Social Cognition in Primates. , 2018, Annual review of neuroscience.

[9]  M. Crockett,et al.  Beyond Sacrificial Harm: A Two-Dimensional Model of Utilitarian Psychology , 2017, Psychological review.

[10]  R. Dolan,et al.  Neural and computational processes underlying dynamic changes in self-esteem , 2017, eLife.

[11]  P. Railton Moral Learning: Conceptual foundations and normative relevance , 2017, Cognition.

[12]  P. Railton,et al.  Moral learning: Psychological and philosophical perspectives , 2017, Cognition.

[13]  Joshua D. Greene The rat-a-gorical imperative: Moral intuition and the limits of affective learning , 2017, Cognition.

[14]  M. Husain,et al.  Prosocial apathy for helping others when effort is required , 2017, Nature Human Behaviour.

[15]  E. Koechlin,et al.  The Importance of Falsification in Computational Cognitive Modeling , 2017, Trends in Cognitive Sciences.

[16]  Jenifer Z. Siegel,et al.  Moral transgressions corrupt neural representations of value , 2017, Nature Neuroscience.

[17]  J. Decety,et al.  Interpersonal harm aversion as a necessary foundation for morality: A developmental neuroscience perspective , 2017, Development and Psychopathology.

[18]  Fiery Cushman,et al.  Morality constrains the default representation of what is possible , 2017, Proceedings of the National Academy of Sciences.

[19]  M. Husain,et al.  Neurocomputational mechanisms underlying subjective valuation of effort costs , 2017, PLoS biology.

[20]  Wolfgang M. Pauli,et al.  Learning, Reward, and Decision Making , 2017, Annual review of psychology.

[21]  Vincent D Costa,et al.  Amygdala and Ventral Striatum Make Distinct Contributions to Reinforcement Learning , 2016, Neuron.

[22]  Essi Viding,et al.  Neurocomputational mechanisms of prosocial learning and links to empathy , 2016, Proceedings of the National Academy of Sciences.

[23]  Wouter Kool,et al.  When Does Model-Based Control Pay Off? , 2016, PLoS Comput. Biol..

[24]  Karl J. Friston,et al.  Neural Signatures of Value Comparison in Human Cingulate Cortex during Decisions Requiring an Effort-Reward Trade-off , 2016, The Journal of Neuroscience.

[25]  Hans Knutsson,et al.  Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates , 2016, Proceedings of the National Academy of Sciences.

[26]  Alisabeth Ayars Can model-free reinforcement learning explain deontological moral judgments? , 2016, Cognition.

[27]  P. Dayan,et al.  Striatal structure and function predict individual biases in learning to avoid pain , 2016, Proceedings of the National Academy of Sciences.

[28]  N. Daw,et al.  Characterizing a psychiatric symptom dimension related to deficits in goal-directed control , 2016, eLife.

[29]  J. V. Bavel,et al.  The Neuroscience of Moral Cognition: From Dual Processes to Dynamic Systems , 2015 .

[30]  Jenifer Z. Siegel,et al.  Dissociable Effects of Serotonin and Dopamine on the Valuation of Harm in Moral Decision Making , 2015, Current Biology.

[31]  Simon B. Eickhoff,et al.  Functional organization of human subgenual cortical areas: Relationship between architectonical segregation and connectional heterogeneity , 2015, NeuroImage.

[32]  M. Crockett,et al.  Goal-directed, habitual and Pavlovian prosocial behavior , 2015, Front. Behav. Neurosci..

[33]  Dean Mobbs,et al.  Empathic concern drives costly altruism , 2015, NeuroImage.

[34]  Jenifer Z. Siegel,et al.  Harm to others outweighs harm to self in moral decision making , 2014, Proceedings of the National Academy of Sciences.

[35]  F. Cushman,et al.  Bad actions or bad outcomes? Differentiating affective contributions to the moral condemnation of harm. , 2014, Emotion.

[36]  P. Dayan,et al.  Correction: Disentangling the Roles of Approach, Activation and Valence in Instrumental and Pavlovian Responding , 2014, PLoS Computational Biology.

[37]  Alice Y. Chiang,et al.  Working-memory capacity protects model-based learning from stress , 2013, Proceedings of the National Academy of Sciences.

[38]  Jason P. Mitchell,et al.  Intuitive Prosociality , 2013 .

[39]  Thomas H. B. FitzGerald,et al.  Disruption of Dorsolateral Prefrontal Cortex Decreases Model-Based in Favor of Model-free Control in Humans , 2013, Neuron.

[40]  E. Fehr,et al.  Changing Social Norm Compliance with Noninvasive Brain Stimulation , 2013, Science.

[41]  P. Dayan,et al.  Goals and Habits in the Brain , 2013, Neuron.

[42]  R. Blair The neurobiology of psychopathic traits in youths , 2013, Nature Reviews Neuroscience.

[43]  F. Cushman Action, Outcome, and Value , 2013, Personality and social psychology review : an official journal of the Society for Personality and Social Psychology, Inc.

[44]  M. Crockett Models of morality , 2013, Trends in Cognitive Sciences.

[45]  Adam G. Thomas,et al.  The Organization of Dorsal Frontal Cortex in Humans and Macaques , 2013, The Journal of Neuroscience.

[46]  Shinsuke Shimojo,et al.  Neural Computations Underlying Arbitration between Model-Based and Model-free Learning , 2013, Neuron.

[47]  W. Schultz Updating dopamine reward signals , 2013, Current Opinion in Neurobiology.

[48]  Dean Mobbs,et al.  Deconstructing the brain’s moral network: dissociable functionality between the temporoparietal junction and ventro-medial prefrontal cortex , 2013, Social cognitive and affective neuroscience.

[49]  R. Dolan,et al.  Dopamine Enhances Model-Based over Model-Free Choice Behavior , 2012, Neuron.

[50]  M. Rushworth,et al.  Connectivity-based subdivisions of the human right "temporoparietal junction area": evidence for different areas participating in different cortical networks. , 2012, Cerebral cortex.

[51]  Alberto Priori,et al.  Functional and clinical neuroanatomy of morality. , 2012, Brain : a journal of neurology.

[52]  Russell Thompson,et al.  Differential neural circuitry and self-interest in real vs hypothetical moral decisions , 2012, Social cognitive and affective neuroscience.

[53]  R. Marois,et al.  The roots of modern justice: cognitive and neural foundations of social norms and their enforcement , 2012, Nature Neuroscience.

[54]  Kurt Gray,et al.  Mind Perception Is the Essence of Morality , 2012, Psychological inquiry.

[55]  Peter Dayan,et al.  Bonsai Trees in Your Head: How the Pavlovian System Sculpts Goal-Directed Choices by Pruning Decision Trees , 2012, PLoS Comput. Biol..

[56]  P. Dayan,et al.  Mapping value based planning and extensively trained choice in the human brain , 2012, Nature Neuroscience.

[57]  Fiery Cushman,et al.  Simulating murder: the aversion to harmful action. , 2012, Emotion.

[58]  Luke J. Chang,et al.  Triangulating the Neural, Psychological, and Economic Bases of Guilt Aversion , 2011, Neuron.

[59]  Raymond J. Dolan,et al.  Disentangling the Roles of Approach, Activation and Valence in Instrumental and Pavlovian Responding , 2011, PLoS Comput. Biol..

[60]  P. Dayan,et al.  Model-based influences on humans’ choices and striatal prediction errors , 2011, Neuron.

[61]  S. Levinson,et al.  WEIRD languages have misled us, too , 2010, Behavioral and Brain Sciences.

[62]  P. Dayan,et al.  States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.

[63]  J. Henrich,et al.  The weirdest people in the world? , 2010, Behavioral and Brain Sciences.

[64]  R. Saxe,et al.  Disruption of the right temporoparietal junction with transcranial magnetic stimulation reduces the role of beliefs in moral judgments , 2010, Proceedings of the National Academy of Sciences.

[65]  I. Kant The Metaphysic of Ethics , 2009 .

[66]  Joseph M. Paxton,et al.  Patterns of neural activity associated with honest and dishonest moral decisions , 2009, Proceedings of the National Academy of Sciences.

[67]  B. Balleine,et al.  A specific role for posterior dorsolateral striatum in human habit learning , 2009, The European journal of neuroscience.

[68]  Jorge Moll,et al.  Social attachment and aversion in human moral cognition , 2009, Neuroscience & Biobehavioral Reviews.

[69]  D. Ariely,et al.  PSYCHOLOGICAL SCIENCE Research Article Contagion and Differentiation in Unethical Behavior The Effect of One Bad Apple on the Barrel , 2022 .

[70]  P. Dayan,et al.  Decision theory, reinforcement learning, and the brain , 2008, Cognitive, affective & behavioral neuroscience.

[71]  Noah J. Goldstein,et al.  Normative Social Influence is Underdetected , 2008, Personality & social psychology bulletin.

[72]  Russell A. Poldrack,et al.  Guidelines for reporting an fMRI study , 2008, NeuroImage.

[73]  U. Fischbacher,et al.  The Neural Signature of Social Norm Compliance , 2007, Neuron.

[74]  Á. Pascual-Leone,et al.  Diminishing Reciprocal Fairness by Disrupting the Right Prefrontal Cortex , 2006, Science.

[75]  J. O'Doherty,et al.  Is Avoiding an Aversive Outcome Rewarding? Neural Substrates of Avoidance Learning in the Human Brain , 2006, PLoS biology.

[76]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[77]  J. Grafman,et al.  The neural basis of human moral cognition , 2005, Nature Reviews Neuroscience.

[78]  J. O'Doherty,et al.  Human Neural Learning Depends on Reward Prediction Errors in the Blocking Paradigm , 2005, Journal of neurophysiology.

[79]  Andrew D. Engell,et al.  The Neural Bases of Cognitive Conflict and Control in Moral Judgment , 2004, Neuron.

[80]  D. Price Psychological and neural mechanisms of the affective dimension of pain. , 2000, Science.

[81]  Richard E. Tremblay,et al.  The development of aggressive behaviour during childhood: What have we learned in the past century? , 2000 .

[82]  Anthony K. P. Jones,et al.  The cortical representation of pain , 1999, PAIN.

[83]  H Nishijo,et al.  Emotional and Behavioral Correlates of Mediodorsal Thalamic Neurons during Associative Learning in Rats , 1996, The Journal of Neuroscience.

[84]  D. Gaffan,et al.  Amygdalar interaction with the mediodorsal nucleus of the thalamus and the ventromedial prefrontal cortex in stimulus-reward associative learning in the monkey , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[85]  A. Ohman,et al.  One-trial learning and superior resistance to extinction of autonomic responses conditioned to potentially phobic stimuli. , 1975, Journal of comparative and physiological psychology.

[86]  John C. Harsanyi,et al.  Cardinal Welfare, Individualistic Ethics, and Interpersonal Comparisons of Utility , 1955, Journal of Political Economy.

[87]  J. Bentham An Introduction to the Principles of Morals and Legislation , 1945, Princeton Readings in Political Thought.

[88]  Dan Ariely,et al.  The Effect of One Bad Apple on the Barrel , 2009 .

[89]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[90]  Bernard Gert,et al.  Common Morality: Deciding What to Do , 2004 .

[91]  T. Nagel Mortal Questions: What is it like to be a bat? , 2012 .

[92]  M. Pritchard ON COMMON MORALITY , 2001 .

[93]  F. Lenz,et al.  Stimulation in the human somatosensory thalamus can reproduce both the affective and sensory dimensions of previously experienced pain , 1995, Nature Network Boston.

[94]  Model Comparison and Occam ’ s Razor , 2022 .