Adaptive effort investment in cognitive and physical tasks: a neurocomputational model

Despite its importance in everyday life, the computational nature of effort investment remains poorly understood. We propose an effort model obtained from optimality considerations, and a neurocomputational approximation to the optimal model. Both are couched in the framework of reinforcement learning. It is shown that choosing when or when not to exert effort can be adaptively learned, depending on rewards, costs, and task difficulty. In the neurocomputational model, the limbic loop comprising anterior cingulate cortex (ACC) and ventral striatum in the basal ganglia allocates effort to cortical stimulus-action pathways whenever this is valuable. We demonstrate that the model approximates optimality. Next, we consider two hallmark effects from the cognitive control literature, namely proportion congruency and sequential congruency effects. It is shown that the model exerts both proactive and reactive cognitive control. Then, we simulate two physical effort tasks. In line with empirical work, impairing the model's dopaminergic pathway leads to apathetic behavior. Thus, we conceptually unify the exertion of cognitive and physical effort, studied across a variety of literatures (e.g., motivation and cognitive control) and animal species.

[1]  G. S. Reynolds,et al.  A quantitative analysis of the responding maintained by interval schedules of reinforcement. , 1968, Journal of the experimental analysis of behavior.

[2]  G. Logan,et al.  When it helps to be misled: Facilitative effects of increasing the frequency of conflicting stimuli in a Stroop-like task , 1979 .

[3]  Douglas L. Jones,et al.  From motivation to action: Functional interface between the limbic system and the motor system , 1980, Progress in Neurobiology.

[4]  Jamie I. D. Campbell,et al.  Mental multiplication skill: Structure, process, and acquisition. , 1985 .

[5]  P. Merikle,et al.  Distinguishing conscious from unconscious perceptual processes. , 1986, Canadian journal of psychology.

[6]  G. E. Alexander,et al.  Parallel organization of functionally segregated circuits linking basal ganglia and cortex. , 1986, Annual review of neuroscience.

[7]  G. E. Alexander,et al.  Functional architecture of basal ganglia circuits: neural substrates of parallel processing , 1990, Trends in Neurosciences.

[8]  J. Hohnsbein,et al.  Effects of crossmodal divided attention on late ERP components. II. Error processing in choice reaction tasks. , 1991, Electroencephalography and clinical neurophysiology.

[9]  E. Donchin,et al.  Optimizing the use of information: strategic control of activation of responses. , 1992, Journal of experimental psychology. General.

[10]  A. Henik,et al.  Controlling Stroop effects by manipulating expectations for color words , 1992, Memory & cognition.

[11]  D. Meyer,et al.  A Neural System for Error Detection and Compensation , 1993 .

[12]  J. Salamone,et al.  Anhedonia or anergia? Effects of haloperidol and nucleus accumbens dopamine depletion on instrumental response selection in a T-maze cost/benefit procedure , 1994, Behavioural Brain Research.

[13]  B. Vogt,et al.  Contributions of anterior cingulate cortex to behaviour. , 1995, Brain : a journal of neurology.

[14]  Peter Dayan,et al.  A Neural Substrate of Prediction and Reward , 1997, Science.

[15]  R E Harlan,et al.  The accumbens: beyond the core-shell dichotomy. , 1997, The Journal of neuropsychiatry and clinical neurosciences.

[16]  Rob R. de Ruyter van Steveninck,et al.  The metabolic cost of neural information , 1998, Nature Neuroscience.

[17]  Joshua W. Brown,et al.  How the Basal Ganglia Use Parallel Excitatory and Inhibitory Learning Pathways to Selectively Respond to Unexpected Rewarding Cues , 1999, The Journal of Neuroscience.

[18]  J. Cohen,et al.  The role of locus coeruleus in the regulation of cognitive performance. , 1999, Science.

[19]  B. Moghaddam,et al.  Target‐Specific Glutamatergic Regulation of Dopamine Neurons in the Ventral Tegmental Area , 2000, Journal of neurochemistry.

[20]  M. Botvinick,et al.  Conflict monitoring and cognitive control. , 2001, Psychological review.

[21]  E. Miller,et al.  An integrative theory of prefrontal cortex function. , 2001, Annual review of neuroscience.

[22]  Brian Knutson,et al.  Anticipation of Increasing Monetary Reward Selectively Recruits Nucleus Accumbens , 2001, The Journal of Neuroscience.

[23]  Clay B. Holroyd,et al.  The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. , 2002, Psychological review.

[24]  M. Walton,et al.  The Role of Rat Medial Frontal Cortex in Effort-Based Decision Making , 2002, The Journal of Neuroscience.

[25]  C. Carter,et al.  The anterior cingulate as a conflict monitor: fMRI and ERP studies , 2002, Physiology & Behavior.

[26]  E. Awh,et al.  Conflict adaptation effects in the absence of executive control , 2003, Nature Neuroscience.

[27]  Saori C. Tanaka,et al.  Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops , 2004, Nature Neuroscience.

[28]  K. R. Ridderinkhof,et al.  The Role of the Medial Frontal Cortex in Cognitive Control , 2004, Science.

[29]  Jonathan D. Cohen,et al.  Anterior Cingulate Conflict Monitoring and Adjustments in Control , 2004, Science.

[30]  M. Walton,et al.  Differential involvement of serotonin and dopamine systems in cost-benefit decisions about delay or effort , 2005, Psychopharmacology.

[31]  Matthew T. Kaufman,et al.  Distributed Neural Representation of Expected Value , 2005, The Journal of Neuroscience.

[32]  T. Egner,et al.  Cognitive control mechanisms resolve conflict through cortical amplification of task-relevant information , 2005, Nature Neuroscience.

[33]  Jonathan D. Cohen,et al.  An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance. , 2005, Annual review of neuroscience.

[34]  Jonathan D. Cohen,et al.  An exploration-exploitation model based on norepinepherine and dopamine activity , 2005, NIPS.

[35]  Joshua W. Brown,et al.  Learned Predictions of Error Likelihood in the Anterior Cingulate Cortex , 2005, Science.

[36]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[37]  Lauren M. Bylsma,et al.  The conflict adaptation effect: It’s not just priming , 2005, Cognitive, affective & behavioral neuroscience.

[38]  J L Kenemans,et al.  Mental Fatigue : Costs and Benefits , 2005 .

[39]  Michael J. Frank,et al.  Dynamic Dopamine Modulation in the Basal Ganglia: A Neurocomputational Account of Cognitive Deficits in Medicated and Nonmedicated Parkinsonism , 2005, Journal of Cognitive Neuroscience.

[40]  P. Dayan,et al.  Dopamine, learning, and impulsivity: a biological account of attention-deficit/hyperactivity disorder. , 2005, Journal of child and adolescent psychopharmacology.

[41]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[42]  R. C. Pierce,et al.  The mesolimbic dopamine system: The final common pathway for the reinforcing effect of drugs of abuse? , 2006, Neuroscience & Biobehavioral Reviews.

[43]  Matthew J. C. Crump,et al.  The context-specific proportion congruent Stroop effect: Location as a contextual cue , 2006, Psychonomic bulletin & review.

[44]  Michael J. Frank,et al.  Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia , 2006, Neural Computation.

[45]  M. Walton,et al.  Separate neural pathways process different decision costs , 2006, Nature Neuroscience.

[46]  S. Quartz,et al.  Neural Differentiation of Expected Reward and Risk in Human Subcortical Structures , 2006, Neuron.

[47]  W. Notebaert,et al.  Top-down and bottom-up sequential modulations of congruency effects , 2006, Psychonomic bulletin & review.

[48]  P. Dayan,et al.  Tonic dopamine: opportunity costs and the control of response vigor , 2007, Psychopharmacology.

[49]  Peter Redgrave,et al.  Basal Ganglia , 2020, Encyclopedia of Autism Spectrum Disorders.

[50]  Derek Besner,et al.  Item-specific adaptation and the conflict-monitoring hypothesis: a computational model. , 2007, Psychological review.

[51]  Angela J. Yu,et al.  Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration , 2007, Philosophical Transactions of the Royal Society B: Biological Sciences.

[52]  John M. Ennis,et al.  A neurobiological theory of automaticity in perceptual categorization. , 2007, Psychological review.

[53]  Keiji Tanaka,et al.  Medial prefrontal cell activity signaling prediction errors of action values , 2007, Nature Neuroscience.

[54]  Timothy E. J. Behrens,et al.  Learning the value of information in an uncertain world , 2007, Nature Neuroscience.

[55]  W. Notebaert,et al.  Dissociating conflict adaptation from feature integration: a multiple regression approach. , 2007, Journal of experimental psychology. Human perception and performance.

[56]  S. Floresco,et al.  Amygdala-prefrontal cortical circuitry regulates effort-based decision making. , 2006, Cerebral cortex.

[57]  R. Dolan,et al.  How the Brain Translates Money into Force: A Neuroimaging Study of Subliminal Motivation , 2007, Science.

[58]  Jonathan D. Cohen,et al.  On the Control of Control: The Role of Dopamine in Regulating Prefrontal Function and Working Memory , 2007 .

[59]  P. Glimcher,et al.  The neural correlates of subjective value during intertemporal choice , 2007, Nature Neuroscience.

[60]  Maarten A. S. Boksem,et al.  Mental fatigue: Costs and benefits , 2008, Brain Research Reviews.

[61]  James R. Schmidt,et al.  The Stroop effect: why proportion congruent has nothing to do with congruency and everything to do with contingency. , 2008, Journal of experimental psychology. Learning, memory, and cognition.

[62]  W. Notebaert,et al.  Cognitive control acts locally , 2008, Cognition.

[63]  G. Dreisbach,et al.  Context-sensitive adjustments of cognitive control: conflict-adaptation effects are modulated by processing demands of the ongoing task. , 2008, Journal of experimental psychology. Learning, memory, and cognition.

[64]  R. Hübner,et al.  On-the-fly adaptation of selectivity in the flanker task , 2008, Psychonomic bulletin & review.

[65]  T. Egner Multiple conflict-driven control mechanisms in the human brain , 2008, Trends in Cognitive Sciences.

[66]  W. Notebaert,et al.  Hebbian learning of cognitive control: dealing with specific and nonspecific adaptation. , 2008, Psychological review.

[67]  S. Floresco,et al.  Dopaminergic and Glutamatergic Regulation of Effort- and Delay-Based Decision Making , 2008, Neuropsychopharmacology.

[68]  D. Zald,et al.  Worth the ‘EEfRT’? The Effort Expenditure for Rewards Task as an Objective Measure of Motivation and Anhedonia , 2009, PloS one.

[69]  M. Walton,et al.  Comparing the role of the anterior cingulate cortex and 6‐hydroxydopamine nucleus accumbens lesions on operant effort‐based decision making , 2009, The European journal of neuroscience.

[70]  Joseph T. McGuire,et al.  Effort discounting in human nucleus accumbens , 2009, Cognitive, affective & behavioral neuroscience.

[71]  Jan R. Wiersema,et al.  Context-dependent Dynamic Processes in Attention Deficit/Hyperactivity Disorder: Differentiating Common and Unique Effects of State Regulation Deficits and Delay Aversion , 2010, Neuropsychology Review.

[72]  L. Green,et al.  Dopamine modulates effort-based decision making in rats. , 2009, Behavioral neuroscience.

[73]  Timothy Edward John Behrens,et al.  Effort-Based Cost–Benefit Valuation and the Human Brain , 2009, The Journal of Neuroscience.

[74]  O. Hikosaka,et al.  Two types of dopamine neuron distinctly convey positive and negative motivational signals , 2009, Nature.

[75]  Matthew J. C. Crump,et al.  Short article: The flexibility of context-specific control: Evidence for context-driven generalization of item-specific control settings , 2009, Quarterly journal of experimental psychology.

[76]  Wim Notebaert,et al.  Adaptation by binding: a learning account of cognitive control , 2009, Trends in Cognitive Sciences.

[77]  P. Tobler,et al.  Functional imaging of the human dopaminergic midbrain , 2009, Trends in Neurosciences.

[78]  Mathias Pessiglione,et al.  Separate Valuation Subsystems for Delay and Effort Decision Costs , 2010, The Journal of Neuroscience.

[79]  H. Groenewegen,et al.  Nucleus accumbens and impulsivity , 2010, Progress in Neurobiology.

[80]  Wilfried Kunde,et al.  Trial-to-trial modulations of the Simon effect in conditions of attentional limitations: Evidence from dual tasks. , 2010, Journal of experimental psychology. Human perception and performance.

[81]  C. N. Boehler,et al.  The influence of reward associations on conflict processing in the Stroop task , 2010, Cognition.

[82]  Joseph T. McGuire,et al.  Decision making and the avoidance of cognitive demand. , 2010, Journal of experimental psychology. General.

[83]  G. Humphreys,et al.  Sustained vs. transient cognitive control: Evidence of a behavioral dissociation , 2010, Cognition.

[84]  Clay B. Holroyd,et al.  Focus on the positive: Computational simulations implicate asymmetrical reward prediction error signals in childhood attention-deficit/hyperactivity disorder , 2010, Brain Research.

[85]  Joshua W. Brown,et al.  Medial prefrontal cortex as an action-outcome predictor , 2011, Nature Neuroscience.

[86]  Rita Z. Goldstein,et al.  Motivation Deficit in ADHD is Associated with Dysfunction of the Dopamine Reward Pathway , 2010, Molecular Psychiatry.

[87]  W. Notebaert,et al.  Conflict adaptation by means of associative learning. , 2011, Journal of experimental psychology. Human perception and performance.

[88]  Tobias Teichert,et al.  The dorsal medial frontal cortex is sensitive to time on task, not response conflict or error likelihood , 2011, NeuroImage.

[89]  Czeslaw Stepniak Expected Value , 2011, International Encyclopedia of Statistical Science.

[90]  Wilfried Kunde,et al.  No conflict control in the absence of awareness , 2011, Psychological research.

[91]  Massimo Silvetti,et al.  Value and Prediction Error in Medial Frontal Cortex: Integrating the Single-Unit and Systems Levels of Analysis , 2011, Front. Hum. Neurosci..

[92]  J. Bugg,et al.  List-wide control is not entirely elusive: Evidence from picture–word Stroop , 2011, Psychonomic bulletin & review.

[93]  M. McDaniel,et al.  Revealing list-level control in the Stroop task by uncovering its benefits and a cost. , 2011, Journal of experimental psychology. Human perception and performance.

[94]  Justin A. Harris,et al.  Response rate and reinforcement rate in Pavlovian conditioning. , 2011, Journal of experimental psychology. Animal behavior processes.

[95]  C. N. Boehler,et al.  Task-Load-Dependent Activation of Dopaminergic Midbrain Areas in the Absence of Reward , 2011, The Journal of Neuroscience.

[96]  H. de Wit,et al.  Amping Up Effort: Effects of d-Amphetamine on Human Effort-Based Decision-Making , 2011, The Journal of Neuroscience.

[97]  Luiz Pessoa,et al.  Reward Reduces Conflict by Enhancing Attentional Control and Biasing Visual Cortical Processing , 2011, Journal of Cognitive Neuroscience.

[98]  Matthew J. C. Crump,et al.  In Support of a Distinction between Voluntary and Stimulus-Driven Control: A Review of the Literature on Proportion Congruent Effects , 2012, Front. Psychology.

[99]  Clay B. Holroyd,et al.  Motivation of extended behaviors by anterior cingulate cortex , 2012, Trends in Cognitive Sciences.

[100]  P. Dayan,et al.  Dopamine and performance in a reinforcement learning task: evidence from Parkinson's disease. , 2012 .

[101]  T. Braver The variable nature of cognitive control: a dual mechanisms framework , 2012, Trends in Cognitive Sciences.

[102]  P. Dayan Instrumental vigour in punishment and reward , 2012, The European journal of neuroscience.

[103]  J. Salamone,et al.  The Mysterious Motivational Functions of Mesolimbic Dopamine , 2012, Neuron.

[104]  D. Zald,et al.  Dopaminergic Mechanisms of Individual Differences in Human Effort-Based Decision-Making , 2012, The Journal of Neuroscience.

[105]  Lionel Rigoux,et al.  A Model of Reward- and Effort-Based Optimal Decision Making and Motor Control , 2012, PLoS Comput. Biol..

[106]  R. Shelton,et al.  Effort-based decision-making in major depressive disorder: a translational model of motivational anhedonia. , 2012, Journal of abnormal psychology.

[107]  Alec Solway,et al.  Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates. , 2012, Psychological review.

[108]  Wim Notebaert,et al.  Conflict adaptation: It is not what you expect , 2012, Quarterly journal of experimental psychology.

[109]  A. Song,et al.  The involvement of the dopaminergic midbrain and cortico-striatal-thalamic circuits in the integration of reward prospect and attentional task demands. , 2012, Cerebral cortex.

[110]  Angela L. Duckworth,et al.  An opportunity cost model of subjective effort and task performance. , 2013, The Behavioral and brain sciences.

[111]  Ruth Seurinck,et al.  The influence of the noradrenergic system on optimal control of neural plasticity , 2013, Front. Behav. Neurosci..

[112]  M. Greicius,et al.  The Will to Persevere Induced by Electrical Stimulation of the Human Cingulate Gyrus , 2013, Neuron.

[113]  Jonathan D. Cohen,et al.  The Expected Value of Control: An Integrative Theory of Anterior Cingulate Cortex Function , 2013, Neuron.

[114]  P. Dayan,et al.  Effort and Valuation in the Brain: The Effects of Anticipation and Execution , 2013, The Journal of Neuroscience.

[115]  Massimo Silvetti,et al.  Deficient reinforcement learning in medial frontal cortex as a model of dopamine-related motivational deficits in ADHD , 2013, Neural Networks.

[116]  M. Roesch,et al.  Separate Populations of Neurons in Ventral Striatum Encode Value and Motivation , 2013, PloS one.

[117]  Masayuki Matsumoto,et al.  Distinct Representations of Cognitive and Motivational Signals in Midbrain Dopamine Neurons , 2013, Neuron.

[118]  Florent Meyniel,et al.  How the Brain Decides When to Work and When to Rest: Dissociation of Implicit-Reactive from Explicit-Predictive Computational Processes , 2014, PLoS Comput. Biol..

[119]  W. Fias,et al.  Overlapping Neural Systems Represent Cognitive Effort and Reward Anticipation , 2014, PloS one.

[120]  Joshua W. Brown,et al.  From conflict management to reward-based decision making: Actors and critics in primate medial frontal cortex , 2014, Neuroscience & Biobehavioral Reviews.

[121]  Samuel M. McClure,et al.  Hierarchical control over effortful behavior by rodent medial frontal cortex: A computational model. , 2015, Psychological review.