Control of movement vigor and decision making during foraging

Significance How long should one stay and accumulate reward, and then, how fast should one travel to the next reward site? Marginal value theorem describes the decision-making process: the brain compares the immediate rate of harvest with its global history of capture, deciding to leave when the immediate rate falls below the average. Here, we extended the theory, showing that the same principle can be used to control speed of movements: the brain should compare the immediate rate of energy expenditure during movement with the global capture rate, planning to arrive at the destination when the two become equal. Experimental results confirmed many of the predictions, suggesting that a shared principle may underlie decision making and control of movement vigor. During foraging, animals decide how long to stay at a patch and harvest reward, and then, they move with certain vigor to another location. How does the brain decide when to leave, and how does it determine the speed of the ensuing movement? Here, we considered the possibility that both the decision-making and the motor control problems aimed to maximize a single normative utility: the sum of all rewards acquired minus all efforts expended divided by total time. This optimization could be achieved if the brain compared a local measure of utility with its history. To test the theory, we examined behavior of people as they gazed at images: they chose how long to look at the image (harvesting information) and then moved their eyes to another image, controlling saccade speed. We varied reward via image content and effort via image eccentricity, and then, we measured how these changes affected decision making (gaze duration) and motor control (saccade speed). After a history of low rewards, people increased gaze duration and decreased saccade speed. In anticipation of future effort, they lowered saccade speed and increased gaze duration. After a history of high effort, they elevated their saccade speed and increased gaze duration. Therefore, the theory presented a principled way with which the brain may control two aspects of behavior: movement speed and harvest duration. Our experiments confirmed many (but not all) of the predictions, suggesting that harvest duration and movement speed, fundamental aspects of behavior during foraging, may be governed by a shared principle of control.

[1]  Elliot Aronson,et al.  The effect of severity of initiation on liking for a group. , 1959 .

[2]  D. Robinson Oculomotor unit behavior in the monkey. , 1970, Journal of neurophysiology.

[3]  A. Neuringer Many responses per food reward with free food present. , 1970, Science.

[4]  E. Charnov Optimal foraging, the marginal value theorem. , 1976, Theoretical population biology.

[5]  R. Cowie Optimal foraging in great tits (Parus major) , 1977, Nature.

[6]  D. F. Hoyt,et al.  Gait and the energetics of locomotion in horses , 1981, Nature.

[7]  N. Verbeek,et al.  Diet Selection and Optimization by Northwestern Crows Feeding on Japanese Littleneck Clams , 1986 .

[8]  W. Lemon Fitness consequences of foraging behaviour in the zebra finch , 1991, Nature.

[9]  M. Segraves Activity of monkey frontal eye field neurons projecting to oculomotor regions of the pons. , 1992, Journal of neurophysiology.

[10]  O. Hikosaka,et al.  Eye movements in monkeys with local dopamine depletion in the caudate nucleus. II. Deficits in voluntary saccades , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[11]  O. Hikosaka,et al.  Expectation of reward modulates cognitive signals in the basal ganglia , 1998, Nature Neuroscience.

[12]  P. Glimcher,et al.  Quantitative analysis of substantia nigra pars reticulata activity during a visually guided saccade task. , 1999, Journal of neurophysiology.

[13]  K. Cullen,et al.  Quantitative analysis of abducens neuron discharge dynamics during saccadic and slow eye movements. , 1999, Journal of neurophysiology.

[14]  T. Zentall,et al.  “work ethic” in pigeons: Reward value is directly related to the effort or time required to obtain the reward , 2000, Psychonomic bulletin & review.

[15]  D. Ariely,et al.  Beautiful Faces Have Variable Reward Value fMRI and Behavioral Evidence , 2001, Neuron.

[16]  O. Hikosaka,et al.  Modulation of saccadic eye movements by predicted reward outcome , 2001, Experimental Brain Research.

[17]  S. Laughlin,et al.  An Energy Budget for Signaling in the Grey Matter of the Brain , 2001, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[18]  A. Kacelnik,et al.  To walk or to fly? How birds choose among foraging modes. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Okihide Hikosaka,et al.  Role of Primate Substantia Nigra Pars Reticulata in Reward-Oriented Saccadic Eye Movement , 2002, The Journal of Neuroscience.

[20]  佐藤 真琴,et al.  Role of primate substantia nigra pars reticulata in reward-oriented saccadic eye movement , 2002 .

[21]  A. Kacelnik,et al.  Cost can increase preference in starlings , 2002, Animal Behaviour.

[22]  D. Perrett,et al.  Beauty in a smile: the role of medial orbitofrontal cortex in facial attractiveness , 2003, Neuropsychologia.

[23]  H. Ralston,et al.  Optimization of energy expenditure during level walking , 2004, European Journal of Applied Physiology and Occupational Physiology.

[24]  H. Ralston Energy-speed relation and optimal speed during level walking , 1958, Internationale Zeitschrift für angewandte Physiologie einschließlich Arbeitsphysiologie.

[25]  Thomas R Zentall,et al.  Contrast and the justification of effort , 2005, Psychonomic bulletin & review.

[26]  P. Dayan,et al.  Tonic dopamine: opportunity costs and the control of response vigor , 2007, Psychopharmacology.

[27]  O. Hikosaka,et al.  Lateral habenula as a source of negative reward signals in dopamine neurons , 2007, Nature.

[28]  R. Shadmehr,et al.  The intrinsic value of visual information affects saccade velocities , 2009, Experimental Brain Research.

[29]  O. Hikosaka,et al.  Two types of dopamine neuron distinctly convey positive and negative motivational signals , 2009, Nature.

[30]  Karen L Steudel-Numbers,et al.  Optimal running speed and the evolution of hominin hunting strategies. , 2009, Journal of human evolution.

[31]  R. Shadmehr,et al.  Temporal Discounting of Reward and the Cost of Time in Motor Control , 2010, The Journal of Neuroscience.

[32]  R. Nelson,et al.  Motor Planning under Unpredictable Reward: Modulations of Movement Vigor and Primate Striatum Activity , 2011, Front. Neurosci..

[33]  Peter Dayan,et al.  Vigor in the Face of Fluctuating Rates of Reward: An Experimental Examination , 2011, Journal of Cognitive Neuroscience.

[34]  John M. Pearson,et al.  Neuronal basis of sequential foraging decisions in a patchy environment , 2011, Nature Neuroscience.

[35]  Leonid Kruglyak,et al.  Catecholamine receptor polymorphisms affect decision-making in C. elegans , 2011, Nature.

[36]  O. Hikosaka,et al.  The Primate Ventral Pallidum Encodes Expected Reward Value and Regulates Motor Action , 2012, Neuron.

[37]  Lionel Rigoux,et al.  A Model of Reward- and Effort-Based Optimal Decision Making and Motor Control , 2012, PLoS Comput. Biol..

[38]  R. Turner,et al.  Limited Encoding of Effort by Dopamine Neurons in a Cost–Benefit Trade-off Task , 2013, The Journal of Neuroscience.

[39]  T. Zentall Animals Prefer Reinforcement that Follows Greater Effort: Justification of Effort or Within-Trial Contrast? , 2013 .

[40]  Stefan Mihalas,et al.  A general theory of intertemporal decision-making and the perception of time , 2013, Front. Behav. Neurosci..

[41]  Jennie E. S. Choi,et al.  Vigor of Movements and the Cost of Time in Decision Making , 2014, The Journal of Neuroscience.

[42]  Jeremiah Y. Cohen,et al.  Serotonergic neurons signal reward and punishment on multiple timescales , 2015, eLife.

[43]  M. Rice,et al.  Degree of Preference and Its Influence on Motor Control When Reaching for Most Preferred, Neutrally Preferred, and Least Preferred Candy , 2015, OTJR : occupation, participation and health.

[44]  Jessica C. Selinger,et al.  Humans Can Continuously Optimize Energetic Cost during Walking , 2015, Current Biology.

[45]  Karolina M. Lempert,et al.  Modulation of Saccade Vigor during Value-Based Decision Making , 2015, The Journal of Neuroscience.

[46]  N. Daw,et al.  Learning the opportunity cost of time in a patch-foraging task , 2015, Cognitive, Affective, & Behavioral Neuroscience.

[47]  M. Husain,et al.  Reward Pays the Cost of Noise Reduction in Motor and Cognitive Control , 2015, Current Biology.

[48]  Robbie S Wilson,et al.  Predicting the Movement Speeds of Animals in Natural Environments. , 2015, Integrative and Comparative Biology.

[49]  Helen J. Huang,et al.  A Representation of Effort in Decision-Making and Motor Control , 2016, Current Biology.

[50]  E. Burdet,et al.  The duration of reaching movement is longer than predicted by minimum variance. , 2016, Journal of neurophysiology.

[51]  F. Jean,et al.  Why Don't We Move Slower? The Value of Time in the Neural Control of Action , 2016, The Journal of Neuroscience.

[52]  Konrad Paul Kording,et al.  Role of expected reward in frontal eye field during natural scene search. , 2016, Journal of neurophysiology.

[53]  J. Randall Flanagan,et al.  Rapid target foraging with reach or gaze: The hand looks further ahead than the eye , 2017, PLoS Comput. Biol..

[54]  Steve W. C. Chang,et al.  Social resource foraging is guided by the principles of the Marginal Value Theorem , 2017, Scientific Reports.

[55]  Steve W. C. Chang,et al.  Posterior Cingulate Neurons Dynamically Signal Decisions to Disengage during Foraging , 2017, Neuron.

[56]  Reza Shadmehr,et al.  Distinct neural circuits for control of movement vs. holding still. , 2017, Journal of neurophysiology.

[57]  Ioannis Rigas,et al.  Movement vigor as a traitlike attribute of individuality. , 2018, Journal of neurophysiology.

[58]  Alaa A. Ahmed,et al.  Vigor of reaching movements: reward discounts the cost of effort. , 2018, Journal of neurophysiology.

[59]  R. Costa,et al.  Dopamine neuron activity before action initiation gates and invigorates future movements , 2018, Nature.

[60]  J. Krebs,et al.  Foraging Theory , 2019 .