Learning from Sensory and Reward Prediction Errors during Motor Adaptation

Voluntary motor commands produce two kinds of consequences. Initially, a sensory consequence is observed in terms of activity in our primary sensory organs (e.g., vision, proprioception). Subsequently, the brain evaluates the sensory feedback and produces a subjective measure of the utility or usefulness of the motor commands (e.g., reward). As a result, comparisons between predicted and observed consequences of motor commands produce two forms of prediction error. How do these errors contribute to changes in motor commands? Here, we considered a reach adaptation protocol and found that when high-quality sensory feedback was available, adaptation of motor commands was driven almost exclusively by sensory prediction errors. This form of learning had a distinct signature: as motor commands adapted, the subjects altered their predictions regarding the sensory consequences of motor commands, and generalized this learning broadly to neighboring motor commands. In contrast, as the quality of the sensory feedback degraded, adaptation of motor commands became more dependent on reward prediction errors. Reward prediction errors produced comparable changes in the motor commands, but produced no change in the predicted sensory consequences of motor commands, and generalized only locally. Because we found a within-subject correlation between generalization patterns and sensory remapping, it may be possible to infer from these measures an individual's relative reliance on sensory versus reward prediction errors during adaptation. We suggest that while motor commands change because of both sensory and reward prediction errors, only sensory prediction errors produce a change in the neural system that predicts the sensory consequences of motor commands.
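The contrast between the two error signals can be illustrated with a toy simulation. The sketch below is not the model used in the study. It is a minimal illustration under assumed dynamics, in which a cursor shift is imposed gradually on a one-dimensional reach. All function names, parameter values, and units are hypothetical. One learner sees the cursor and updates a forward model from the sensory prediction error. The other receives only a binary hit-or-miss signal and shifts its mean command using the reward prediction error, leaving its forward model untouched. The sketch does not attempt to model generalization.

```python
import random


def perturbation(trial, ramp_trials=200, max_shift=8.0):
    """Cursor shift (arbitrary units): ramps up gradually, then stays constant."""
    return max_shift * min(trial, ramp_trials) / ramp_trials


def adapt_with_sensory_error(n_trials=400, learning_rate=0.2, target=0.0):
    """Toy learner with full cursor feedback; learns from sensory prediction error."""
    shift_estimate = 0.0  # forward model's estimate of the cursor shift (assumed form)
    command = target
    for trial in range(1, n_trials + 1):
        command = target - shift_estimate           # aim so the predicted cursor hits the target
        predicted = command + shift_estimate        # predicted sensory consequence
        observed = command + perturbation(trial)    # observed sensory consequence
        spe = observed - predicted                  # sensory prediction error
        shift_estimate += learning_rate * spe       # the forward model itself is updated
    return command, shift_estimate


def adapt_with_reward_error(n_trials=400, learning_rate=0.5, value_rate=0.1,
                            explore_sd=1.5, tolerance=3.0, target=0.0, seed=0):
    """Toy learner with hit/miss feedback only; learns from reward prediction error."""
    rng = random.Random(seed)
    shift_estimate = 0.0   # forward model is never updated in this learner
    mean_command = target
    expected_reward = 0.0  # running prediction of reward
    for trial in range(1, n_trials + 1):
        exploration = rng.gauss(0.0, explore_sd)    # trial-to-trial motor exploration
        command = mean_command + exploration
        cursor = command + perturbation(trial)      # hidden from the learner
        reward = 1.0 if abs(cursor - target) < tolerance else 0.0
        rpe = reward - expected_reward              # reward prediction error
        mean_command += learning_rate * rpe * exploration  # reinforce rewarded exploration
        expected_reward += value_rate * rpe
    return mean_command, shift_estimate


if __name__ == "__main__":
    print("sensory-error learner (command, shift estimate):", adapt_with_sensory_error())
    print("reward-error learner  (command, shift estimate):", adapt_with_reward_error())
```

In this sketch both learners end up with a command that compensates for the shift, but only the sensory-error learner changes its estimate of where the cursor will appear. This mirrors the qualitative dissociation described above, within the limits of the assumptions made.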
