Functional mechanisms of motor skill acquisition

As a motor skill is learned, behavior progresses from execution of movements that appear to be separately generated to recruitment of a single entity. Movements come to be executed more quickly, require less attention, and behavior loses flexibility. Neural activity also changes. Task-related neuron activity during a movement executed as part of a motor skill differs from that during the same movement executed alone. Also, cortical planning areas (e.g., frontal and prefrontal cortices) dominate control early in learning, while less cognitive areas (e.g., striatum) dominate later. The change in behavior and neural activity suggests that different control strategies and systems are employed as the motor skill develops. We propose that the behavioral and neural progression is due to a transfer of control to different types of controllers: explicit planner, which selects movements by considering the goal; value-based, which selects movements based on estimated values of each choice; and static-policy, in which a sensory cue directly elicits a movement – no decision is made. Explicit planners require much computation (and thus time and attention) and pre-existing knowledge, but are able to make reasonable decisions with little experience and are flexible to changes in task and environment. Static-policy controllers require little computation and knowledge, but must be trained with experience and are inflexible. Value-based controllers have intermediate characteristics. Neural systems can implement these mechanisms: frontal cortices conduct planning, striatum and prefrontal cortex estimate values, and the static policy controller can be implemented by a direct mapping, such as thalamus (sensory) to striatum (motor). The progression of the behavior and neural systems associated with the progression of the controllers is similar to that seen in motor skill development. We test the validity of this scheme with computational models – based on biologically plausible mechanisms and architecture – in which an agent must execute a series of actions (analogous to movements), elicited by the controllers, to solve tasks. As the succeeding controller is trained, it selects a movement faster than the preceding controller, which relinquishes control. By comparing model behavior to human and animal behavior in analogous tasks, we show that it exhibits qualities indicative of motor skill acquisition. We also investigate how task specification and environmental conditions affect motor skill development and strategy, how the presence of existing motor skills affect the agent's strategy in solving other tasks, and the parallels between resulting model behavior and human and animal behavior. Previous models have investigated how different controllers participate in biological decision making [1] and motor control [2-4]. While each model has unique properties, they all show that the availability of different controllers improves learning and behavior.

[1]  Mitsuo Kawato,et al.  Feedback-Error-Learning Neural Network for Supervised Motor Learning , 1990 .

[2]  J Tanji,et al.  Visually based path-planning by Japanese monkeys. , 2001, Brain research. Cognitive brain research.

[3]  J. W. Aldridge,et al.  Coding of Serial Order by Neostriatal Neurons: A “Natural Action” Approach to Movement Sequence , 1998, The Journal of Neuroscience.

[4]  K. Doya,et al.  Parallel neural networks for learning sequential procedures , 1999, Trends in Neurosciences.

[5]  J. Doyon,et al.  Reorganization and plasticity in the adult brain during learning of motor skills , 2005, Current Opinion in Neurobiology.

[6]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.

[7]  Warren B. Powell,et al.  Handbook of Learning and Approximate Dynamic Programming , 2006, IEEE Transactions on Automatic Control.

[8]  Jennie Si,et al.  Supervised ActorCritic Reinforcement Learning , 2004 .

[9]  R. Eckmiller Advanced neural computers , 1990 .

[10]  P. Dayan,et al.  Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control , 2005, Nature Neuroscience.

[11]  A. Graybiel,et al.  Role of [corrected] nigrostriatal dopamine system in learning to perform sequential motor tasks in a predictive manner. , 1999, Journal of neurophysiology.

[12]  J. Houk,et al.  Modulation of striatal single units by expected reward: a spiny neuron model displaying dopamine-induced bistability. , 2003, Journal of neurophysiology.