Robust adaptive dynamic programming for sensorimotor control with signal-dependent noise

As human beings, we coordinate movements and interact with our environment through sensory information and motor adaptation in our daily lives. Many characteristics of these interactions can be studied using optimization-based models, which assume that the precise knowledge of both the sensorimotor system and its interacting environment is available for the central nervous system (CNS). However, when static and dynamic uncertainties are present, the previously developed optimization models may fail to explain how the CNS can adapt to the uncertainties and still coordinate the movement. In this paper, we attempt to propose a novel computational mechanism for sensorimotor control from a perspective of robust adaptive dynamic programming (RADP). It is suggested that, instead of identifying the system dynamics of both the motor system and the environment, the CNS computes iteratively a robust optimal control policy for movement, using the real-time sensory data. With the help of numerical analysis and simulations, it is observed that the proposed model can reproduce movement trajectories which are consistent with experimental data. Consequently, we conjecture that, in order to achieve successful adaptation, this RADP-type mechanism may be used by the CNS of humans to coordinate movements in the presence of static/dynamic arising in the sensorimotor system.

[1]  Miroslav Krstic,et al.  Stabilization of Nonlinear Uncertain Systems , 1998 .

[2]  G. Winkler,et al.  The Stochastic Integral , 1990 .

[3]  Konrad Paul Kording,et al.  Estimating the sources of motor errors for adaptation and generalization , 2008, Nature Neuroscience.

[4]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[6]  R. Ivry,et al.  The coordination of movement: optimal feedback control and beyond , 2010, Trends in Cognitive Sciences.

[7]  Zhong-Ping Jiang,et al.  Adaptive dynamic programming as a theory of sensorimotor control , 2012, 2012 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[8]  Reza Shadmehr,et al.  Motor Adaptation as a Process of Reoptimization , 2008, The Journal of Neuroscience.

[9]  Konrad Paul Kording,et al.  The dynamics of memory as a consequence of optimal adaptation to a changing body , 2007, Nature Neuroscience.

[10]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..

[11]  Zoubin Ghahramani,et al.  Computational principles of movement neuroscience , 2000, Nature Neuroscience.

[12]  M. Kawato,et al.  Functional significance of stiffness in adaptation of multijoint arm movements to stable and unstable dynamics , 2003, Experimental Brain Research.

[13]  David W. Franklin,et al.  Computational Mechanisms of Sensorimotor Control , 2011, Neuron.

[14]  Zhong-Ping Jiang,et al.  Robust adaptive dynamic programming for optimal nonlinear control design , 2013, 2013 9th Asian Control Conference (ASCC).

[15]  Jan C. Willems,et al.  Feedback stabilizability for stochastic systems with state and control dependent noise , 1976, Autom..

[16]  Emanuel Todorov,et al.  Stochastic Optimal Control and Estimation Methods Adapted to the Noise Characteristics of the Sensorimotor System , 2005, Neural Computation.

[17]  D. Kleinman On the stability of linear stochastic systems , 1969 .

[18]  F A Mussa-Ivaldi,et al.  Adaptive representation of dynamics during learning of a motor task , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[19]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[20]  Michael I. Jordan,et al.  Optimal feedback control as a theory of motor coordination , 2002, Nature Neuroscience.

[21]  Daniel M. Wolpert,et al.  Making smooth moves , 2022 .

[22]  P. Morasso Spatial control of arm movements , 2004, Experimental Brain Research.

[23]  Emanuel Todorov,et al.  Evidence for the Flexible Sensorimotor Strategies Predicted by Optimal Feedback Control , 2007, The Journal of Neuroscience.

[24]  Zhong-Ping Jiang,et al.  Movement Duration, Fitts's Law, and an Infinite-Horizon Optimal Feedback Control Model for Biological Motor Systems , 2013, Neural Computation.

[25]  Reza Shadmehr,et al.  Computational nature of human adaptive control during learning of reaching movements in force fields , 1999, Biological Cybernetics.

[26]  T. Flash,et al.  Moving gracefully: quantitative theories of motor coordination , 1987, Trends in Neurosciences.

[27]  Rieko Osu,et al.  The central nervous system stabilizes unstable dynamics by learning optimal impedance , 2001, Nature.

[28]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[29]  M. Kawato,et al.  Formation and control of optimal trajectory in human multijoint arm movement , 1989, Biological Cybernetics.

[30]  宇野 洋二,et al.  Formation and control of optimal trajectory in human multijoint arm movement : minimum torque-change model , 1988 .

[31]  T. Flash,et al.  The coordination of arm movements: an experimentally confirmed mathematical model , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[32]  Kenji Doya,et al.  Neural mechanisms of learning and control , 2001 .

[33]  S. Scott Optimal feedback control and the neural basis of volitional motor control , 2004, Nature Reviews Neuroscience.

[34]  Michael S Landy,et al.  Adaptation to sensory-motor reflex perturbations is blind to the source of errors. , 2012, Journal of vision.

[35]  P. R. Davidson,et al.  Motor learning and prediction in a variable environment , 2003, Current Opinion in Neurobiology.

[36]  Zhong-Ping Jiang,et al.  Optimal control mechanisms in human arm reaching movements , 2011, Proceedings of the 30th Chinese Control Conference.

[37]  Zhong-Ping Jiang,et al.  Robust adaptive dynamic programming for linear and nonlinear systems: An overview , 2013, Eur. J. Control.

[38]  Iasson Karafyllis,et al.  Stability and Stabilization of Nonlinear Systems , 2011 .

[39]  Luigi Fortuna,et al.  Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control , 2009 .