Optimal Feedback Control for anthropomorphic manipulators

We study target reaching tasks of redundant anthropomorphic manipulators under the premise of minimal energy consumption and compliance during motion. We formulate this motor control problem in the framework of Optimal Feedback Control (OFC) by introducing a specific cost function that accounts for the physical constraints of the controlled plant. Using an approximative computational optimal control method we can optimally control a high-dimensional anthropomorphic robot without having to specify an explicit inverse kinematics, inverse dynamics or feedback control law. We highlight the benefits of this biologically plausible motor control strategy over traditional (open loop) optimal controllers: The presented approach proves to be significantly more energy efficient and compliant, while being accurate with respect to the task at hand. These properties are crucial for the control of mobile anthropomorphic robots, that are designed to interact safely in a human environment. To the best of our knowledge this is the first OFC implementation on a high-dimensional (redundant) manipulator.

[1]  Donald E. Kirk,et al.  Optimal control theory : an introduction , 1970 .

[2]  M. Ciletti,et al.  The computation and theory of optimal control , 1972 .

[3]  David Q. Mayne,et al.  Differential dynamic programming , 1972, The Mathematical Gazette.

[4]  Koichi Osuka,et al.  A New Identification Method for Serial Manipulator Arms , 1984 .

[5]  T. Flash,et al.  The coordination of arm movements: an experimentally confirmed mathematical model , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[6]  Robert F. Stengel,et al.  Optimal Control and Estimation , 1994 .

[7]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[8]  Daniel M. Wolpert,et al.  Making smooth moves , 2022 .

[9]  Michael I. Jordan,et al.  A Minimal Intervention Principle for Coordinated Movement , 2002, NIPS.

[10]  Jun Morimoto,et al.  Minimax Differential Dynamic Programming: An Application to Robust Biped Walking , 2002, NIPS.

[11]  M. Kawato,et al.  Formation and control of optimal trajectory in human multijoint arm movement , 1989, Biological Cybernetics.

[12]  E. Todorov Optimality principles in sensorimotor control , 2004, Nature Neuroscience.

[13]  S. Scott Optimal feedback control and the neural basis of volitional motor control , 2004, Nature Reviews Neuroscience.

[14]  W. L. Nelson Physical principles for economies of skilled movements , 1983, Biological Cybernetics.

[15]  E. Todorov,et al.  A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[16]  Weiwei Li Optimal control for biological movement systems , 2006 .

[17]  Emanuel Todorov,et al.  Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system , 2007, Int. J. Control.

[18]  William D. Smart,et al.  Receding Horizon Differential Dynamic Programming , 2007, NIPS.

[19]  Sethu Vijayakumar,et al.  Optimal Control with Adaptive Internal Dynamics Models , 2008, ICINCO-ICSO.