Reconstructing Null-space Policies Subject to Dynamic Task Constraints in Redundant Manipulators

We consider the problem of direct policy learning in situations where the policies are only observable through their projections into the null-space of a set of dynamic, non-linear task constraints. We tackle the issue of deriving consistent data for the learning of such policies and make two contributions towards its solution. Firstly, we derive the conditions required to exactly reconstruct null-space policies and suggest a learning strategy based on this derivation. Secondly, we consider the case that the null-space policy is conservative and show that such a policy can be learnt more easily and robustly by learning the underlying potential function and using this as our representation of the policy.

[1]  Daniel E. Whitney,et al.  Resolved Motion Rate Control of Manipulators and Human Prostheses , 1969 .

[2]  A. Liegeois,et al.  Automatic supervisory control of the configuration and behavior of multi-body mechanisms , 1977 .

[3]  Tsuneo Yoshikawa,et al.  Manipulability of Robotic Mechanisms , 1985 .

[4]  Yoshihiko Nakamura,et al.  Advanced robotics - redundancy and optimization , 1990 .

[5]  Vincent De Sapio,et al.  Human-Like Motion from Physiologically-Based Potential Energies , 2004 .

[6]  Jun Nakanishi,et al.  A unifying methodology for the control of robotic systems , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Michael Gienger,et al.  Task-oriented whole body motion for humanoid robots , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[8]  Michael Gienger,et al.  Real-Time Self Collision Avoidance for Humanoids by means of Nullspace Criteria and Task Intervals , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[9]  Oussama Khatib,et al.  Contact consistent control framework for humanoid robots , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[10]  Sethu Vijayakumar,et al.  Learning Utility Surfaces for Movement Selection , 2006, 2006 IEEE International Conference on Robotics and Biomimetics.

[11]  Oussama Khatib,et al.  A whole-body control framework for humanoids operating in human environments , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..