Efficient learning of constraints and generic null space policies

A large class of motions can be decomposed into a movement task and null-space policy subject to a set of constraints. When learning such motions from demonstrations, we aim to achieve generalisation across different unseen constraints and to increase the robustness to noise while keeping the computational cost low. There exists a variety of methods for learning the movement policy and the constraints. The effectiveness of these techniques has been demonstrated in low-dimensional scenarios and simple motions. In this paper, we present a fast and accurate approach to learning constraints from observations. This novel formulation of the problem allows the constraint learning method to be coupled with the policy learning method to improve policy learning accuracy, which enables us to learn more complex motions. We demonstrate our approach by learning a complex surface wiping policy in a 7-DOF robotic arm.

[1]  Oussama Khatib,et al.  Contact consistent control framework for humanoid robots , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[2]  Sethu Vijayakumar,et al.  Methods for Learning Control Policies from Variable-Constraint Demonstrations , 2010, From Motor Learning to Interaction Learning in Robots.

[3]  Sethu Vijayakumar,et al.  Learning nullspace policies , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Matthew Howard,et al.  Learning control policies from constrained motion , 2009 .

[5]  Hsiu-Chin Lin Novel approach for representing, generalising, and quantifying periodic gaits , 2015 .

[6]  Olivier Stasse,et al.  Reverse Control for Humanoid Robot Task Recognition , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Pierre-Brice Wieber,et al.  Hierarchical quadratic programming: Fast online humanoid-robot motion generation , 2014, Int. J. Robotics Res..

[8]  Michael Gienger,et al.  Real-Time Self Collision Avoidance for Humanoids by means of Nullspace Criteria and Task Intervals , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[9]  Nicolas Mansard,et al.  Task Sequencing for High-Level Sensor-Based Control , 2007, IEEE Transactions on Robotics.

[10]  Nikolaos G. Tsagarakis,et al.  Statistical dynamical systems for skills acquisition in humanoids , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).

[11]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Christopher G. Atkeson,et al.  Constructive Incremental Learning from Only Local Information , 1998, Neural Computation.

[13]  Sethu Vijayakumar,et al.  Learning null space projections , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Michael Gienger,et al.  Task-oriented whole body motion for humanoid robots , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[15]  Stefan Schaal,et al.  http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[16]  Hsiu-Chin Lin,et al.  Learning Null Space Projections in Operational Space Formulation , 2016, ArXiv.

[17]  Surya P. N. Singh,et al.  V-REP: A versatile and scalable robot simulation framework , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Ronan Boulic,et al.  An inverse kinematics architecture enforcing an arbitrary number of strict priority levels , 2004, The Visual Computer.

[19]  Alexander Herzog,et al.  Momentum control with hierarchical inverse dynamics on a torque-controlled humanoid , 2014, Autonomous Robots.

[20]  Tsuneo Yoshikawa,et al.  Manipulability of Robotic Mechanisms , 1985 .

[21]  H. Cruse,et al.  The human arm as a redundant manipulator: The control of path and joint angles , 2004, Biological Cybernetics.

[22]  Oussama Khatib,et al.  A Unified Framework for Whole-Body Humanoid Robot Control with Multiple Constraints and Contacts , 2008, EUROS.

[23]  Stefan Schaal,et al.  Learning inverse kinematics , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[24]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..