Constrained DMPs for Feasible Skill Learning on Humanoid Robots

In the context of humanoid skill learning, movement primitives have gained much attention because of their compact representation and convenient combination with a myriad of optimization approaches. Among them, a well-known scheme is to use Dynamic Movement Primitives (DMPs) with reinforcement learning (RL) algorithms. While various remarkable results have been reported, skill learning with physical constraints has not been sufficiently investigated. For example, when RL is employed to optimize the robot joint trajectories, the exploration noise could drive the resulting trajectory out of the joint limits. In this paper, we focus on robot skill learning characterized by joint limit avoidance, by introducing the novel Constrained Dynamic Movement Primitives (CDMPs). By controlling a set of transformed states (called exogenous states) instead of the original DMPs states, CDMPs are capable of maintaining the joint trajectories within the safety limits. We validate CDMPs on the humanoid robot iCub, showing the applicability of our approach.

[1]  Darwin G. Caldwell,et al.  Hybrid Probabilistic Trajectory Optimization Using Null-Space Exploration , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Jan Peters,et al.  Policy Search for Motor Primitives in Robotics , 2008, NIPS 2008.

[3]  Bin Yang,et al.  Improved Damped Least Squares Solution with Joint Limits, Joint Weights and Comfortable Criteria for Controlling Human-like Figures , 2008, 2008 IEEE Conference on Robotics, Automation and Mechatronics.

[4]  Darwin G. Caldwell,et al.  Kernelized movement primitives , 2017, Int. J. Robotics Res..

[5]  Alin Albu-Schäffer,et al.  Dynamic whole-body mobile manipulation with a torque controlled humanoid robot via impedance control laws , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Atsuo Takanishi,et al.  Online learning of humanoid robot kinematics under switching tools contexts , 2013, 2013 IEEE International Conference on Robotics and Automation.

[7]  Giulio Sandini,et al.  The iCub humanoid robot: An open-systems platform for research in cognitive development , 2010, Neural Networks.

[8]  Gabriele Nava,et al.  Stability analysis and design of momentum-based controllers for humanoid robots , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[10]  Jan Peters,et al.  Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .

[11]  Darwin G. Caldwell,et al.  Learning and Reproduction of Gestures by Imitation , 2010, IEEE Robotics & Automation Magazine.

[12]  Daniele Pucci,et al.  On-line joint limit avoidance for torque controlled robots by joint space parametrization , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[13]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[14]  Stefan Schaal,et al.  Movement reproduction and obstacle avoidance with dynamic movement primitives and potential fields , 2008, Humanoids 2008 - 8th IEEE-RAS International Conference on Humanoid Robots.

[15]  Li Chen,et al.  Hierarchical Nonholonomic Path Planning of Dual-Arm Space Robot Systems with Joint Limits , 2006, 2006 6th World Congress on Intelligent Control and Automation.

[16]  Daniele Pucci,et al.  iCub: The not-yet-finished story of building a robot child , 2017, Science Robotics.

[17]  François Chaumette,et al.  New strategies for avoiding robot joint limits: Application to visual servoing using a large projection operator , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Peters Jan,et al.  Jointly learning trajectory generation and hitting point prediction in robot table tennis , 2016 .

[19]  Rajiv V. Dubey,et al.  A weighted least-norm solution based scheme for avoiding joint limits for redundant joint manipulators , 1993, IEEE Trans. Robotics Autom..

[20]  Daniele Pucci,et al.  iCub Whole-Body Control through Force Regulation on Rigid Non-Coplanar Contacts , 2015, Front. Robot. AI.