Sequential learning unification controller from human demonstrations for robotic compliant manipulation

Abstract Robotic compliant manipulation not only contains robot motion but also embodies interaction with the environment. Frequently endowing the compliant manipulation skills to the robot by manual programming or off-line training is complicated and time-consuming. In this paper, we propose a sequential learning framework to take both kinematic profile and variable impedance parameter profile into consideration to model a unified control strategy with “motion generation” and “compliant control”. In order to acquire this unification controller efficiently, we use a sequential learning neural network to encode robot motion and a new force-based variable impedance learning algorithm to estimate varying damping and stiffness profiles in three directions. Furthermore, the state-independent stability constraints for variable impedance control are presented. The effectiveness of the proposed learning framework is validated by a set of experiments using the 4-DoF Barrett WAM.

[1]  Klaus Neumann,et al.  Learning robot motions with stable dynamical systems under diffeomorphic transformations , 2015, Robotics Auton. Syst..

[2]  Oussama Khatib,et al.  Springer Handbook of Robotics , 2007, Springer Handbooks.

[3]  Aude Billard,et al.  Online learning of varying stiffness through physical human-robot interaction , 2012, 2012 IEEE International Conference on Robotics and Automation.

[4]  Chao Xu,et al.  Fast and Stable Learning of Dynamical Systems Based on Extreme Learning Machine , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[5]  Bruno Siciliano,et al.  Variable Impedance Control of Redundant Manipulators for Intuitive Human–Robot Physical Interaction , 2015, IEEE Transactions on Robotics.

[6]  Sandra Hirche,et al.  Risk-sensitive interaction control in uncertain manipulation tasks , 2013, 2013 IEEE International Conference on Robotics and Automation.

[7]  J. Salisbury,et al.  Active stiffness control of a manipulator in cartesian coordinates , 1980, 1980 19th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[8]  Keng Peng Tee,et al.  Dynamic Movement Primitives Plus: For enhanced reproduction quality and efficient trajectory modification using truncated kernels and Local Biases , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Gordon Cheng,et al.  Humanoid Multimodal Tactile-Sensing Modules , 2011, IEEE Transactions on Robotics.

[10]  D. Serre Matrices: Theory and Applications , 2002 .

[11]  H. Gomi,et al.  Task-Dependent Viscoelasticity of Human Multijoint Arm and Its Spatial Characteristics for Interaction with Environments , 1998, The Journal of Neuroscience.

[12]  Darwin G. Caldwell,et al.  Robot motor skill coordination with EM-based Reinforcement Learning , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Stefan Schaal,et al.  Learning variable impedance control , 2011, Int. J. Robotics Res..

[14]  Paolo Dario,et al.  Soft Robot Arm Inspired by the Octopus , 2012, Adv. Robotics.

[15]  Wei He,et al.  Adaptive Fuzzy Neural Network Control for a Constrained Robot Using Impedance Learning , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[16]  Narasimhan Sundararajan,et al.  A Fast and Accurate Online Sequential Learning Algorithm for Feedforward Networks , 2006, IEEE Transactions on Neural Networks.

[17]  Darwin G. Caldwell,et al.  Imitation Learning of Positional and Force Skills Demonstrated via Kinesthetic Teaching and Haptic Input , 2011, Adv. Robotics.

[18]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[19]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[20]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[21]  Aude Billard,et al.  Learning Compliant Manipulation through Kinesthetic and Tactile Human-Robot Interaction , 2014, IEEE Transactions on Haptics.

[22]  Darwin G. Caldwell,et al.  A Method for Derivation of Robot Task-Frame Control Authority from Repeated Sensory Observations , 2017, IEEE Robotics and Automation Letters.

[23]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[24]  R. Ham,et al.  Compliant actuator designs , 2009, IEEE Robotics & Automation Magazine.

[25]  Darwin G. Caldwell,et al.  Learning-based control strategy for safe human-robot interaction exploiting task and robot redundancies , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Bruno Siciliano,et al.  Task-Space Control of Robot Manipulators With Null-Space Compliance , 2014, IEEE Transactions on Robotics.

[27]  Neville Hogan,et al.  Impedance Control: An Approach to Manipulation , 1984, 1984 American Control Conference.

[28]  Andrej Gams,et al.  Learning Compliant Movement Primitives Through Demonstration and Statistical Generalization , 2016, IEEE/ASME Transactions on Mechatronics.

[29]  Evangelos Theodorou,et al.  Tendon-Driven Variable Impedance Control Using Reinforcement Learning , 2012, Robotics: Science and Systems.

[30]  Aude Billard,et al.  Stability Considerations for Variable Impedance Control , 2016, IEEE Transactions on Robotics.

[31]  Aude Billard,et al.  Learning to Play Minigolf: A Dynamical System-Based Approach , 2012, Adv. Robotics.

[32]  Yu Kang,et al.  Human–Robot Coordination Control of Robotic Exoskeletons by Skill Transfers , 2017, IEEE Transactions on Industrial Electronics.

[33]  Sethu Vijayakumar,et al.  Optimal variable stiffness control: formulation and application to explosive movement tasks , 2012, Auton. Robots.

[34]  Chao Zeng,et al.  A Learning Framework of Adaptive Manipulative Skills From Human to Robot , 2019, IEEE Transactions on Industrial Informatics.

[35]  Cristian Secchi,et al.  A tank-based approach to impedance control with variable stiffness , 2013, 2013 IEEE International Conference on Robotics and Automation.

[36]  I W Hunter,et al.  System identification of human joint dynamics. , 1990, Critical reviews in biomedical engineering.

[37]  Shuzhi Sam Ge,et al.  Impedance Learning for Robots Interacting With Unknown Environments , 2014, IEEE Transactions on Control Systems Technology.

[38]  Nikolaos G. Tsagarakis,et al.  Tele-impedance: Teleoperation with impedance regulation using a body–machine interface , 2012, Int. J. Robotics Res..

[39]  Cheng Fang,et al.  A DMPs-Based Framework for Robot Learning and Generalization of Humanlike Variable Impedance Skills , 2018, IEEE/ASME Transactions on Mechatronics.

[40]  Weiping Li,et al.  Applied Nonlinear Control , 1991 .

[41]  Ville Kyrki,et al.  Learning compliant assembly motions from demonstration , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[42]  Giorgio Metta,et al.  Methods and Technologies for the Implementation of Large-Scale Robot Tactile Sensors , 2011, IEEE Transactions on Robotics.

[43]  Oussama Khatib,et al.  Learning potential functions from human demonstrations with encapsulated dynamic and compliant behaviors , 2017, Auton. Robots.