Evolution Strategies Learning With Variable Impedance Control for Grasping Under Uncertainty

During a robot's interaction with the environment, it is necessary to ensure the safety and robustness of the robot's movements. To improve the safety and adaptiveness of robots in performing complex movement tasks, a novel method called covariance matrix adaptation-evolution strategies (CMA-ES) for learning complex and high-dimensional motor skills is presented. Considering the complex motion model of trajectories, dynamic movement primitives (DMPs), which is a generic method for trajectories modeling in attractor landscape based on differential dynamic systems, is used to represent the robot's trajectories. CMA-ES offers a theoretical rule for updating the parameters of DMPs and a variable impedance controller, which can reduce the impact of noisy environment on the robot's movement. In this paper, we propose two hierarchies for controlling the robot: the high-level neural-dynamic network optimization for redundancy resolution in task space and the low-level CMA-ES fusing with DMPs for learning trajectories in joint space. In this paper, CMA-ES method is explored to learn variable impedance control and the performance of the proposed method in learning the robot's movements is also tested.

[1]  Zhicong Huang,et al.  Adaptive Impedance Control for an Upper Limb Robotic Exoskeleton Using Biological Signals , 2017, IEEE Transactions on Industrial Electronics.

[2]  Clément Gosselin,et al.  General Model of Human-Robot Cooperation Using a Novel Velocity Based Variable Impedance Control , 2007, Second Joint EuroHaptics Conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems (WHC'07).

[3]  Zhan Yu,et al.  Line Assisted Light Field Triangulation and Stereo Matching , 2013, 2013 IEEE International Conference on Computer Vision.

[4]  Sethu Vijayakumar,et al.  Exploiting sensorimotor stochasticity for learning control of variable impedance actuators , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[5]  Panfeng Huang,et al.  Coordinated stabilization of tumbling targets using tethered space manipulators , 2015, IEEE Transactions on Aerospace and Electronic Systems.

[6]  Fan Zhang,et al.  Dexterous Tethered Space Robot: Design, Measurement, Control, and Experiment , 2017, IEEE Transactions on Aerospace and Electronic Systems.

[7]  Nikolaus Hansen,et al.  Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[8]  Mergen H. Ghayesh,et al.  Impedance Control of an Intrinsically Compliant Parallel Ankle Rehabilitation Robot , 2016, IEEE Transactions on Industrial Electronics.

[9]  Stefan Schaal,et al.  Reinforcement Learning With Sequences of Motion Primitives for Robust Manipulation , 2012, IEEE Transactions on Robotics.

[10]  Bruno Siciliano,et al.  Variable Impedance Control of Redundant Manipulators for Intuitive Human–Robot Physical Interaction , 2015, IEEE Transactions on Robotics.

[11]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[12]  Nikolaus Hansen,et al.  Evaluating the CMA Evolution Strategy on Multimodal Test Functions , 2004, PPSN.

[13]  Shuzhi Sam Ge,et al.  A unified quadratic-programming-based dynamical system approach to joint torque optimization of physically constrained redundant manipulators , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Yizhai Zhang,et al.  Precise Angles-Only Navigation for Noncooperative Proximity Operation With Application to Tethered Space Robot , 2019, IEEE Transactions on Control Systems Technology.

[15]  Panfeng Huang,et al.  Predictive Approach for Sensorless Bimanual Teleoperation Under Random Time Delays With Adaptive Fuzzy Control , 2018, IEEE Transactions on Industrial Electronics.

[16]  Toshio Fukuda,et al.  Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator , 2017, IEEE/ASME Transactions on Mechatronics.

[17]  C. Kanzow Levenberg-Marquardt methods for constrained nonlinear equations with strong local convergence properties , 2004 .

[18]  Yunong Zhang,et al.  A Hybrid Multi-Objective Scheme Applied to Redundant Robot Manipulators , 2017, IEEE Transactions on Automation Science and Engineering.

[19]  Petros Koumoutsakos,et al.  Learning Probability Distributions in Continuous Evolutionary Algorithms - a Comparative Review , 2004, Nat. Comput..

[20]  Stefan Schaal,et al.  http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[21]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[22]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[23]  Guanglin Li,et al.  Development of Sensory-Motor Fusion-Based Manipulation and Grasping Control for a Robotic Hand-Eye System , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24]  Hans-Paul Schwefel,et al.  Evolution and optimum seeking , 1995, Sixth-generation computer technology series.