论文信息 - Evolution Strategies Learning With Variable Impedance Control for Grasping Under Uncertainty

Evolution Strategies Learning With Variable Impedance Control for Grasping Under Uncertainty

During a robot's interaction with the environment, it is necessary to ensure the safety and robustness of the robot's movements. To improve the safety and adaptiveness of robots in performing complex movement tasks, a novel method called covariance matrix adaptation-evolution strategies (CMA-ES) for learning complex and high-dimensional motor skills is presented. Considering the complex motion model of trajectories, dynamic movement primitives (DMPs), which is a generic method for trajectories modeling in attractor landscape based on differential dynamic systems, is used to represent the robot's trajectories. CMA-ES offers a theoretical rule for updating the parameters of DMPs and a variable impedance controller, which can reduce the impact of noisy environment on the robot's movement. In this paper, we propose two hierarchies for controlling the robot: the high-level neural-dynamic network optimization for redundancy resolution in task space and the low-level CMA-ES fusing with DMPs for learning trajectories in joint space. In this paper, CMA-ES method is explored to learn variable impedance control and the performance of the proposed method in learning the robot's movements is also tested.

[1] Zhicong Huang,et al. Adaptive Impedance Control for an Upper Limb Robotic Exoskeleton Using Biological Signals , 2017, IEEE Transactions on Industrial Electronics.

[2] Clément Gosselin,et al. General Model of Human-Robot Cooperation Using a Novel Velocity Based Variable Impedance Control , 2007, Second Joint EuroHaptics Conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems (WHC'07).

[3] Zhan Yu,et al. Line Assisted Light Field Triangulation and Stereo Matching , 2013, 2013 IEEE International Conference on Computer Vision.

[4] Sethu Vijayakumar,et al. Exploiting sensorimotor stochasticity for learning control of variable impedance actuators , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[5] Panfeng Huang,et al. Coordinated stabilization of tumbling targets using tethered space manipulators , 2015, IEEE Transactions on Aerospace and Electronic Systems.

[6] Fan Zhang,et al. Dexterous Tethered Space Robot: Design, Measurement, Control, and Experiment , 2017, IEEE Transactions on Aerospace and Electronic Systems.

[7] Nikolaus Hansen,et al. Completely Derandomized Self-Adaptation in Evolution Strategies , 2001, Evolutionary Computation.

[8] Mergen H. Ghayesh,et al. Impedance Control of an Intrinsically Compliant Parallel Ankle Rehabilitation Robot , 2016, IEEE Transactions on Industrial Electronics.

[9] Stefan Schaal,et al. Reinforcement Learning With Sequences of Motion Primitives for Robust Manipulation , 2012, IEEE Transactions on Robotics.

[10] Bruno Siciliano,et al. Variable Impedance Control of Redundant Manipulators for Intuitive Human–Robot Physical Interaction , 2015, IEEE Transactions on Robotics.

[11] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[12] Nikolaus Hansen,et al. Evaluating the CMA Evolution Strategy on Multimodal Test Functions , 2004, PPSN.

[13] Shuzhi Sam Ge,et al. A unified quadratic-programming-based dynamical system approach to joint torque optimization of physically constrained redundant manipulators , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14] Yizhai Zhang,et al. Precise Angles-Only Navigation for Noncooperative Proximity Operation With Application to Tethered Space Robot , 2019, IEEE Transactions on Control Systems Technology.

[15] Panfeng Huang,et al. Predictive Approach for Sensorless Bimanual Teleoperation Under Random Time Delays With Adaptive Fuzzy Control , 2018, IEEE Transactions on Industrial Electronics.

[16] Toshio Fukuda,et al. Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator , 2017, IEEE/ASME Transactions on Mechatronics.

[17] C. Kanzow. Levenberg-Marquardt methods for constrained nonlinear equations with strong local convergence properties , 2004 .

[18] Yunong Zhang,et al. A Hybrid Multi-Objective Scheme Applied to Redundant Robot Manipulators , 2017, IEEE Transactions on Automation Science and Engineering.

[19] Petros Koumoutsakos,et al. Learning Probability Distributions in Continuous Evolutionary Algorithms - a Comparative Review , 2004, Nat. Comput..

[20] Stefan Schaal,et al. http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[21] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[22] Jun Nakanishi,et al. Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[23] Guanglin Li,et al. Development of Sensory-Motor Fusion-Based Manipulation and Grasping Control for a Robotic Hand-Eye System , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24] Hans-Paul Schwefel,et al. Evolution and optimum seeking , 1995, Sixth-generation computer technology series.