Comparison of end-to-end and hybrid deep reinforcement learning strategies for controlling cable-driven parallel robots

Abstract Deep reinforcement learning (DRL) has been proven effective in learning policies of high-dimensional states and actions. Recently, a variety of robot manipulation tasks have been accomplished using end-to-end DRL strategies. An end-to-end DRL strategy accomplishes a robot manipulation task as a black box. On the other hand, a robot manipulation task can be divided into multiple subtasks and accomplished by non-learning-based approaches. A hybrid DRL strategy integrates DRL with non-learning-based approaches. The hybrid DRL strategy accomplishes some subtasks of a robot manipulation task by DRL and the rest subtasks by non-learning-based approaches. However, the effects of integrating DRL with non-learning-based approaches on the learning speed and the robustness of DRL to model uncertainties have not been discussed. In this study, an end-to-end DRL strategy and a hybrid DRL strategy are developed and compared in controlling a cable-driven parallel robot. This study shows that, by integrating DRL with non-learning-based approaches, the hybrid DRL strategy learns faster and is more robust to model uncertainties than the end-to-end DRL strategy. This study demonstrates that, by taking advantages of both learning and non-learning-based approaches, the hybrid DRL strategy provides an alternative to accomplish a robot manipulation task.

[1]  M. M. Hossain,et al.  Implementation of Q learning and deep Q network for controlling a self balancing robot model , 2018, Robotics and biomimetics.

[2]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[3]  So-Ryeok Oh,et al.  Cable suspended planar robots with redundant cables: controllers with positive tensions , 2005, IEEE Transactions on Robotics.

[4]  Ou Ma,et al.  Vibration analysis of cable-driven parallel manipulators , 2009 .

[5]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[6]  F. Bekes,et al.  Motion control of a tendon-based parallel manipulator using optimal tension distribution , 2004, IEEE/ASME Transactions on Mechatronics.

[7]  Ahmad Kalhor,et al.  Experimental study on the kinematic control of a cable suspended parallel robot for object tracking purpose , 2018 .

[8]  Jean-Pierre Merlet,et al.  Interval-Analysis-Based Determination of the Wrench-Feasible Workspace of Parallel Cable-Driven Robots , 2011, IEEE Transactions on Robotics.

[9]  Sergey Levine,et al.  Deep reinforcement learning for tensegrity robot locomotion , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[10]  Sergey Levine,et al.  Guided Policy Search via Approximate Mirror Descent , 2016, NIPS.

[11]  Zhan Li,et al.  Training a robust reinforcement learning controller for the uncertain system based on policy gradient method , 2018, Neurocomputing.

[12]  Clément Gosselin,et al.  Determination and Management of Cable Interferences Between Two 6-DOF Foot Platforms in a Cable-Driven Locomotion Interface , 2009, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Preben Alstrøm,et al.  Learning to Drive a Bicycle Using Reinforcement Learning and Shaping , 1998, ICML.

[15]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[16]  Saeed Behzadipour,et al.  Stiffness of Cable-based Parallel Manipulators With Application to Stability Analysis , 2006 .

[17]  Sergey Levine,et al.  Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[19]  Andrew Howard,et al.  Design and use paradigms for Gazebo, an open-source multi-robot simulator , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[20]  Gursel Alici,et al.  Review on Design and Control Aspects of Robotic Shoulder Rehabilitation Orthoses , 2017, IEEE Transactions on Human-Machine Systems.

[21]  Amir Khajepour,et al.  Kinematically-Constrained Redundant Cable-Driven Parallel Robots: Modeling, Redundancy Analysis, and Stiffness Optimization , 2017, IEEE/ASME Transactions on Mechatronics.

[22]  Sergey Levine,et al.  End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[23]  Takamitsu Matsubara,et al.  Deep reinforcement learning with smooth policy update: Application to robotic cloth manipulation , 2019, Robotics Auton. Syst..

[24]  Imme Ebert-Uphoff,et al.  Wrench-feasible workspace generation for cable-driven robots , 2006, IEEE Transactions on Robotics.

[25]  Yuval Tassa,et al.  MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Ou Ma,et al.  Force-closure analysis of 6-DOF cable manipulators with seven or more cables , 2009, Robotica.

[27]  Hamid D. Taghirad,et al.  Robust PID control of fully-constrained cable driven parallel robots , 2014 .

[28]  Anastasios Tefas,et al.  Deep reinforcement learning for controlling frontal person close-up shooting , 2019, Neurocomputing.

[29]  Manuel Graña,et al.  Variable speed wind turbine controller adaptation by reinforcement learning , 2016, Integr. Comput. Aided Eng..

[30]  Hamid D. Taghirad,et al.  Adaptive robust control of fully-constrained cable driven parallel robots , 2015 .

[31]  Saeed Behzadipour,et al.  Antagonistic variable stiffness elements , 2009 .

[32]  Guilin Yang,et al.  A generic force-closure analysis algorithm for cable-driven parallel manipulators , 2011 .

[33]  Jacques Gangloff,et al.  Active vibration canceling of a cable-driven parallel robot using reaction wheels , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[34]  Meng Wei,et al.  Robot skill acquisition in assembly process using deep reinforcement learning , 2019, Neurocomputing.