Robot Arm Control Method of Moving Below Object Based on Deep Reinforcement Learning

The existing robot arm control system has long commissioning time and the control system has poor scope of application. In this paper, the Deep Deterministic Policy Gradient (DDPG) algorithm has been adopted and adapted to control the robot arm to move below the object at any position, thereby enhancing the flexibility of the control algorithm and shortening the adjusting time. In addition, to address the problem that physical production line cannot be utilized directly or provide sufficient data for training deep reinforcement learning agent, this paper constructs a virtual model containing both the robot arm and the object as training environment for the agent. Simulation experiment has been performed with state variables and reward properly designed. As is shown by the results, the control agent trained in this paper show good performance in controlling the robot arm, which in turn confirms the effectiveness of the training algorithm with effective data support of the constructed simulation environment.

[1]  Jae-Bok Song,et al.  Null space motion control of a redundant robot arm using matrix augmentation and saturation method , 2014 .

[2]  Guang-Hong Yang,et al.  Adaptive decentralized control for a class of interconnected nonlinear systems via backstepping approach and graph theory , 2017, Autom..

[3]  Changyin Sun,et al.  Neural Network Control of a Robotic Manipulator With Input Deadzone and Output Constraint , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[4]  Oussama Khatib,et al.  Motion control of redundant robots under joint constraints: Saturation in the Null Space , 2012, 2012 IEEE International Conference on Robotics and Automation.

[5]  Xiaowei Yu,et al.  Adaptive Backstepping Quantized Control for a Class of Nonlinear Systems , 2017, IEEE Transactions on Automatic Control.

[6]  Yunong Zhang,et al.  Physical-limits-constrained minimum velocity norm coordinating scheme for wheeled mobile redundant manipulators , 2014, Robotica.

[7]  Qiang Wen-yi,et al.  Study on Neural Network Adaptive Control Method for Uncertain Space Manipulator , 2010 .

[8]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[9]  Jun Chen,et al.  Robust Adaptive Neural-Fuzzy Network Tracking Control for Robot Manipulator , 2014, Int. J. Comput. Commun. Control.

[10]  Wei-Chen Wang,et al.  Robust adaptive position and force controller design of robot manipulator using fuzzy neural networks , 2016 .

[11]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[12]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[13]  Gang Wu,et al.  Path planning algorithm for bending robots , 2009, 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[14]  Shaocheng Tong,et al.  Adaptive Fuzzy Output-Feedback Stabilization Control for a Class of Switched Nonstrict-Feedback Nonlinear Systems , 2017, IEEE Transactions on Cybernetics.

[15]  Hao Wang,et al.  Fuzzy tracking adaptive control of discrete-time switched nonlinear systems , 2017, Fuzzy Sets Syst..