Paper Information - Research on Motion Planning of Seven Degree of Freedom Manipulator Based on DDPG

Research on Motion Planning of Seven Degree of Freedom Manipulator Based on DDPG

Traditional inverse kinematics solutions for the motion control of a seven-degree-of-freedom manipulator suffer from several problems: they demand considerable modeling skill, the matrix equations are difficult to solve, and the computational cost is high. In this paper, reinforcement learning (RL) is applied to a seven-degree-of-freedom manipulator. To cope with the large state space and continuous action space in RL, a neural network is used to map the state space to the action space. An action selection network and an action evaluation network are constructed within the Actor-Critic framework, and the action selection policy is learned through RL training based on DDPG (Deep Deterministic Policy Gradient). Finally, the effectiveness of the method is tested on a Baxter robot in the Gazebo simulator.
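The actor-critic structure described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the state layout (seven joint angles plus a 3D target position), the layer sizes, the command bound `MAX_DELTA`, and all function names are assumptions made for illustration. It shows only the forward passes of the action selection (actor) and action evaluation (critic) networks, the bootstrapped target used to train the critic, and the soft target-network update used by DDPG; the gradient steps are omitted.

```python
import math
import random

STATE_DIM = 10    # assumed: 7 joint angles + 3D target position
ACTION_DIM = 7    # one command per joint
MAX_DELTA = 0.1   # assumed bound on each joint command (rad)


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) with small uniform random weights."""
    w = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b


def dense(layer, x, activation):
    """Apply one fully connected layer to the input vector x."""
    w, b = layer
    return [activation(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(w, b)]


def relu(v):
    return max(0.0, v)


def identity(v):
    return v


def make_actor(rng, hidden=16):
    return [init_layer(STATE_DIM, hidden, rng),
            init_layer(hidden, ACTION_DIM, rng)]


def make_critic(rng, hidden=16):
    return [init_layer(STATE_DIM + ACTION_DIM, hidden, rng),
            init_layer(hidden, 1, rng)]


def actor_forward(actor, state):
    """Action selection network: map a state to a bounded continuous action."""
    h = dense(actor[0], state, relu)
    return [MAX_DELTA * a for a in dense(actor[1], h, math.tanh)]


def critic_forward(critic, state, action):
    """Action evaluation network: estimate Q(state, action) as a scalar."""
    h = dense(critic[0], state + action, relu)
    return dense(critic[1], h, identity)[0]


def td_target(reward, next_state, done, target_actor, target_critic, gamma=0.99):
    """Bootstrapped critic target: r + gamma * Q'(s', mu'(s'))."""
    if done:
        return reward
    next_action = actor_forward(target_actor, next_state)
    return reward + gamma * critic_forward(target_critic, next_state, next_action)


def soft_update(target, source, tau=0.001):
    """Move target weights toward source: theta' = tau*theta + (1 - tau)*theta'."""
    for (tw, tb), (sw, sb) in zip(target, source):
        for t_row, s_row in zip(tw, sw):
            for j in range(len(t_row)):
                t_row[j] = tau * s_row[j] + (1 - tau) * t_row[j]
        for j in range(len(tb)):
            tb[j] = tau * sb[j] + (1 - tau) * tb[j]
```

The `tanh` output scaled by `MAX_DELTA` keeps every joint command within a fixed range, which is one common way to handle a continuous action space; the paper may bound its actions differently.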
