论文信息 - Task Planning in “Block World” with Deep Reinforcement Learning

Task Planning in “Block World” with Deep Reinforcement Learning

At the moment reinforcement learning have advanced significantly with discovering new techniques and instruments for training. This paper is devoted to the application convolutional and recurrent neural networks in the task of planning with reinforcement learning problem. The aim of the work is to check whether the neural networks are fit for this problem. During the experiments in a block environment the task was to move blocks to obtain the final arrangement which was the target. Significant part of the problem is connected with the determining on the reward function and how the results are depending in reward’s calculation. The current results show that without modifying the initial problem into more straightforward ones neural networks didn’t demonstrate stable learning process. In the paper a modified reward function with sub-targets and euclidian reward calculation was used for more precise reward determination. Results have shown that none of the tested architectures were not able to achieve goal.

Edward Ayunts | Alekasndr I. Panov

[1] Gordon Cheng,et al. Humanoid robot learning and game playing using PC-based vision , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[3] Sergey Levine,et al. Deep visual foresight for planning robot motion , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4] Alex Graves,et al. Recurrent Models of Visual Attention , 2014, NIPS.