Path Finding Algorithms for Autonomous Robots Based on Reinforcement Learning

449 www.ijarcet.org  Normally, the problem of path finding is solved by classical algorithms, such as the Dijkstra, Bellman-Ford,Johnsonalgorithm. However, the classical path finding algorithms cannot handle the problem of dynamic environment in the realworld.This paperintroducesmodelforautonomous robot path finding based onreinforcement learning algorithms.We describe the basic concept, principle and the method of reinforcement learning algorithms, and do some simulation experiments in order to evaluate the effectivenessamong different reinforcement learning algorithms.Simulation results show that Q-learning algorithm does not work well when the environment has changed constantly and there are too many obstacles.Both SARSA algorithms and Q-learning algorithms are relatively slow in convergence.Meanwhile, Dyna-Q algorithm determines the path of the robot relatively fast and the stability rates are high.

[1]  Richard S. Sutton,et al.  Planning by Incremental Dynamic Programming , 1991, ML.

[2]  Mahesan Niranjan,et al.  On-line Q-learning using connectionist systems , 1994 .

[3]  Il Hong Suh,et al.  A reinforcement learning approach involving a shortest path finding algorithm , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[4]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5]  O Hachour,et al.  PATH PLANNING OF AUTONOMOUS MOBILE ROBOT , 2008 .

[6]  Yujie Cui,et al.  Dynamic path planning for mobile robot based on genetic algorithm in unknown environment , 2010, 2010 Chinese Control and Decision Conference.

[7]  S. Parasuraman,et al.  Dynamic path planning algorithm in mobile robot navigation , 2011, 2011 IEEE Symposium on Industrial Electronics and Applications.

[8]  V. Madhu Babu,et al.  An autonomous path finding robot using Q-learning , 2016, 2016 10th International Conference on Intelligent Systems and Control (ISCO).