Automated Guided Logistics Handling Vehicle Path Routing under Multi-Task Scenarios

In recent years, the application of automated guided vehicles (AGVs) in industrial manufacture has attracted more and more attention. Based on reinforcement learning, an automated guided logistics handling vehicles routing algorithm was proposed under a multi-task scenario. First, the algorithm calculated routing paths for each task node using Q-learning to solve the “curse of dimensionality” problem. Then, with the objective of comprehensive cost considering the time window, the optimal routing sequence was obtained using the depth first search algorithm (DFS). Finally, an example is given to illustrate the algorithm.

[1]  Panos M. Pardalos,et al.  Approximate dynamic programming: solving the curses of dimensionality , 2009, Optim. Methods Softw..

[2]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[3]  Tao Zheng,et al.  AGV Path Planning based on Improved A-star Algorithm , 2019, 2019 IEEE 3rd Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC).

[4]  Hu Bin Dynamic Routing of Automated Guided Vehicles Based on Time Window , 2012 .

[5]  Tomás Lozano-Pérez,et al.  Spatial Planning: A Configuration Space Approach , 1983, IEEE Transactions on Computers.

[6]  Qiang Yang,et al.  On the design for AGVs: Modeling, path planning and localization , 2011, 2011 IEEE International Conference on Mechatronics and Automation.