A reinforcement learning based approach for a multiple-load carrier scheduling problem

This paper studies the problem of scheduling a multiple-load carrier which is used to deliver parts to line-side buffers of a general assembly (GA) line. In order to maximize the reward of the GA line, both the throughput of the GA line and the material handling distance are considered as scheduling criteria. After formulating the scheduling problem as a reinforcement learning (RL) problem by defining state features, actions and the reward function, we develop a Q($$\lambda $$λ) RL algorithm based scheduling approach. To improve performance, forecasted information such as quantities of parts required in a look-ahead horizon is used when we define state features and actions in formulation. Other than applying traditional material handling request generating policy, we use a look-ahead based request generating policy with which material handling requests are generated based not only on current buffer information but also on future part requirement information. Moreover, by utilizing a heuristic dispatching algorithm, the approach is able to handle future requests as well as existing ones. To evaluate the performance of the approach, we conduct simulation experiments to compare the proposed approach with other approaches. Numerical results demonstrate that the policies obtained by the RL approach outperform other approaches.

[1]  Yuehwern Yih,et al.  A multiple-attribute method for concurrently solving the pickup-dispatching problem and the load-selection problem of multiple-load AGVs , 2012 .

[2]  L. N. Van Wassenhove,et al.  Analysis of Scheduling Rules for an FMS , 1990 .

[3]  Yugang Yu,et al.  Performance evaluation of dynamic scheduling approaches in vehicle-based internal transport systems , 2010 .

[4]  M. Neuts A General Class of Bulk Queues with Poisson Input , 1967 .

[5]  Subhash C. Sarin,et al.  A survey of dispatching rules for operational control in wafer fabrication , 2011 .

[6]  Kenn Steger-Jensen,et al.  Scheduling a single mobile robot for part-feeding tasks of production lines , 2014, J. Intell. Manuf..

[7]  Lifeng Xi,et al.  A multiple-criteria real-time scheduling approach for multiple-load carriers subject to LIFO loading constraints , 2011 .

[8]  Reza Tavakkoli-Moghaddam,et al.  Vehicle routing scheduling using an enhanced hybrid optimization approach , 2010, Journal of Intelligent Manufacturing.

[9]  Hans-Otto Günther,et al.  Dispatching multi-load AGVs in highly automated seaport container terminals , 2004, OR Spectr..

[10]  R. C. Leachman,et al.  An improved methodology for real-time production decisions at batch-process work stations , 1993 .

[11]  Kap Hwan Kim,et al.  Routing automated guided vehicles in container terminals through the Q-learning technique , 2011, Logist. Res..

[12]  René de Koster,et al.  Testing and classifying vehicle dispatching rules in three real-world settings , 2004 .

[13]  Don T. Phillips,et al.  Control of Multiproduct Bulk Service Diffusion/Oxidation Processes , 1992 .

[14]  René M. B. M. de Koster,et al.  A review of design and control of automated guided vehicle systems , 2006, Eur. J. Oper. Res..

[15]  Mufit Ozden,et al.  A simulation study of multiple-load-carrying automated guided vehicles in a flexible manufacturing system , 1988 .

[16]  Iris F. A. Vis,et al.  Survey of research in the design and control of automated guided vehicle systems , 2006, Eur. J. Oper. Res..

[17]  Martin A. Riedmiller,et al.  Distributed policy search reinforcement learning for job-shop scheduling tasks , 2012 .

[18]  Na Li,et al.  Semiconductor final test scheduling with Sarsa(λ, k) algorithm , 2011, Eur. J. Oper. Res..

[19]  Tao Geng,et al.  Multiagent AGVs dispatching system using multilevel decisions method , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).

[20]  Weiping Wang,et al.  Minimizing mean weighted tardiness in unrelated parallel machine scheduling with reinforcement learning , 2012, Comput. Oper. Res..

[21]  Chia-Nan Wang,et al.  The heuristic preemptive dispatching method of material transportation system in 300 mm semiconductor fabrication , 2012, J. Intell. Manuf..

[22]  Lionel Amodeo,et al.  Particle swarm optimization algorithm for a vehicle routing problem with heterogeneous fleet, mixed backhauls, and time windows , 2012, Journal of Intelligent Manufacturing.

[23]  Yael Edan,et al.  Evaluation of automatic guided vehicle systems , 2009 .

[24]  Jean-Yves Potvin,et al.  Neural networks for automated vehicle dispatching , 1992, Comput. Oper. Res..

[25]  Frank L. Lewis,et al.  Multi-commodity flow dynamic resource assignment and matrix-based job dispatching for multi-relay transfer in complex material handling systems (MHS) , 2014, J. Intell. Manuf..

[26]  Heloisa A. Camargo,et al.  A Genetic Fuzzy System for Defining a Reactive Dispatching Rule for AGVs , 2006, 2006 IEEE International Conference on Systems, Man and Cybernetics.

[27]  Jing Peng,et al.  Incremental multi-step Q-learning , 1994, Machine Learning.

[28]  Yi-Chi Wang,et al.  Application of reinforcement learning for agent-based production scheduling , 2005, Eng. Appl. Artif. Intell..

[29]  Luis G. Occeña,et al.  Analysis of the AGV loading capacity in a JIT environment , 1993 .

[30]  Yuehwern Yih,et al.  Selection of dispatching rules on multiple dispatching decision points in real-time scheduling of a semiconductor wafer fabrication system , 2003 .

[31]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[32]  David Sinriech,et al.  A dynamic scheduling algorithm for a multiple-load multiple-carrier system , 2002 .

[33]  Lifeng Xi,et al.  A support vector machine based scheduling approach for a material handling system , 2010, 2010 Sixth International Conference on Natural Computation.

[34]  Hark Hwang,et al.  A DISPATCHING ALGORITHM FOR MULTIPLE-LOAD AGVS USING A FUZZY DECISION-MAKING METHOD IN A JOB SHOP ENVIRONMENT , 2001 .

[35]  Suresh K. Khator,et al.  Operational control of multi-load vehicles in an automated guided vehicle system , 1993 .