A method integrating simulation and reinforcement learning for operation scheduling in container terminals

The objective of operation scheduling in container terminals is to determine a schedule that minimizes time for loading or unloading a given set of containers. This paper presents a method integrating reinforcement learning and simulation to optimize operation scheduling in container terminals. The introduced method uses a simulation model to construct the system environment while the Q-learning algorithm (reinforcement learning algorithm) is applied to learn optimal dispatching rules for different equipment (e.g. yard cranes, yard trailers). The optimal scheduling scheme is obtained by the interaction of the Q-learning algorithm and simulation environment. To evaluate the effectiveness of the proposed method, a lower bound is calculated considering the characteristics of the scheduling problem in container terminals. Finally, numerical experiments are provided to illustrate the validity of the proposed method.

[1]  Henry Y. K. Lau,et al.  Integrated scheduling of handling equipment at automated container terminals , 2008, Ann. Oper. Res..

[2]  Qingcheng Zeng,et al.  Integrating simulation and optimization to schedule loading operations in container terminals , 2009, Comput. Oper. Res..

[3]  Anne Goodchild,et al.  Crane double cycling in container ports: Planning methods and evaluation , 2007 .

[4]  Qingcheng Zeng,et al.  Models and algorithms for multi-crane oriented scheduling method in container terminals , 2009 .

[5]  Hark Hwang,et al.  Sequencing delivery and receiving operations for yard cranes in port container terminals , 2003 .

[6]  A. A. Shabayek,et al.  A simulation model for the Kwai Chung container terminals in Hong Kong , 2002, Eur. J. Oper. Res..

[7]  Richard J. Linn,et al.  Rubber tired gantry crane deployment for container yard operation , 2003, Comput. Ind. Eng..

[8]  Carlos F. Daganzo,et al.  THE CRANE SCHEDULING PROBLEM , 1989 .

[9]  Kap Hwan Kim,et al.  A crane scheduling method for port container terminals , 2004, Eur. J. Oper. Res..

[10]  Maurizio Bielli,et al.  Object oriented model for container terminal distributed simulation , 2006, Eur. J. Oper. Res..

[11]  Abdelhakim Artiba,et al.  Integrating simulation and optimization to schedule a hybrid flow shop with maintenance constraints , 2004, Comput. Ind. Eng..

[12]  Lifeng Xi,et al.  A tabu search algorithm for the integrated scheduling problem of container handling systems in a maritime terminal , 2007, Eur. J. Oper. Res..

[13]  Won Young Yun,et al.  A simulation model for container-terminal operation analysis using an object-oriented approach , 1999 .

[14]  Zhihong Jin,et al.  Metaheuristic algorithms for the multistage hybrid flowshop scheduling problem , 2006 .

[15]  W. C. Ng,et al.  Crane scheduling in container yards with inter-crane interference , 2005, Eur. J. Oper. Res..

[16]  Erhan Kozan,et al.  Genetic algorithms to schedule container transfers at multimodal terminals , 1999 .

[17]  Kap Hwan Kim,et al.  A note on a dynamic space-allocation method for outbound containers , 2003, Eur. J. Oper. Res..

[18]  Richard J. Linn,et al.  Dynamic crane deployment in container storage yards , 2002 .

[19]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[20]  Mehmet Emin Aydin,et al.  Dynamic job-shop scheduling using reinforcement learning agents , 2000, Robotics Auton. Syst..

[21]  Petros A. Ioannou,et al.  A comparison of different AGV dispatching rules in an automated container terminal , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[22]  Richard J. Linn,et al.  Storage space allocation in container terminals , 2003 .

[23]  Akio Imai,et al.  Yard trailer routing at a maritime container terminal , 2005 .

[24]  Lixin Miao,et al.  Quay crane scheduling with non-interference constraints in port container terminals , 2008 .

[25]  Martin W. P. Savelsbergh,et al.  Minimum Vehicle Fleet Size Under Time-Window Constraints at a Container Terminal , 2005, Transp. Sci..

[26]  Ebru K. Bish,et al.  A multiple-crane-constrained scheduling problem in a container terminal , 2003, Eur. J. Oper. Res..

[27]  Qiang Meng,et al.  Scheduling of two-transtainer systems for loading outbound containers in port container terminals with simulated annealing algorithm , 2007 .

[28]  D. Santos,et al.  Global lower bounds for flow shops with multiple processors , 1995 .

[29]  Yi-Chi Wang,et al.  Learning policies for single machine job dispatching , 2004 .