Delay prediction in container terminals: A comparison of machine learning methods

One of the most important issues in Container Terminal (CT) management is to manage adequately late arrivals. In fact, despite contractual obligations to notify the Estimated Time of Arrival (ETA) 24 hours before arrival, often ship operators have to revise it due to unexpected events such as weather conditions, delay in previous port and so on. In a daily planning scenario, this causes a series of inconveniences directly associated with the resource allocation problem. Terminal operators need to increase the accuracy of incoming demand in order to allocate more efficiently human resources, equipment and spatial resources required to satisfy the predicted demand. For planners the decision-making processes related with demand uncertainty may sometimes be highly complex without the support of suitable methodological tools. Specific models should be adopted, in a daily planning scenario, to provide a useful support tool in CTs and to help mitigating the consequences of late arrivals. In this study, using data collected in a Mediterranean Transhipment Container Terminal, we illustrate a data mining approach for predicting the level of daily alarm related to late arrivals. First, we defined three levels of daily alarm ranking the delay of arrivals. Then we obtained an estimate of the alarm level using three different Machine Learning models (Naive Bayes, Decision Trees and Random Forests) and we compared their predictive power on a test data set.

[1]  Guy N. Brock,et al.  clValid , an R package for cluster validation , 2008 .

[2]  Mario Rodríguez-Molins,et al.  A decision support system for managing combinatorial problems in container terminals , 2012, Knowl. Based Syst..

[3]  Eréndira Rendón,et al.  Internal versus External cluster validation indexes , 2011 .

[4]  Katta G. Murty,et al.  A decision support system for operations in a container terminal , 2005, Decis. Support Syst..

[5]  Halima Bahi,et al.  Data mining approach , 2012 .

[6]  Shinichi Morishita,et al.  On Classification and Regression , 1998, Discovery Science.

[7]  Luca Maria Gambardella,et al.  Simulation and Planning of an Intermodal Container Terminal , 1998, Simul..

[8]  Claudia Pani,et al.  A data mining approach to forecast late arrivals in a transhipment container terminal , 2014 .

[9]  Luca Maria Gambardella,et al.  Simulation and Forecasting in an Intermodal Container Terminal , 1996 .

[10]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[11]  Zheng Guansheng,et al.  A New Method to Alarm Large Scale of Flights Delay Based on Machine Learning , 2008, 2008 International Symposium on Knowledge Acquisition and Modeling.

[12]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[13]  Claudia Pani,et al.  Prediction of arrival times and human resources allocation for container terminal , 2011 .

[14]  Loo Hay Lee,et al.  A decision model for berth allocation under uncertainty , 2011, Eur. J. Oper. Res..

[15]  M. Fung,et al.  Forecasting Hong Kong's container throughput: an error-correction model , 2002 .

[16]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .