Ship arrival prediction and its value on daily container terminal operation

Abstract Effective prediction of ship arrivals should provide the estimated delay or advance of arrival ships with greater accuracy, and improve the performance of container terminal operations. Therefore, taking Gangji (Yining) Container Terminal (GYCT), China, as an example, this paper resorts to data mining approaches to predict ship arrivals and explore the value of such predicted ship arrivals on the container terminal operation. First, this study applies three data mining approaches, including Back-Propagation network (BP), Classification and Regression Tree (CART) and Random Forest (RF), to estimate the delay or advance of ship arrivals using the collected data of ship arrivals. Then the predictive performance of these three approaches is compared and discussed, it is concluded that RF performs better than BP and CART, and ETA month and ship length are the most important determinants of ship arrivals in GYCT terminal. Finally, series simulation experiments are conducted to assess the value of the ship arrivals predicted by RF model on the improvement on daily operation planning of berth allocation and quay crane assignment in the GYCT terminal. And the results show that incorporating the predicted ship arrivals by RF model is beneficial to improve the performance of operation planning of GYCT terminal.

[1]  Michele Vespe,et al.  Vessel Pattern Knowledge Discovery from AIS Data: A Framework for Anomaly Detection and Route Prediction , 2013, Entropy.

[2]  Huicheng Zhou,et al.  Value of Medium-range Precipitation Forecasts in Inflow Prediction and Hydropower Optimization , 2010 .

[3]  A. A. Shabayek,et al.  A simulation model for the Kwai Chung container terminals in Hong Kong , 2002, Eur. J. Oper. Res..

[4]  Vadlamani Ravi,et al.  Soft computing system for bank performance prediction , 2008, Appl. Soft Comput..

[5]  Ebrahim Shayan,et al.  Investigation of port capacity under a new approach by computer simulation , 2002 .

[6]  Roman M. Balabin,et al.  Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data. , 2011, Analytica chimica acta.

[7]  Piotr Duda,et al.  Decision Trees for Mining Data Streams Based on the McDiarmid's Bound , 2013, IEEE Transactions on Knowledge and Data Engineering.

[8]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[9]  Claudia Pani,et al.  Prediction of late/early arrivals in container terminals – A qualitative approach , 2015, European Journal of Transport and Infrastructure Research.

[10]  A. Prasad,et al.  Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction , 2006, Ecosystems.

[11]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[12]  Tin Wee Tan,et al.  Predicting host tropism of influenza A virus proteins using random forest , 2014, BMC Medical Genomics.

[13]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[14]  Margaret G. Schmidt,et al.  Predictive soil parent material mapping at a regional-scale: a Random Forest approach. , 2014 .

[15]  Wenyuan Wang,et al.  Effect of Entrance Channel Dimensions on Berth Occupancy of Container Terminals , 2016 .

[16]  Carlos F. Daganzo,et al.  CRANE PRODUCTIVITY AND SHIP DELAY IN PORTS , 1989 .

[17]  Xiongwen Quan,et al.  Robust berth scheduling with uncertain vessel delay and handling time , 2012, Ann. Oper. Res..

[18]  H. Troy Nagle,et al.  Performance of the Levenberg–Marquardt neural network training method in electronic nose applications , 2005 .

[19]  Ying-Chan Tang,et al.  Detecting hospital fraud and claim abuse through diabetic outpatient services , 2008, Health care management science.

[20]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[21]  P.S. Sathidevi,et al.  Auditory-Based Wavelet Packet Filterbank for Speech Recognition Using Neural Network , 2007, 15th International Conference on Advanced Computing and Communications (ADCOM 2007).

[22]  Sönke Hartmann,et al.  Simulation of container ship arrivals and quay occupation , 2011 .

[23]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[24]  Jorge Puente,et al.  A genetic algorithm for robust berth allocation and quay crane assignment , 2014, Progress in Artificial Intelligence.

[25]  Ling-bo Wang,et al.  Automatic authentication and distinction of Epimedium koreanum and Epimedium wushanense with HPLC fingerprint analysis assisted by pattern recognition techniques , 2012 .

[26]  Bo-Suk Yang,et al.  Multi-step ahead direct prediction for machine condition prognosis using regression trees and neuro-fuzzy systems , 2013 .

[27]  Erhan Kozan,et al.  COMPARISON OF ANALYTICAL AND SIMULATION PLANNING MODELS OF SEAPORT CONTAINER TERMINALS , 1997 .

[28]  Vadlamani Ravi,et al.  Detecting phishing e-mails using text and data mining , 2012, 2012 IEEE International Conference on Computational Intelligence and Computing Research.

[29]  Meng Joo Er,et al.  A fast approach for automatic generation of fuzzy rules by generalized dynamic fuzzy neural networks , 2001, IEEE Trans. Fuzzy Syst..

[30]  Wenyuan Wang,et al.  Simulation-based optimization for generating the dimensions of a dredged coastal entrance channel , 2014, Simul..

[31]  Claudia Pani,et al.  A data mining approach to forecast late arrivals in a transhipment container terminal , 2014 .

[32]  Pieter H. A. J. M. van Gelder,et al.  Risk- and Simulation-Based Optimization of Channel Depths: Entrance Channel of Cam Pha Coal Port , 2008, Simul..

[33]  Á. Felicísimo,et al.  Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study , 2013, Landslides.

[34]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[35]  Gary D. Stormo,et al.  Program in Gene Function and Expression Publications and Presentations Program in Gene Function and Expression 4-2014 An improved predictive recognition model for Cys 2-His 2 zinc finger proteins , 2014 .

[36]  Asifullah Khan,et al.  Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification , 2013, Applied Intelligence.

[37]  Claudia Pani,et al.  Prediction of arrival times and human resources allocation for container terminal , 2011 .

[38]  Emrullah Demirci Simulation Modelling and Analysis of a Port Investment , 2003, Simul..

[39]  Omaima N. A. AL-Allaf,et al.  Improving the Performance of Backpropagation Neural Network Algorithm for Image Compression/Decompression System , 2010 .

[40]  Andrew Zisserman,et al.  Automatic and Efficient Human Pose Estimation for Sign Language Videos , 2013, International Journal of Computer Vision.

[41]  Claudia Pani,et al.  Delay prediction in container terminals: A comparison of machine learning methods , 2013 .

[42]  Der-Horng Lee,et al.  Integrated discrete berth allocation and quay crane scheduling in port container terminals , 2010 .

[43]  Chung-Piaw Teo,et al.  Berth management in container terminal: the template design problem , 2006, OR Spectr..

[44]  Agnieszka Malinowska Classification and regression tree theory application for assessment of building damage caused by surface deformation , 2014, Natural Hazards.

[45]  Wenyuan Wang,et al.  Study on the Ship Arrival Pattern of Container Terminals , 2013 .

[46]  Vural Aksakalli,et al.  Risk assessment in social lending via random forests , 2015, Expert Syst. Appl..

[47]  G. De’ath,et al.  CLASSIFICATION AND REGRESSION TREES: A POWERFUL YET SIMPLE TECHNIQUE FOR ECOLOGICAL DATA ANALYSIS , 2000 .

[48]  Shyam Visweswaran,et al.  Improving Classification Performance with Discretization on Biomedical Datasets , 2008, AMIA.