Forecasting primary delay recovery of high-speed railway using multiple linear regression, supporting vector machine, artificial neural network, and random forest regression

Accurate prediction of recoverable train delay can support the train dispatchers’ decision-making with timetable rescheduling and improving service reliability. In this paper, we present the results of an effort aimed to develop primary delay recovery (PDR) predictor model using train operation records from Wuhan-Guangzhou (W-G) High-speed Railway (HSR). To this end, we first identified the main variables that contribute to delay, including dwell buffer time (DBT), running buffer time (RBT), magnitude of primary delay (PDT), and individual sections’ influence. Different models are applied and calibrated to predict the PDR. The validation results on test datasets indicate that the Random Forest Regression (RFR) model outperforms other three alternative models, namely, Multiple Linear Regression (MLR), Support Vector Machine (SVM), and Artificial Neural Networks (ANN) regarding prediction accuracy measure. Specifically, the evaluation results show that when the prediction tolerance is less than 1 minute, th...

[1]  Paolo Toth,et al.  Nominal and robust train timetabling problems , 2012, Eur. J. Oper. Res..

[2]  Rob M.P. Goverde,et al.  Propagation of train delays in stations , 2002 .

[3]  Gabriele Malavasi,et al.  Simulation of stochastic elements in railway systems using self-learning processes , 2001, Eur. J. Oper. Res..

[4]  Harshad Khadilkar,et al.  Data-Enabled Stochastic Modeling for Evaluating Schedule Robustness of Railway Networks , 2017, Transp. Sci..

[5]  Ingo A. Hansen,et al.  Online train delay recognition and running time prediction , 2010, 13th International IEEE Conference on Intelligent Transportation Systems.

[6]  Rob M.P. Goverde,et al.  Estimation of train dwell time at short stops based on track occupation event data , 2015 .

[7]  Luis Cadarso,et al.  Recovery of disruptions in rapid transit networks , 2013 .

[8]  Masoud Yaghini,et al.  Railway Passenger Train Delay Prediction via Neural Network Model , 2013 .

[9]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[10]  Lee Chapman,et al.  The impacts of the 28 June 2012 storms on UK road and rail transport , 2015 .

[11]  Naoto Fukumura,et al.  Real-time freight locomotive rescheduling and uncovered train detection during disruption , 2012, Eur. J. Oper. Res..

[12]  Jan Peters,et al.  Prediction of Delays in Public Transportation using Neural Networks , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[13]  Francesco Corman,et al.  Assessment of Advanced Dispatching Measures for Recovering Disrupted Railway Traffic Situations , 2012 .

[14]  Kpotissan Adjetey-Bahun,et al.  A model to quantify the resilience of mass railway transportation systems , 2016, Reliab. Eng. Syst. Saf..

[15]  Leo G. Kroon,et al.  Reliability and Heterogeneity of Railway Services , 2006, Eur. J. Oper. Res..

[16]  Leena Suhl,et al.  A note on the online nature of the railway delay management problem , 2011, Networks.

[17]  Francesco Corman,et al.  A Review of Online Dynamic Models and Algorithms for Railway Traffic Management , 2015, IEEE Transactions on Intelligent Transportation Systems.

[18]  Pavle Kecman,et al.  Predictive modelling of running and dwell times in railway traffic , 2015, Public Transp..

[19]  Chao Wen,et al.  Predicting high-speed train operation conflicts using workflow nets and triangular fuzzy numbers , 2015 .

[20]  Yong Cui,et al.  Calibration of disturbance parameters in railway operational simulation based on reinforcement learning , 2016, J. Rail Transp. Plan. Manag..

[21]  Ulrich Weidmann,et al.  Development of a Dwell Time Calculation Model for Timetable Planning , 2008 .

[22]  Alessandro Farina,et al.  A new urban freight distribution scheme and an optimization methodology for reducing its overall cost , 2015 .

[23]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[24]  O. Lindfeldt Evaluation Of Punctuality On A Heavily UtilisedRailway Line With Mixed Traffic , 2010 .

[25]  Kariyazaki Keiji,et al.  Simulation analysis of train operation to recover knock-on delay under high-frequency intervals , 2015 .

[26]  Jouni Wallander,et al.  Data mining in rail transport delay chain analysis , 2012 .

[27]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[28]  I. A. Hansen,et al.  Evaluating Stochastic Train Process TimeDistribution Models On The Basis Of EmpiricalDetection Data , 2006 .

[29]  Dario Pacciarelli,et al.  A branch and bound algorithm for scheduling trains in a railway network , 2007, Eur. J. Oper. Res..

[30]  Paul Schonfeld,et al.  Analyzing passenger train arrival delays with support vector regression , 2015 .

[31]  Davy Janssens,et al.  Within day rescheduling microsimulation combined with macrosimulated traffic , 2014 .

[32]  E. Bosscha Big data in railway operations: using artificial neural networks to predict train delay propagation , 2016 .

[33]  Asunción P. Cucala,et al.  An integrated information model for traffic planning, operation and management of railway lines , 2004 .

[34]  Denis Borenstein,et al.  Review of real-time vehicle schedule recovery methods in transportation services , 2014, J. Sched..

[35]  Jyh-Cherng Jong,et al.  The Analysis of Train Reliability for the Taiwan High Speed Rail , 2010 .

[36]  Christian S. Mjøsund,et al.  Improving railway performance in Norway , 2013, J. Rail Transp. Plan. Manag..

[37]  Chao Wen,et al.  High-Speed Railway Train Timetable Conflict Prediction Based on Fuzzy Temporal Knowledge Reasoning , 2016 .

[38]  Liping Fu,et al.  Statistical investigation on train primary delay based on real records: evidence from Wuhan–Guangzhou HSR , 2017 .

[39]  Dario Pacciarelli,et al.  A tabu search algorithm for rerouting trains during rail operations , 2007 .