An Interpretable Station Delay Prediction Model Based on Graph Community Neural Network and Time-Series Fuzzy Decision Tree

High-speed train delay prediction has long been an important research problem in railway dispatching. Accurate and interpretable delay prediction enables staff to implement preventive measures and scheduling decisions in advance, and guides relevant departments to cooperate in completing complex transportation tasks, thereby improving rail transit operations, service quality, and the efficiency of train operation. This article proposes a new interpretable model based on a graph community neural network and a time-series fuzzy decision tree. The model captures the influence of spatiotemporal characteristics, train community structure, and multiple factors on high-speed train station delay prediction. In addition, the time-series fuzzy decision tree, based on multiobjective optimization and reduced error pruning, mines potential decision rules to improve the model's interpretability, transparency, and reliability. Finally, we show that the proposed model outperforms seven other state-of-the-art models in prediction and that it is interpretable.
