A deep learning approach for multi-attribute data: A study of train delay prediction in railway systems

Abstract Dynamical systems that contain moving objects generate multi-attribute data, including static, time-series, and spatiotemporal formats. The diversity of the data formats creates challenges for the accurate modeling of these systems, for example, the state/location/trajectory prediction of moving objects. We developed a deep learning (DL) approach that combines 3-dimensional convolutional neural networks (3D CNN), long short-term memory (LSTM) recurrent neural network, and fully-connected neural network (FCNN) architectures to address this problem. The proposed model, named CLF-Net, uses individual factors with different attributes as input to achieve better predictions. The spatiotemporal features are fed into the 3D CNN, the time-series variables are fed into the LSTM, and the non-time-series factors are fed into the FCNN, respectively. A case study of train delay prediction for four railway lines with different operational features shows that the CLF-Net outperforms conventional machine learning models and the state-of-the-art DL models with regard to the performance metrics of the root mean squared error and mean absolute error.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[3]  A. Govardhan,et al.  SPATIOTEMPORAL DATA MINING : ISSUES , TASKS AND APPLICATIONS , 2012 .

[4]  Tao Lu,et al.  Multi-Memory Convolutional Neural Network for Video Super-Resolution , 2019, IEEE Transactions on Image Processing.

[5]  Xindong Wu,et al.  10 Challenging Problems in Data Mining Research , 2006, Int. J. Inf. Technol. Decis. Mak..

[6]  Shian-Shyong Tseng,et al.  Discovering Traffic Bottlenecks in an Urban Network by Spatiotemporal Data Mining on Location-Based Services , 2011, IEEE Transactions on Intelligent Transportation Systems.

[7]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Xiuwen Yi,et al.  DNN-based prediction model for spatio-temporal data , 2016, SIGSPATIAL/GIS.

[9]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .

[10]  Stephen Grossberg,et al.  Recurrent neural networks , 2013, Scholarpedia.

[11]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[12]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[13]  Donggen Wang,et al.  A spatio-temporal data model for activity-based transport demand modelling , 2001, Int. J. Geogr. Inf. Sci..

[14]  Lei Shu,et al.  Reality mining: A prediction algorithm for disease dynamics based on mobile big data , 2017, Inf. Sci..

[15]  Yunpeng Wang,et al.  Spatiotemporal Recurrent Convolutional Networks for Traffic Prediction in Transportation Networks , 2017, Sensors.

[16]  P. S. Sastry,et al.  A survey of temporal data mining , 2006 .

[17]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[18]  Tak-Chung Fu,et al.  A review on time series data mining , 2011, Eng. Appl. Artif. Intell..

[19]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[20]  Silvio Savarese,et al.  Structural-RNN: Deep Learning on Spatio-Temporal Graphs , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[22]  Ayoub Ait Lahcen,et al.  An overview of big data opportunities, applications and tools , 2015, 2015 Intelligent Systems and Computer Vision (ISCV).

[23]  Witold Pedrycz,et al.  Clustering Spatiotemporal Data: An Augmented Fuzzy C-Means , 2013, IEEE Transactions on Fuzzy Systems.

[24]  Daniel Svozil,et al.  Introduction to multi-layer feed-forward neural networks , 1997 .

[25]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  James Bailey,et al.  An effective and versatile distance measure for spatiotemporal trajectories , 2019, Data Mining and Knowledge Discovery.

[27]  Ying Wah Teh,et al.  Time-series clustering - A decade review , 2015, Inf. Syst..

[28]  Kirsi Virrantaus,et al.  Space–time density of trajectories: exploring spatio-temporal patterns in movement data , 2010, Int. J. Geogr. Inf. Sci..

[29]  Yu Zhang,et al.  Very deep convolutional networks for end-to-end speech recognition , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[30]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[31]  Sheldon M. Ross Introduction to Probability Models. , 1995 .

[32]  Ranga Raju Vatsavai,et al.  Spatiotemporal data mining in the era of big spatial data: algorithms and applications , 2012, BigSpatial '12.

[33]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[35]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[36]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[37]  Xi Wang,et al.  Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification , 2015, ACM Multimedia.

[38]  Geeta Sikka,et al.  Recent Techniques of Clustering of Time Series Data: A Survey , 2012 .

[39]  Jonathan R. Stroud,et al.  Dynamic models for spatiotemporal data , 2001 .

[40]  Patricia Scully,et al.  Deep Neural Networks for Learning Spatio-Temporal Features From Tomography Sensors , 2018, IEEE Transactions on Industrial Electronics.

[41]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .