Forecasting Transportation Network Speed Using Deep Capsule Networks With Nested LSTM Models

Accurate and reliable traffic forecasting for complex transportation networks is vital to modern transportation management. The task is particularly challenging because of the complicated spatial dependencies among roadway links and the dynamic temporal patterns of traffic states. To address these challenges, we propose a new capsule network (CapsNet) to extract the spatial features of traffic networks and use a nested long short-term memory (NLSTM) structure to capture the hierarchical temporal dependencies in traffic sequence data. We also propose a framework for network-level traffic forecasting that connects CapsNet and NLSTM sequentially. Based on our literature review, this study is the first to adopt CapsNet and NLSTM in the field of traffic forecasting. An experiment on a Beijing transportation network with 278 links shows that the proposed framework, which can capture complicated spatiotemporal traffic patterns, outperforms multiple state-of-the-art traffic forecasting baseline models. The superiority and feasibility of CapsNet and NLSTM are also demonstrated, respectively, by visualizing and quantitatively evaluating the experimental results.
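To make the nested LSTM idea more concrete, the following is a minimal NumPy sketch of a single NLSTM time step. It assumes the formulation commonly given for nested LSTMs, in which the additive memory update of a plain LSTM is replaced by an inner LSTM; the abstract does not give the paper's exact equations, so this may differ from the authors' implementation. All names (`init_gates`, `nlstm_step`, `run_sequence`), the hidden size, and the random weights are hypothetical, and the input features stand in for whatever the spatial extractor would produce.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_gates(input_dim, hidden_dim, rng):
    """Random (untrained) weights for the i, f, o, g gates of one LSTM level."""
    W = rng.normal(0.0, 0.1, size=(input_dim + hidden_dim, 4 * hidden_dim))
    b = np.zeros(4 * hidden_dim)
    return W, b

def gates(x, h, W, b):
    """Compute input, forget, output gates and the candidate value."""
    z = np.concatenate([x, h]) @ W + b
    i, f, o, g = np.split(z, 4)
    return sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)

def nlstm_step(x, h_prev, c_prev, inner_c_prev, outer, inner):
    """One nested LSTM step (assumed formulation, see lead-in)."""
    # Outer gates are computed exactly as in a plain LSTM.
    i, f, o, g = gates(x, h_prev, *outer)
    # A plain LSTM would set c = f * c_prev + i * g.
    # Here the two terms are instead fed to an inner LSTM:
    # i * g acts as its input and f * c_prev as its hidden state.
    inner_x = i * g
    inner_h = f * c_prev
    ii, fi, oi, gi = gates(inner_x, inner_h, *inner)
    inner_c = fi * inner_c_prev + ii * gi
    # The inner LSTM's hidden output becomes the outer memory cell.
    c = oi * np.tanh(inner_c)
    h = o * np.tanh(c)
    return h, c, inner_c

def run_sequence(features, hidden_dim=16, seed=0):
    """Run the cell over a (time_steps, feature_dim) array; return the last h."""
    rng = np.random.default_rng(seed)
    outer = init_gates(features.shape[1], hidden_dim, rng)
    inner = init_gates(hidden_dim, hidden_dim, rng)
    h = np.zeros(hidden_dim)
    c = np.zeros(hidden_dim)
    inner_c = np.zeros(hidden_dim)
    for x in features:
        h, c, inner_c = nlstm_step(x, h, c, inner_c, outer, inner)
    return h
```

As a usage illustration, `run_sequence(np.zeros((12, 278)))` would process 12 time steps of a 278-dimensional feature vector, one value per link in the Beijing network mentioned above; the 12-step window is an arbitrary choice for the example. In the proposed framework the NLSTM would receive CapsNet outputs rather than raw per-link values, and the weights would be learned rather than sampled.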
