Sequence to sequence learning with attention mechanism for short-term passenger flow prediction in large-scale metro system

Abstract The accurate short-term passenger flow prediction is of great significance for real-time public transit management, timely emergency response as well as systematical medium and long-term planning. In this paper, we propose an end-to-end deep learning framework that can simultaneously make multi-step predictions for all stations in a large scale metro system. A sequence to sequence model embedded with the attention mechanism forms the backbone of this framework. The sequence to sequence model consists of an encoder network and a decoder network, making it good at modeling sequential data with varying lengths and the attention mechanism further enhances its ability to capture long-range dependencies. We use the proposed framework to predict the number of passengers alighting at each station in the near future, given the number of passengers boarding at each station in the last few short-term periods. The large quantities of real-world data collected from Singapore’s metro system are used to validate the proposed model. In addition, a set of comparisons made among our model and other classical approaches evidently indicates that the proposed model is more scalable and robust than other baselines in making multi-step and network-wide predictions for short-term passenger flow.

[1]  Etienne Côme,et al.  Short & long term forecasting of multimodal transport passenger flows with machine learning methods , 2017, 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC).

[2]  Jiawei Wang,et al.  Traffic speed prediction for urban transportation network: A path based deep learning approach , 2019, Transportation Research Part C: Emerging Technologies.

[3]  Mu-Chen Chen,et al.  Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks , 2012 .

[4]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[5]  Shing Chung Josh Wong,et al.  Urban traffic flow prediction using a fuzzy-neural approach , 2002 .

[6]  Yu Zheng,et al.  Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction , 2016, AAAI.

[7]  Eleni I. Vlahogianni,et al.  Short-term traffic forecasting: Where we are and where we’re going , 2014 .

[8]  Billy M. Williams,et al.  Urban Freeway Traffic Flow Prediction: Application of Seasonal Autoregressive Integrated Moving Average and Exponential Smoothing Models , 1998 .

[9]  Tharam S. Dillon,et al.  Neural-Network-Based Models for Short-Term Traffic Flow Forecasting Using a Hybrid Exponential Smoothing and Levenberg–Marquardt Algorithm , 2012, IEEE Transactions on Intelligent Transportation Systems.

[10]  Jianping Wu,et al.  Traffic Flow Prediction with Rainfall Impact Using a Deep Learning Method , 2017 .

[11]  Michael J Demetsky,et al.  TRAFFIC FLOW FORECASTING: COMPARISON OF MODELING APPROACHES , 1997 .

[12]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[13]  Feng Shu,et al.  Short-term traffic flow prediction based on spatio-temporal analysis and CNN deep learning , 2019, Transportmetrica A: Transport Science.

[14]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[15]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[16]  Xiqun Chen,et al.  Short-Term Forecasting of Passenger Demand under On-Demand Ride Services: A Spatio-Temporal Deep Learning Approach , 2017, ArXiv.

[17]  Man-Chun Tan,et al.  An Aggregation Approach to Short-Term Traffic Flow Prediction , 2009, IEEE Transactions on Intelligent Transportation Systems.

[18]  Jianqiang Li,et al.  An Attention-Based Air Quality Forecasting Method , 2018, 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA).

[19]  A. Pascale,et al.  Adaptive Bayesian network for traffic flow prediction , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[20]  Der-Horng Lee,et al.  Short-term freeway traffic flow prediction : Bayesian combined neural network approach , 2006 .

[21]  Fei-Yue Wang,et al.  Travel time prediction with LSTM neural network , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[22]  Lorenzo Mussone,et al.  A Study of Hybrid Neural Network Approaches and the Effects of Missing Data on Traffic Forecasting , 2001, Neural Computing & Applications.

[23]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[24]  Md Zakirul Alam Bhuiyan,et al.  Deep Irregular Convolutional Residual LSTM for Urban Traffic Passenger Flows Prediction , 2020, IEEE Transactions on Intelligent Transportation Systems.

[25]  Chuan Ding,et al.  Parallel Architecture of Convolutional Bi-Directional LSTM Neural Networks for Network-Wide Metro Ridership Prediction , 2019, IEEE Transactions on Intelligent Transportation Systems.

[26]  Praveen Edara,et al.  Network Scale Travel Time Prediction using Deep Learning , 2018, Transportation Research Record: Journal of the Transportation Research Board.

[27]  Brian L. Smith,et al.  Short-term traffic flow prediction models-a comparison of neural network and nonparametric regression approaches , 1994, Proceedings of IEEE International Conference on Systems, Man and Cybernetics.

[28]  Nicholas G. Polson,et al.  Deep learning for short-term traffic flow prediction , 2016, 1604.04527.

[29]  Mascha C. van der Voort,et al.  Combining kohonen maps with arima time series models to forecast traffic flow , 1996 .

[30]  Zhen Hu,et al.  Predicting the Metro Passengers Flow by Long-Short Term Memory , 2017, CSA/CUTE.

[31]  Pan Liu,et al.  The station-free sharing bike demand forecasting with a deep learning approach and large-scale datasets , 2018, Transportation Research Part C: Emerging Technologies.

[32]  Mark Dougherty,et al.  SHORT TERM INTER-URBAN TRAFFIC FORECASTS USING NEURAL NETWORKS , 1997 .

[33]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[34]  Sanghoon Bae,et al.  Deep Neural Networks for traffic flow prediction , 2017, 2017 IEEE International Conference on Big Data and Smart Computing (BigComp).

[35]  Bin Ran,et al.  Fuzzy-Neural Network Traffic Prediction Framework with Wavelet Decomposition , 2003 .

[36]  Yu Zheng,et al.  GeoMAN: Multi-level Attention Networks for Geo-sensory Time Series Prediction , 2018, IJCAI.

[37]  Yike Guo,et al.  Deep Sequence Learning with Auxiliary Information for Traffic Prediction , 2018, KDD.

[38]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[39]  Praveen Edara,et al.  Traffic Flow Forecasting for Urban Work Zones , 2015, IEEE Transactions on Intelligent Transportation Systems.

[40]  Rung-Ching Chen,et al.  A novel passenger flow prediction model using deep learning methods , 2017 .

[41]  Wei Li,et al.  Daily long-term traffic flow forecasting based on a deep neural network , 2019, Expert Syst. Appl..

[42]  Nicholas G. Polson,et al.  Bayesian analysis of traffic flow on interstate I-55: The LWR model , 2014, 1409.6034.

[43]  Yong Wang,et al.  Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction , 2017, Sensors.

[44]  Jing Li,et al.  Graph CNNs for Urban Traffic Passenger Flows Prediction , 2018, 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI).

[45]  Jan-Ming Ho,et al.  Travel time prediction with support vector regression , 2003, Proceedings of the 2003 IEEE International Conference on Intelligent Transportation Systems.

[46]  Chen Gang,et al.  Accurate Multisteps Traffic Flow Prediction Based on SVM , 2013 .

[47]  Bo Peng,et al.  Short‐term traffic flow prediction with linear conditional Gaussian Bayesian network , 2016 .

[48]  Michael A. West,et al.  Bayesian Inference on Network Traffic Using Link Count Data , 1998 .

[49]  Shiliang Sun,et al.  Traffic Flow Forecasting Using a Spatio-temporal Bayesian Network Predictor , 2005, ICANN.

[50]  Latifa Oukhellou,et al.  Forecasting dynamic public transport Origin-Destination matrices with long-Short term Memory recurrent neural networks , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[51]  Xiaolei Li,et al.  Traffic Flow Forecasting by a Least Squares Support Vector Machine with a Fruit Fly Optimization Algorithm , 2016 .

[52]  Peter C. Y. Chen,et al.  LSTM network: a deep learning approach for short-term traffic forecast , 2017 .

[53]  Preeti R. Bajaj,et al.  Performance analysis of support vector machine for traffic flow prediction , 2016, 2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC).

[54]  Yunpeng Wang,et al.  Spatiotemporal Recurrent Convolutional Networks for Traffic Prediction in Transportation Networks , 2017, Sensors.