Cluster-Based LSTM Network for Short-Term Passenger Flow Forecasting in Urban Rail Transit

Short-term passenger flow forecasting is an essential component for the operation of urban rail transit (URT). Therefore, it is necessary to obtain a higher prediction precision with the development of URT. As artificial intelligence becomes increasingly prevalent, many prediction methods including the long short-term memory network (LSTM) in the deep learning field have been applied in road transportation systems, which can give critical insights for URT. First, we propose a novel two-step K-Means clustering model to capture not only the passenger flow variation trends but also the ridership volume characteristics. Then, a predictability assessment model is developed to recommend a reasonable time granularity interval to aggregate passenger flows. Based on the clustering results and the recommended time granularity interval, the LSTM model, which is called CB-LSTM model, is proposed to conduct short-term passenger flow forecasting. Results show that the prediction based on subway station clusters can not only avoid the complication of developing numerous models for each of the hundreds of stations, but also improve the prediction performance, which make it possible to predict short-term passenger flow on a network scale using limited dataset. The results provide critical insights for subway operators and transportation policymakers.

[1]  Ji Wei Li,et al.  Short-Time Passenger Volume Forecasting of Urban Rail Transit Based on Multiple Fusion , 2014 .

[2]  Jürgen Schmidhuber,et al.  Learning to forget: continual prediction with LSTM , 1999 .

[3]  Zhonghui Chen,et al.  Short-term traffic flow prediction with Conv-LSTM , 2017, 2017 9th International Conference on Wireless Communications and Signal Processing (WCSP).

[4]  B. Han,et al.  Urban Rail Transit in China: Progress Report and Analysis (2008–2015) , 2016 .

[5]  Yike Guo,et al.  Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction , 2018, ACM Multimedia.

[6]  Zhen Xie,et al.  Short-Term Abnormal Passenger Flow Prediction Based on the Fusion of SVR and LSTM , 2019, IEEE Access.

[7]  Ning Zhang,et al.  Forecasting of Short-Term Metro Ridership with Support Vector Machine Online Model , 2018, Journal of Advanced Transportation.

[8]  Gérald Gavin,et al.  A dynamic Bayesian network approach to forecast short-term urban rail passenger flows with incomplete data , 2017 .

[9]  Yang Li,et al.  Forecasting short-term subway passenger flow under special events scenarios using multiscale radial basis function networks ☆ , 2017 .

[10]  David Vandyke,et al.  Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.

[11]  Christian Overgaard Hansen,et al.  A passenger travel demand model for Copenhagen , 2003 .

[12]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[13]  Manoranjan Parida,et al.  Short Term Traffic Flow Prediction for a Non Urban Highway Using Artificial Neural Network , 2013 .

[14]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[15]  Yu Zheng,et al.  Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction , 2016, AAAI.

[16]  Harry Timmermans,et al.  Understanding urban mobility patterns from a spatiotemporal perspective: daily ridership profiles of metro stations , 2020 .

[17]  Ning Zhang,et al.  Short-term forecasting of urban rail transit ridership based on ARIMA and wavelet decomposition , 2018 .

[18]  Peter C. Y. Chen,et al.  LSTM network: a deep learning approach for short-term traffic forecast , 2017 .

[19]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[20]  B Ahmed,et al.  The Traditional Four Steps Transportation Modeling Using a Simplified Transport Network: A Case Study of Dhaka City, Bangladesh , 2012 .

[21]  Nebojsa J. Bojovic,et al.  A hybrid model for forecasting the volume of passenger flows on Serbian railways , 2016, Oper. Res..

[22]  Sepp Hochreiter,et al.  Untersuchungen zu dynamischen neuronalen Netzen , 1991 .

[23]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[24]  Mu-Chen Chen,et al.  Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks , 2012 .

[25]  Fei Ma,et al.  Random coefficient modeling research on short-term forecast of passenger flow into an urban rail transit station , 2016 .

[26]  Eleni I. Vlahogianni,et al.  Optimized and meta-optimized neural networks for short-term traffic flow prediction: A genetic approach , 2005 .

[27]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[28]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[29]  Yong Wang,et al.  Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation Network Speed Prediction , 2017, Sensors.

[30]  I Okutani,et al.  Dynamic prediction of traffic volume through Kalman Filtering , 1984 .

[31]  Hilmi Berk Celikoglu,et al.  Public transportation trip flow modeling with generalized regression neural networks , 2007, Adv. Eng. Softw..

[32]  Yang Liu,et al.  DeepPF: A deep learning based architecture for metro passenger flow prediction , 2019, Transportation Research Part C: Emerging Technologies.

[33]  Yizhou Sun,et al.  Real-Time Short-Term Traffic Flow Forecasting Based on Process Neural Network , 2008, ISNN.

[34]  Fei-Yue Wang,et al.  Traffic Flow Prediction With Big Data: A Deep Learning Approach , 2015, IEEE Transactions on Intelligent Transportation Systems.

[35]  Navdeep Jaitly,et al.  Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[36]  Stéphane Bonnevay,et al.  Dynamic Bayesian networks with Gaussian mixture models for short-term passenger flow forecasting , 2017, 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE).

[37]  Guy Pujolle,et al.  A Long Short-Term Memory Recurrent Neural Network Framework for Network Traffic Matrix Prediction , 2017, ArXiv.

[38]  Cynthia Chen,et al.  Diurnal Pattern of Transit Ridership: Case Study of New York City Subway System , 2009 .

[39]  H. Sohn,et al.  Ridership patterns at subway stations of Seoul capital area and characteristics of station influence area , 2017 .

[40]  Li Li,et al.  Using LSTM and GRU neural network methods for traffic flow prediction , 2016, 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC).

[41]  Billy M. Williams,et al.  Comparison of parametric and nonparametric models for traffic flow forecasting , 2002 .

[42]  John E. Moody,et al.  Fast adaptive k-means clustering: some empirical results , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[43]  Shing Chung Josh Wong,et al.  Urban traffic flow prediction using a fuzzy-neural approach , 2002 .

[44]  Yang Ying,et al.  ST-LSTM: A Deep Learning Approach Combined Spatio-Temporal Features for Short-Term Forecast in Rail Transit , 2019, Journal of Advanced Transportation.

[45]  N. Amjady,et al.  Short-Term Bus Load Forecasting of Power Systems by a New Hybrid Method , 2007, IEEE Transactions on Power Systems.

[46]  Chi-Kang Lee,et al.  Neural network based temporal feature models for short-term railway passenger demand forecasting , 2009, Expert Syst. Appl..

[47]  Moshe Ben-Akiva,et al.  STRUCTURE OF PASSENGER TRAVEL DEMAND MODELS , 1974 .