Fine-Tuning Deep Hybrid Long Short-Term Memory and Restricted Boltzmann Machine Network for Urban Traffic Speed Prediction

Accurate urban traffic speed prediction plays a crucial role in intelligent transportation systems (ITSs). However, the potential spatial-temporal information of big urban traffic data captured from the complex behaviors of urban systems cannot be efficiently mined based on the state-of-art intelligent models. Therefore, we propose a hybrid Long Short-Term Memory (LSTM) and Restricted Boltzmann Machine (RBM) neural network with a fine-tuning strategy for urban traffic speed prediction. Firstly, the LSTM-RBM model dynamically combines the LSTM and RBM, such that the LSTM extracts the time-varying characteristics of the speed sequences to dynamically adjust the RBM, and then the RBM can capture the deep and detailed features of the speed sequences in the optimal way. Secondly, a transfer learning fine-tuning strategy is proposed to effectively pre-train the LSTM-RBM to achieve higher accuracy. Experimental results based on traffic speed data of the second ring road in Xi'an indicate that the proposed hybrid LSTM-RBM model with fine-tuning can outperform the existing deep models.

[1]  Tetsuya Takiguchi,et al.  Voice Conversion Using RNN Pre-Trained by Recurrent Temporal Restricted Boltzmann Machines , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[2]  Yong Qi,et al.  Long short-term memory neural network for network traffic prediction , 2017, 2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE).

[3]  Geoffrey E. Hinton A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[4]  Xi Fu Wang,et al.  Forecasting Traffic Volume with Space-Time ARIMA Model , 2010 .

[5]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[6]  Wenhao Huang,et al.  Deep Architecture for Traffic Flow Prediction: Deep Belief Networks With Multitask Learning , 2014, IEEE Transactions on Intelligent Transportation Systems.

[7]  Yoshua Bengio,et al.  Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription , 2012, ICML.

[8]  Junjie Wu,et al.  Traffic Speed Prediction and Congestion Source Exploration: A Deep Learning Method , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[9]  Alessandro Sperduti,et al.  Neural Networks for Sequential Data: a Pre-training Approach based on Hidden Markov Models , 2015, Neurocomputing.

[10]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[11]  Zhiyong Cui,et al.  Deep Bidirectional and Unidirectional LSTM Recurrent Neural Network for Network-wide Traffic Speed Prediction , 2018, ArXiv.

[12]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[13]  Viral Nagori Fine tuning the parameters of back propagation algorithm for optimum learning performance , 2016, 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I).