Ensemble Learning for Short-Term Traffic Prediction Based on Gradient Boosting Machine

Short-term traffic prediction is vital for intelligent traffic systems and is influenced by neighboring traffic conditions. This study proposes gradient boosting decision trees (GBDT), an ensemble learning method, for short-term traffic prediction based on traffic volume data collected by loop detectors on a freeway. At each iteration, a new simple decision tree is added to the ensemble and trained on the error of the whole ensemble built so far. The relative importance of the variables can be quantified during GBDT training, indicating the relationship between the input variables and the response. The influence of neighboring traffic conditions on prediction performance is identified by combining traffic volume data from different upstream and downstream detectors as the input, which also improves prediction performance. The relative importance of the input variables differs across the 15 GBDT models, and the impact of upstream traffic conditions is not balanced with that of downstream conditions. The prediction accuracy of GBDT is generally higher than that of the support vector machine (SVM) and the back-propagation neural network (BPNN) for different numbers of steps ahead, and multi-step-ahead models are less accurate than 1-step-ahead models. For 1-step-ahead models, the prediction errors of GBDT are smaller than those of SVM and BPNN during both peak and nonpeak hours.
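The boosting procedure and the relative-importance measure described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes squared-error loss, depth-1 trees (stumps), a learning rate of 0.1, and 100 trees, and it runs on synthetic volumes for a hypothetical upstream, target, and downstream detector. The names `fit_stump`, `fit_gbdt`, and `predict` are illustrative only.

```python
import math
import random

def fit_stump(X, r):
    """Return the single-split regression stump that best fits residuals r."""
    n, d = len(X), len(X[0])
    total = sum(r)
    best = None  # (gain, feature, threshold, left_mean, right_mean)
    for j in range(d):
        order = sorted(range(n), key=lambda i: X[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += r[order[k - 1]]
            lo, hi = X[order[k - 1]][j], X[order[k]][j]
            if lo == hi:
                continue  # cannot split between tied values
            right_sum = total - left_sum
            # Reduction in squared error obtained by this split.
            gain = left_sum**2 / k + right_sum**2 / (n - k) - total**2 / n
            if best is None or gain > best[0]:
                best = (gain, j, (lo + hi) / 2, left_sum / k, right_sum / (n - k))
    return best

def fit_gbdt(X, y, n_trees=100, lr=0.1):
    """Add stumps one at a time, each trained on the current ensemble's error."""
    f0 = sum(y) / len(y)
    pred = [f0] * len(y)
    stumps, importance = [], [0.0] * len(X[0])
    for _ in range(n_trees):
        resid = [yi - pi for yi, pi in zip(y, pred)]  # error of the ensemble so far
        gain, j, thr, lm, rm = fit_stump(X, resid)
        stumps.append((j, thr, lm, rm))
        importance[j] += gain  # credit the split variable with the error reduction
        pred = [p + lr * (lm if x[j] <= thr else rm) for p, x in zip(pred, X)]
    scale = sum(importance)
    importance = [100.0 * v / scale for v in importance]  # percentages
    return {"f0": f0, "lr": lr, "stumps": stumps}, importance

def predict(model, x):
    out = model["f0"]
    for j, thr, lm, rm in model["stumps"]:
        out += model["lr"] * (lm if x[j] <= thr else rm)
    return out

def mse(y_true, y_hat):
    return sum((a - b) ** 2 for a, b in zip(y_true, y_hat)) / len(y_true)

# Synthetic data (assumption): target volume follows upstream volume by one step.
random.seed(0)
T = 300
up = [100 + 50 * math.sin(2 * math.pi * t / 96) + random.gauss(0, 5) for t in range(T)]
target = [up[0]] + [0.9 * up[t - 1] + random.gauss(0, 3) for t in range(1, T)]
down = [target[0]] + [0.8 * target[t - 1] + random.gauss(0, 3) for t in range(1, T)]

# 1-step-ahead setup: inputs at time t, response is the target volume at t + 1.
X = [[target[t], up[t], down[t]] for t in range(T - 1)]
y = [target[t + 1] for t in range(T - 1)]
X_train, y_train = X[:200], y[:200]
X_test, y_test = X[200:], y[200:]

model, importance = fit_gbdt(X_train, y_train)
train_mse = mse(y_train, [predict(model, x) for x in X_train])
baseline_mse = mse(y_train, [sum(y_train) / len(y_train)] * len(y_train))
test_mse = mse(y_test, [predict(model, x) for x in X_test])

print("relative importance (%) [target, upstream, downstream]:", importance)
print("train MSE:", train_mse, "mean-only baseline MSE:", baseline_mse)
print("test MSE:", test_mse)
```

In this sketch, a variable's importance is the share of total squared-error reduction contributed by splits on that variable, which is one common way to define relative importance for boosted trees. A multi-step-ahead variant would only change the response, for example `target[t + h]` for a horizon `h`.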
