SHE: Stepwise Heterogeneous Ensemble Method for Citywide Traffic Analysis

Sensored traffic data in modern cities have been collected and applied for various purposes in the domain of intelligent transportation systems (ITS). However, analyzing these traffic data often lacks in priori knowledge due to the dynamics of transportation systems, making it hard to cope with diverse scenarios with specific models. In view of the limitations of traditional approaches, in this paper, we propose the Stepwise Heterogeneous Ensemble (SHE) for citywide traffic analysis based on stacked generalization. We first prove SHE's effectiveness using error-ambiguity decomposition technique. Secondly we analyze the optimal linear combination of SHE and present the stepwise iterating strategy. We also demonstrate its validity based on Kullback-Leibler divergence analysis. Thirdly we integrate six classical approaches into SHE framework, including linear least squares regression (LLSR), autoregressive moving average (ARMA), historical mean (HM), artificial neural network (ANN), radical basis function neural network (RBFNN), support vector machine (SVM). We further compare SHE's performance with other four linear combination models, namely equal weights method (EW), optimal weights method (OW), minimum error method (ME) and minimum variance method (MV). A series of experiments are conducted with a real city traffic dataset in Beijing city. The results show that the proposed SHE method behaves more robust and precise than other six single methods. Moreover, this method also outperforms other four different combination strategies both in variance and bias. In addition, the SHE method provides an open-ending framework for citywide traffic analysis, which means any new promising models can be easily incorporated into it in the future.

[1]  Haibing Li,et al.  Applying Ant Colony Optimization to configuring stacking ensembles for data mining , 2014, Expert Syst. Appl..

[2]  Johan A. K. Suykens,et al.  EnsembleSVM: a library for ensemble learning using support vector machines , 2014, J. Mach. Learn. Res..

[3]  J. Scott Armstrong,et al.  Principles of forecasting : a handbook for researchers and practitioners , 2001 .

[4]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[5]  Hui Chen,et al.  Estimation of Delay Induced by Downstream Operations at Signalized Intersections over Extended Control Time , 2008 .

[6]  J. Scott Armstrong,et al.  Principles of forecasting , 2001 .

[7]  Yang Zhang,et al.  Analysis of Peak and Non-peak Traffic Forecasts Using Combined Models , 2011 .

[8]  Luís Torgo,et al.  A Survey of Predictive Modeling on Imbalanced Domains , 2016, ACM Comput. Surv..

[9]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[10]  Lisa Werner,et al.  Principles of forecasting: A handbook for researchers and practitioners , 2002 .

[11]  Alexandre M. Bayen,et al.  Delay Pattern Estimation for Signalized Intersections Using Sampled Travel Times , 2009 .

[12]  Ziyou Gao,et al.  Urban Traffic Jam Simulation Based on the Cell Transmission Model , 2011 .

[13]  Gaetano Fusco,et al.  Short-term speed predictions exploiting big data on large urban road networks , 2016 .

[14]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[15]  Robert L. Winkler,et al.  Simple robust averages of forecasts: Some empirical results , 2008 .

[16]  Michael A. King,et al.  Ensemble methods for advanced skier days prediction , 2014, Expert Syst. Appl..

[17]  Lukas Ambühl,et al.  Data fusion algorithm for macroscopic fundamental diagram estimation , 2016 .

[18]  T. Evgeniou,et al.  To combine or not to combine: selecting among forecasts and their combinations , 2005 .

[19]  Robert E. Schapire,et al.  The strength of weak learnability , 1990, Mach. Learn..

[20]  Tom Van Woensel,et al.  Time-dependent vehicle routing problem with path flexibility , 2017 .

[21]  Feng Lu,et al.  A ST-CRF Map-Matching Method for Low-Frequency Floating Car Data , 2017, IEEE Transactions on Intelligent Transportation Systems.

[22]  Hamidreza Amindavar,et al.  Short-term traffic flow prediction using time-varying Vasicek model , 2017 .

[23]  João Gama,et al.  Ensemble learning for data stream analysis: A survey , 2017, Inf. Fusion.

[24]  Jean Paul Barddal,et al.  A Survey on Ensemble Learning for Data Stream Classification , 2017, ACM Comput. Surv..

[25]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[26]  André L. V. Coelho,et al.  Ensembling Heterogeneous Learning Models with Boosting , 2009, ICONIP.

[27]  Alípio Mário Jorge,et al.  Ensemble approaches for regression: A survey , 2012, CSUR.

[28]  Hengcai Zhang,et al.  Intersection delay estimation from floating car data via principal curves: a case study on Beijing’s road network , 2013, Frontiers of Earth Science.

[29]  Anders Krogh,et al.  Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[30]  Zhi-Hua Zhou,et al.  Exploiting unlabeled data to enhance ensemble diversity , 2009, 2010 IEEE International Conference on Data Mining.

[31]  David J. Galas,et al.  Expansion of the Kullback-Leibler Divergence, and a new class of information metrics , 2017, Axioms.

[32]  Fangfang Zheng,et al.  Urban link travel time estimation based on sparse probe vehicle data , 2013 .