Robust and flexible strategy for missing data imputation in intelligent transportation system

Rich and complete data play a fundamental role in intelligent traffic management and control applications. A great volume of missing data is found in the intelligent transportation system. In this paper, the authors introduce an ensemble strategy to handle the missing values. The proposed strategy is a general framework that different models, whether linear, neural networks, or other, can be applied. In this strategy, missing values are first computed by the forward and backward models, and their results are combined to recover the incomplete raw data. Then, the models are iterated for several times to enhance the accuracy. Three commonly used imputation models are tested in the proposed strategy using the data from real world. The results indicate that the proposed strategy can significantly improve the accuracy of the imputation with different missing types and during different traffic states. Moreover, the increase of the iteration is capable to improve the performance of the models.

[1]  Bin Ran,et al.  Tensor based missing traffic data completion with spatial–temporal correlation , 2016 .

[2]  Bin Ran,et al.  Estimating Missing Traffic Volume Using Low Multilinear Rank Tensor Completion , 2016, J. Intell. Transp. Syst..

[3]  Jinjun Tang,et al.  Characterizing traffic time series based on complex network theory , 2013 .

[4]  Jinjun Tang,et al.  Exploring dynamic property of traffic flow time series in multi-states based on complex networks: Phase space reconstruction versus visibility graph , 2016 .

[5]  Henry X. Liu,et al.  Use of Local Linear Regression Model for Short-Term Traffic Forecasting , 2003 .

[6]  Bin Ran,et al.  Short‐term highway traffic flow prediction based on a hybrid strategy considering temporal–spatial information , 2016 .

[7]  Thong Ngee Goh,et al.  A comparative study of neural network and Box-Jenkins ARIMA modeling in time series prediction , 2002 .

[8]  Fang Liu,et al.  An Improved Fuzzy Neural Network for Traffic Speed Prediction Considering Periodic Characteristic , 2017, IEEE Transactions on Intelligent Transportation Systems.

[9]  Tharam S. Dillon,et al.  Neural-Network-Based Models for Short-Term Traffic Flow Forecasting Using a Hybrid Exponential Smoothing and Levenberg–Marquardt Algorithm , 2012, IEEE Transactions on Intelligent Transportation Systems.

[10]  Guangdong Feng,et al.  A Tensor Based Method for Missing Traffic Data Completion , 2013 .

[11]  Yinhai Wang,et al.  A hybrid approach to integrate fuzzy C-means based imputation method with genetic algorithm for missing traffic volume data estimation , 2015 .

[12]  Li Li,et al.  Missing traffic data: comparison of imputation methods , 2014 .

[13]  Zhang Jian,et al.  Traffic Speed Prediction for Highway Operations Based on a Symbolic Regression Algorithm , 2017 .

[14]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[15]  M. Zhong,et al.  ESTIMATION OF MISSING TRAFFIC COUNTS USING FACTOR, GENETIC, NEURAL AND REGRESSION TECHNIQUES , 2004 .

[16]  Mascha C. van der Voort,et al.  Combining kohonen maps with arima time series models to forecast traffic flow , 1996 .

[17]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[18]  Alexander Skabardonis,et al.  Detecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems , 2003 .

[19]  Fang Liu,et al.  Travel Time Estimation Using Freeway Point Detector Data Based on Evolving Fuzzy Neural Inference System , 2016, PloS one.

[20]  Byoung-Jo Yoon,et al.  Dynamic near-term traffic flow prediction: system- oriented approach based on past experiences , 2012 .

[21]  Hwasoo Yeo,et al.  Data-Driven Imputation Method for Traffic Data in Sectional Units of Road Links , 2016, IEEE Transactions on Intelligent Transportation Systems.

[22]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[23]  Dali Wei,et al.  An Adaptive-Margin Support Vector Regression for Short-Term Traffic Flow Forecast , 2013, J. Intell. Transp. Syst..

[24]  Amir F. Atiya,et al.  Forward and Backward Forecasting Ensembles for the Estimation of Time Series Missing Data , 2014, ANNPR.

[25]  Muhammad Tayyab Asif,et al.  Matrix and Tensor Based Methods for Missing Data Estimation in Large Traffic Networks , 2016, IEEE Transactions on Intelligent Transportation Systems.

[26]  Ming Zhong,et al.  Assessing Robustness of Imputation Models Based on Data from Different Jurisdictions: Examples of Alberta and Saskatchewan, Canada , 2005 .

[27]  Srinivas Peeta,et al.  Multiple measures-based chaotic time series for traffic flow prediction based on Bayesian theory , 2016, Nonlinear Dynamics.

[28]  ChangHyunho,et al.  Multiple time period imputation technique for multiple missing traffic variables: nonparametric regression approach , 2012 .

[29]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .

[30]  Chao Zhang,et al.  Multiple sources and multiple measures based traffic flow prediction using the chaos theory and support vector regression method , 2017 .

[31]  Jian Zhang,et al.  Using Tensor Completion Method to Achieving Better Coverage of Traffic State Estimation from Sparse Floating Car Data , 2016, PloS one.