Missing Value Imputation for Traffic-Related Time Series Data Based on a Multi-View Learning Method

In reality, readings of sensors on highways are usually missing at various unexpected moments due to some sensor or communication errors. These missing values do not only influence the real-time traffic monitoring but also prevent further traffic data mining. In this paper, we propose a multi-view learning method to estimate the missing values for traffic-related time series data. The model combines data-driven algorithms (long-short term memory and support vector regression) and collaborative filtering techniques. It can consider the local and global variation in temporal and spatial views to capture more information from the existing data. The estimations of missing values from four views are aggregated to obtain a final value with a kernel function. Data from a highway network are used to evaluate the performance of the proposed model in terms of accuracy, precision, and agreement. The results indicate that our proposed model outperforms other baselines, especially for block missing pattern with a high missing ratio. Furthermore, the sensitivity of the parameters is analyzed. We can conclude that combining different views can improve the performance of the imputation.

[1]  Hwasoo Yeo,et al.  Data-Driven Imputation Method for Traffic Data in Sectional Units of Road Links , 2016, IEEE Transactions on Intelligent Transportation Systems.

[2]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[3]  Bin Ran,et al.  Freeway Multisensor Data Fusion Approach Integrating Data from Cellphone Probes and Fixed Sensors , 2016, J. Sensors.

[4]  Stephen H Richards,et al.  Combining the Statistical Model and Heuristic Model to Predict Flow Rate , 2014 .

[5]  Tianrui Li,et al.  ST-MVL: Filling Missing Values in Geo-Sensory Time Series Data , 2016, IJCAI.

[6]  Lorenzo Mussone,et al.  A Study of Hybrid Neural Network Approaches and the Effects of Missing Data on Traffic Forecasting , 2001, Neural Computing & Applications.

[7]  Ahmet Arslan,et al.  A hybrid method for imputation of missing values using optimized fuzzy c-means with support vector regression and a genetic algorithm , 2013, Inf. Sci..

[8]  Bin Ran,et al.  Short‐term highway traffic flow prediction based on a hybrid strategy considering temporal–spatial information , 2016 .

[9]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[10]  M. Zhong,et al.  ESTIMATION OF MISSING TRAFFIC COUNTS USING FACTOR, GENETIC, NEURAL AND REGRESSION TECHNIQUES , 2004 .

[11]  Vadlamani Ravi,et al.  A new online data imputation method based on general regression auto associative neural network , 2014, Neurocomputing.

[12]  Yajie Zou,et al.  Flexible and Robust Method for Missing Loop Detector Data Imputation , 2015 .

[13]  Yinhai Wang,et al.  A hybrid approach to integrate fuzzy C-means based imputation method with genetic algorithm for missing traffic volume data estimation , 2015 .

[14]  Li Li,et al.  Missing traffic data: comparison of imputation methods , 2014 .

[15]  Bin Ran,et al.  Robust and flexible strategy for missing data imputation in intelligent transportation system , 2017 .

[16]  W. Tobler On the First Law of Geography: A Reply , 2004 .

[17]  Guangdong Feng,et al.  A Tensor Based Method for Missing Traffic Data Completion , 2013 .

[18]  Jonathan D. Cryer,et al.  Time Series Analysis , 1986 .

[19]  Stephen H Richards,et al.  Flow rate and time mean speed predictions for the urban freeway network using state space models , 2014 .

[20]  Roberto Horowitz,et al.  Imputation of Ramp Flow Data for Freeway Traffic Simulation , 2009 .

[21]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[22]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .

[23]  Bin Ran,et al.  Short-Term Traffic Prediction Based on Dynamic Tensor Completion , 2016, IEEE Transactions on Intelligent Transportation Systems.

[24]  Kay W. Axhausen,et al.  Understanding urban mobility patterns with a probabilistic tensor factorization framework , 2016 .

[25]  Fei-Yue Wang,et al.  An efficient realization of deep learning for traffic data imputation , 2016 .

[26]  Bin Ran,et al.  Tensor based missing traffic data completion with spatial–temporal correlation , 2016 .

[27]  Huachun Tan,et al.  Robust tensor decomposition based on Cauchy distribution and its applications , 2017, Neurocomputing.

[28]  H. J. Van Zuylen,et al.  Accurate freeway travel time prediction with state-space neural networks under missing data , 2005 .

[29]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[30]  Alexander Skabardonis,et al.  Freeway Performance Measurement System: Operational Analysis Tool , 2002 .

[31]  Yunpeng Wang,et al.  Long short-term memory neural network for traffic speed prediction using remote microwave sensor data , 2015 .

[32]  Jianwei Wang,et al.  Travel Time Prediction , 2008 .

[33]  Yunpeng Wang,et al.  A spatiotemporal correlative k-nearest neighbor model for short-term traffic multistep forecasting , 2016 .

[34]  Muhammad Tayyab Asif,et al.  Matrix and Tensor Based Methods for Missing Data Estimation in Large Traffic Networks , 2016, IEEE Transactions on Intelligent Transportation Systems.

[35]  Ming Zhong,et al.  Genetically Designed Models for Accurate Imputation of Missing Traffic Counts , 2004 .