Improving the Traffic Data Imputation Accuracy Using Temporal and Spatial Information

The missing data problem exists in a lot of transportation systems while many traffic applications require accurate traffic data to work well. So, various methods were proposed for traffic data imputation in recent years. However, most of these approaches only consider the information of a single detector. In this paper, we use the temporal and spatial information collected from multiple detectors to improve the traffic data imputation accuracy. Specially, we consider Probabilistic Principal Component Analysis (PPCA) and Mixed Probabilistic Principal Component Analysis (MPPCA), which had been proven to be useful methods to handle data imputation. We examine the performance of these two methods using the temporal and spatial information. Tests show that when only using the spatial information, the errors of these two methods are reduced and MPPCA works even better than PPCA. Moreover, when both of the temporal and spatial information from multiple detectors are used, more imputation accuracy is obtained than only using the spatial information for MPPCA.

[1]  Pravin Varaiya,et al.  Measuring Traffic , 2008, 0804.2982.

[2]  Li Li,et al.  Comparison on PPCA, KPPCA and MPPCA Based Missing Data Imputing for Traffic Flow , 2013 .

[3]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .

[4]  Gopal K. Kanji,et al.  Performance measurement system , 2002 .

[5]  Stephen Graham Ritchie,et al.  TRANSPORTATION RESEARCH. PART C, EMERGING TECHNOLOGIES , 1993 .

[6]  Xing-yi Li,et al.  Short-term traffic flow forecasting model under missing data: Short-term traffic flow forecasting model under missing data , 2010 .

[7]  Wei Shen,et al.  Real-time road traffic forecasting using regime-switching space-time models and adaptive LASSO , 2012 .

[8]  Li Li,et al.  Missing traffic data: comparison of imputation methods , 2014 .

[9]  JeongYoung-Seon,et al.  Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions , 2009 .

[10]  Alexander Skabardonis,et al.  Detecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems , 2003 .

[11]  Lee D. Han,et al.  Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions , 2009, Expert Syst. Appl..

[12]  Shi Hua-ji,et al.  Short-term traffic flow forecasting model under missing data , 2010 .

[13]  J. Leonard,et al.  Markov Chain Monte Carlo Multiple Imputation Using Bayesian Networks for Incomplete Intelligent Transportation Systems Data , 2005 .

[14]  Angshuman Guin,et al.  Multiple Imputation Scheme for Overcoming the Missing Values and Variability Issues in ITS Data , 2005 .

[15]  Haitham Al-Deek,et al.  New Algorithms for Filtering and Imputation of Real-Time and Archived Dual-Loop Detector Data in I-4 Data Warehouse , 2004 .

[16]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[17]  H. J. Van Zuylen,et al.  Accurate freeway travel time prediction with state-space neural networks under missing data , 2005 .

[18]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[19]  Daiheng Ni,et al.  Markov Chain Monte Carlo Multiple Imputation Using Bayesian Networks for Incomplete Intelligent Transportation Systems Data , 2005 .

[20]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[21]  Daniel B. Fambro,et al.  Application of Subset Autoregressive Integrated Moving Average Model for Short-Term Freeway Traffic Volume Forecasting , 1999 .

[22]  Mark Dougherty,et al.  A REVIEW OF NEURAL NETWORKS APPLIED TO TRANSPORT , 1995 .

[23]  Shiliang Sun,et al.  Variational Inference for Infinite Mixtures of Gaussian Processes With Applications to Traffic Flow Prediction , 2011, IEEE Transactions on Intelligent Transportation Systems.