Missing Data Estimation for Traffic Volume by Searching an Optimum Closed Cut in Urban Networks

Traffic data imputation has drawn significant attention from both academia and industry because traffic data often suffer from data missing problems, caused by temporary deployment of sensors, detector malfunction, and lossy communication systems. To fully exploit the spatial-temporal correlation and road topological information in an urban traffic network, we propose an optimum closed cut (OCC)-based spatio-temporal imputation technique, which is implemented in two stages: a) employing graph theory to search the OCC in the road network, for which the traffic on roads intersected by the closed cut has the maximum correlation with that on the target road while minimizing the number of intersected roads; and b) estimating the missing data on the target road using OCC-based Kriging estimator, incorporating both the road topological information and flow conservation law to improve the estimation accuracy. Experimental results using traffic data collected on real roads indicate that the OCC search algorithm can effectively capture the optimum set of neighboring sensors. An OCC-based estimator can provide more accurate imputation results compared with nearest historical average and correlative ${k}$ -NN ( ${k}$ -nearest neighbors) methods. The road topological information and flow conservation law can be explored to further improve the estimation performance while reducing the number of sensors involved in the data imputation, hence improving the computational efficiency.

[1]  Noel A Cressie,et al.  Fixed rank fi ltering for spatio-temporal data , 2010 .

[2]  N. Cressie,et al.  Bayesian Inference for the Spatial Random Effects Model , 2011 .

[3]  Limin Jia,et al.  Road Traffic Flow Prediction with a Time-Oriented ARIMA Model , 2009, 2009 Fifth International Joint Conference on INC, IMS and IDC.

[4]  Enrique F. Castillo,et al.  The Observability Problem in Traffic Models: Algebraic and Topological Methods , 2008, IEEE Transactions on Intelligent Transportation Systems.

[5]  Yunpeng Wang,et al.  A spatiotemporal correlative k-nearest neighbor model for short-term traffic multistep forecasting , 2016 .

[6]  Billy M. Williams,et al.  Modeling and Forecasting Vehicular Traffic Flow as a Seasonal ARIMA Process: Theoretical Basis and Empirical Results , 2003, Journal of Transportation Engineering.

[7]  Eleni I. Vlahogianni,et al.  Short‐term traffic forecasting: Overview of objectives and methods , 2004 .

[8]  G. R. Nudd,et al.  The Viterbi Algorithm , 1993 .

[9]  Danya Yao,et al.  Missing data imputation for traffic flow based on improved local least squares , 2012 .

[10]  Muhammad Tayyab Asif,et al.  Matrix and Tensor Based Methods for Missing Data Estimation in Large Traffic Networks , 2016, IEEE Transactions on Intelligent Transportation Systems.

[11]  Markos Papageorgiou,et al.  Real-Time Freeway Traffic State Estimation Based on Extended Kalman Filter: A Case Study , 2007, Transp. Sci..

[12]  Yang Zhang,et al.  Data Imputation Using Least Squares Support Vector Machines in Urban Arterial Streets , 2009, IEEE Signal Processing Letters.

[13]  Manwo Ng Synergistic sensor location for link flow inference without path enumeration: A node-based approach , 2012 .

[14]  Francesco Corman,et al.  Assessing partial observability in network sensor location problems , 2014 .

[15]  Hiroshi Esaki,et al.  Traffic causality graphs: Profiling network applications through temporal and spatial causality of flows , 2011, 2011 23rd International Teletraffic Congress (ITC).

[16]  Zhixian Yan,et al.  Traj-ARIMA: a spatial-time series model for network-constrained trajectory , 2010, IWCTS '10.

[17]  Yang Zhang,et al.  Missing traffic flow data prediction using least squares support vector machines in urban arterial streets , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[18]  M J Lighthill,et al.  On kinematic waves II. A theory of traffic flow on long crowded roads , 1955, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[19]  Guangdong Feng,et al.  A Tensor Based Method for Missing Traffic Data Completion , 2013 .

[20]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[21]  M. Zhong,et al.  ESTIMATION OF MISSING TRAFFIC COUNTS USING FACTOR, GENETIC, NEURAL AND REGRESSION TECHNIQUES , 2004 .

[22]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[23]  Alexander Skabardonis,et al.  Detecting Errors and Imputing Missing Data for Single-Loop Surveillance Systems , 2003 .

[24]  Mechthild Stoer,et al.  A simple min-cut algorithm , 1997, JACM.

[25]  Chang-Tien Lu,et al.  Urban Traffic Flow Prediction Using a Spatio-Temporal Random Effects Model , 2016, J. Intell. Transp. Syst..

[26]  Sailes K. Sengijpta Fundamentals of Statistical Signal Processing: Estimation Theory , 1995 .

[27]  Bin Ran,et al.  Tensor based missing traffic data completion with spatial–temporal correlation , 2016 .

[28]  N. Cressie,et al.  Fixed rank kriging for very large spatial data sets , 2008 .

[29]  H. J. Van Zuylen,et al.  Accurate freeway travel time prediction with state-space neural networks under missing data , 2005 .

[30]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .

[31]  Yiannis Kamarianakis,et al.  Spatial Time-Series Modeling: A review of the proposed methodologies , 2006 .

[32]  Hwasoo Yeo,et al.  Data-Driven Imputation Method for Traffic Data in Sectional Units of Road Links , 2016, IEEE Transactions on Intelligent Transportation Systems.

[33]  Luis Miguel Romero Pérez,et al.  Traffic Flow Estimation Models Using Cellular Phone Data , 2012, IEEE Transactions on Intelligent Transportation Systems.