Content-Aware Traffic Data Completion in ITS Based on Generative Adversarial Nets

Big data analytics has been rapidly integrated into Intelligent Transportation System (ITS), empowering diverse applications such as real-time traffic prediction and management. However, incomplete traffic time-series data during the data analysis are nearly inevitable due to the constraints of data collection or packet loss in the communication process. Existing tensor-based completion methods fail to perform well in consecutively missing (CM) cases where a sub-series of the traffic time series is completely missing. Moreover, their high computational complexity prevents the road network-level implementation. Therefore, to tackle these problems, a batch-oriented traffic data completion method for large-scale road networks is proposed in this paper. In order to preserve multi-way natures of traffic data, we first model the traffic data by tensors so that traffic data completion becomes a tensor completion problem. Several types of tensor structures are adopted in order to analyze their impacts on traffic data completion. Thereafter, a Content-Aware traffic data completion method is further developed based on the Generative Adversarial Net (CA-GAN). More specifically, the formulated traffic tensors are interpreted as samples from a high-dimensional traffic distribution and a GAN is then proposed to learn this distribution. By considering effects of traffic data from different days and links, a weighted loss function is employed to search for the raw traffic tensor in the obtained traffic distribution. Afterwards, the estimated raw traffic tensor is used to complete the missing data. Visualization results demonstrate that our GANs can produce verisimilar traffic patterns with different time spans. Finally, our experiments on real traffic datasets show that the proposed CA-GAN method can effectively batch incomplete traffic tensors in CM cases with fast recovery process.

[1]  Emmanuel J. Candès,et al.  A Singular Value Thresholding Algorithm for Matrix Completion , 2008, SIAM J. Optim..

[2]  Ming Zhong,et al.  Genetically Designed Models for Accurate Imputation of Missing Traffic Counts , 2004 .

[3]  Xianbin Wang,et al.  A $Q$ -Learning-Based Proactive Caching Strategy for Non-Safety Related Services in Vehicular Networks , 2019, IEEE Internet of Things Journal.

[4]  J. Shao,et al.  Nearest Neighbor Imputation for Survey Data , 2000 .

[5]  Xuemin Shen,et al.  Real-Time Path Planning Based on Hybrid-VANET-Enhanced Transportation System , 2015, IEEE Transactions on Vehicular Technology.

[6]  Yi Zhang,et al.  Spatial-temporal traffic data analysis based on global data management using MAS , 2004, IEEE Trans. Intell. Transp. Syst..

[7]  Li Zhao,et al.  Vehicle-to-Everything (v2x) Services Supported by LTE-Based Systems and 5G , 2017, IEEE Communications Standards Magazine.

[8]  Yi Zhang,et al.  A BPCA based missing value imputing method for traffic flow volume data , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[9]  Johan Håstad Tensor Rank is NP-Complete , 1990, J. Algorithms.

[10]  Minh N. Do,et al.  Semantic Image Inpainting with Deep Generative Models , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Xianbin Wang,et al.  Short-Term Traffic Prediction Based on DeepCluster in Large-Scale Road Networks , 2019, IEEE Transactions on Vehicular Technology.

[12]  Tamara G. Kolda,et al.  Scalable Tensor Factorizations for Incomplete Data , 2010, ArXiv.

[13]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .

[14]  Muhammad Tayyab Asif,et al.  Matrix and Tensor Based Methods for Missing Data Estimation in Large Traffic Networks , 2016, IEEE Transactions on Intelligent Transportation Systems.

[15]  Yuexing Peng,et al.  Enhanced Deep Networks for Short-Term and Medium-Term Load Forecasting , 2019, IEEE Access.

[16]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[17]  Shiqian Ma,et al.  Fixed point and Bregman iterative methods for matrix rank minimization , 2009, Math. Program..

[18]  A. Kibangou,et al.  Traffic data imputation via tensor completion based on soft thresholding of Tucker core , 2017 .

[19]  Yin Wang,et al.  The retrieval of intra-day trend and its influence on traffic prediction , 2012 .

[20]  John L.P. Thompson,et al.  Missing data , 2004, Amyotrophic lateral sclerosis and other motor neuron disorders : official publication of the World Federation of Neurology, Research Group on Motor Neuron Diseases.

[21]  Lili Zhang,et al.  The Study of Traffic Flow Information Completion Based on GAN Algorithm , 2018, 2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS).

[22]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[23]  Victor C. M. Leung,et al.  Delay-Optimal Virtualized Radio Resource Scheduling in Software-Defined Vehicular Networks via Stochastic Learning , 2016, IEEE Transactions on Vehicular Technology.

[24]  David Pfau,et al.  Unrolled Generative Adversarial Networks , 2016, ICLR.

[25]  Xinyu Chen,et al.  Spatial-temporal traffic speed patterns discovery and incomplete data recovery via SVD-combined tensor decomposition , 2018 .

[26]  Angshuman Guin,et al.  Multiple Imputation Scheme for Overcoming the Missing Values and Variability Issues in ITS Data , 2005 .

[27]  Huachun Tan,et al.  Low Multilinear Rank Approximation of Tensors and Application in Missing Traffic Data , 2014 .

[28]  Xianbin Wang,et al.  Patent Analytic Citation-Based VSM: Challenges and Applications , 2020, IEEE Access.

[29]  Jiannong Cao,et al.  Accurate Recovery of Internet Traffic Data: A Sequential Tensor Completion Approach , 2018, IEEE/ACM Transactions on Networking.

[30]  Yi Zhang,et al.  PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach , 2009, IEEE Transactions on Intelligent Transportation Systems.

[31]  M. Zhong,et al.  ESTIMATION OF MISSING TRAFFIC COUNTS USING FACTOR, GENETIC, NEURAL AND REGRESSION TECHNIQUES , 2004 .

[32]  Xuemin Shen,et al.  An SMDP-Based Resource Allocation in Vehicular Cloud Computing Systems , 2015, IEEE Transactions on Industrial Electronics.

[33]  Li Li,et al.  Efficient missing data imputing for traffic flow by considering temporal and spatial dependence , 2013 .

[34]  Jieping Ye,et al.  Tensor Completion for Estimating Missing Values in Visual Data , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Johan Håstad,et al.  Tensor Rank is NP-Complete , 1989, ICALP.

[36]  Yue Tan,et al.  Deep Reinforcement Learning for Autonomous Internet of Things: Model, Applications and Challenges , 2019, IEEE Communications Surveys & Tutorials.

[37]  Bin Ran,et al.  Tensor based missing traffic data completion with spatial–temporal correlation , 2016 .

[38]  Fei-Yue Wang,et al.  Data-Driven Intelligent Transportation Systems: A Survey , 2011, IEEE Transactions on Intelligent Transportation Systems.

[39]  B. Recht,et al.  Tensor completion and low-n-rank tensor recovery via convex optimization , 2011 .

[40]  Guangdong Feng,et al.  A Tensor Based Method for Missing Traffic Data Completion , 2013 .

[41]  Yi Zhang,et al.  Trend Modeling for Traffic Time Series Analysis: An Integrated Study , 2015, IEEE Transactions on Intelligent Transportation Systems.