Robust Tensor Recovery with Fiber Outliers for Traffic Events

Event detection is attracting growing attention in smart-city research. Large-scale mobility data is an important tool for uncovering the dynamics of urban transportation systems, but such datasets are often incomplete. In this article, we develop a method to detect extreme events in large traffic datasets and to impute missing data under regular conditions. Specifically, we formulate a robust tensor recovery problem that recovers low-rank tensors from partial observations under fiber-sparse corruptions, and we use it both to identify events and to impute missing data. Our approach scales to large urban areas and takes full advantage of the spatio-temporal correlations in traffic patterns. We develop an efficient algorithm for the tensor recovery problem based on the alternating direction method of multipliers (ADMM) framework. In contrast to existing l1-norm-regularized tensor decomposition methods, our algorithm can, under mild conditions, exactly recover the values of the uncorrupted fibers of a low-rank tensor and find the positions of the corrupted fibers. Numerical experiments illustrate that our algorithm can achieve exact recovery and outlier detection even with missing-data rates as high as 40% under 5% gross corruption, depending on the tensor size and the Tucker rank of the low-rank tensor. Finally, we apply our method to a real traffic dataset from downtown Nashville, TN, and successfully detect events such as severe car crashes, construction lane closures, and other large events that cause significant traffic disruptions.
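The abstract does not spell out the optimization problem, so the following is only a minimal sketch of two proximal operators that ADMM schemes for low-rank plus fiber-sparse decompositions commonly rely on: singular value thresholding for the low-rank part and a group (l2,1-type) shrinkage that zeroes or keeps whole fibers for the outlier part. The function names (unfold, fold, svt, fiber_shrink), the threshold tau, and the toy tensor are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def unfold(tensor, mode):
    """Mode-`mode` unfolding: fibers along `mode` become matrix columns."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def fold(matrix, mode, shape):
    """Inverse of `unfold` for a tensor with the given shape."""
    rest = [s for i, s in enumerate(shape) if i != mode]
    return np.moveaxis(matrix.reshape([shape[mode]] + rest), 0, mode)

def svt(matrix, tau):
    """Singular value thresholding: proximal operator of tau * nuclear norm."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    return (u * np.maximum(s - tau, 0.0)) @ vt

def fiber_shrink(tensor, tau, mode=0):
    """Proximal operator of tau * (sum of fiber l2 norms).

    Each fiber along `mode` is scaled toward zero as a group, so a fiber is
    either removed entirely or kept (shrunk), unlike entrywise l1 shrinkage.
    """
    cols = unfold(tensor, mode)
    norms = np.linalg.norm(cols, axis=0)
    scale = np.maximum(1.0 - tau / np.maximum(norms, 1e-12), 0.0)
    return fold(cols * scale, mode, tensor.shape)

# Toy example (hypothetical data): small noise plus one grossly corrupted fiber.
rng = np.random.default_rng(0)
x = 0.01 * rng.standard_normal((4, 3, 2))
x[:, 1, 0] += 5.0
e = fiber_shrink(x, tau=1.0, mode=0)
# Indices (over the remaining two modes) of fibers that survive the shrinkage.
flagged = np.argwhere(np.linalg.norm(e, axis=0) > 0)
```

In a full ADMM loop, operators of this kind would alternate with a dual-variable update and a projection onto the observed entries; those steps are omitted here because the abstract does not specify them.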
