Mining Dependencies Considering Time Lag in Spatio-Temporal Traffic Data

Learning dependency structure is meaningful to characterize causal or statistical relationships. Traditional dependencies learning algorithms only use the same time stamp data of variables. However, in many real-world applications, such as traffic system and climate, time lag is a key feature of hidden temporal dependencies, and plays an essential role in interpreting the cause of discovered temporal dependencies. In this paper, we propose a method for mining dependencies by considering the time lag. The proposed approach is based on a decomposition of the coefficients into products of two-level hierarchical coefficients, where one represents feature-level and the other represents time-level. Specially, we capture the prior information of time lag in spatio-temporal traffic data. We construct a probabilistic formulation by applying some probabilistic priors to these hierarchical coefficients, and devise an expectation-maximization (EM) algorithm to learn the model parameters. We evaluate our model on both synthetic and real-world highway traffic datasets. Experimental results show the effectiveness of our method.

[1]  Yan Liu,et al.  Temporal causal modeling with graphical granger methods , 2007, KDD '07.

[2]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[3]  Vipin Kumar,et al.  Mining time-lagged relationships in spatio-temporal climate data , 2012, 2012 Conference on Intelligent Data Understanding.

[4]  Shiliang Sun,et al.  Multi-link traffic flow forecasting using neural networks , 2010, 2010 Sixth International Conference on Natural Computation.

[5]  Daniel Hernández-Lobato,et al.  Generalized spike-and-slab priors for Bayesian group feature selection using expectation propagation , 2013, J. Mach. Learn. Res..

[6]  Barry C. Arnold,et al.  On distributions whose component ratios are Cauchy , 1992 .

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  Richard Scheines,et al.  Learning the Structure of Linear Latent Variable Models , 2006, J. Mach. Learn. Res..

[9]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[10]  Seung-Jae Lee,et al.  Dynamic OD Estimation Using Three Phase Traffic Flow Theory , 2011 .

[11]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[12]  Shiliang Sun,et al.  Network-Scale Traffic Modeling and Forecasting with Graphical Lasso and Neural Networks , 2012 .

[13]  D. Hunter,et al.  Optimization Transfer Using Surrogate Objective Functions , 2000 .

[14]  David P. Wipf,et al.  Iterative Reweighted 1 and 2 Methods for Finding Sparse Solutions , 2010, IEEE J. Sel. Top. Signal Process..

[15]  Chen Guanhua An Adaptive Traffic Flow Prediction Mechanism Based on Locally Weighted Learning , 2010 .

[16]  P. Zhao,et al.  Grouped and Hierarchical Model Selection through Composite Absolute Penalties , 2007 .

[17]  Michael I. Jordan,et al.  Dimensionality Reduction for Supervised Learning with Reproducing Kernel Hilbert Spaces , 2004, J. Mach. Learn. Res..

[18]  Lei Han,et al.  Overlapping decomposition for causal graphical modeling , 2012, KDD.

[19]  Su Yang,et al.  On feature selection for traffic congestion prediction , 2013 .

[20]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[21]  Jaume Barceló,et al.  A Kalman Filter Approach for the Estimation of Time Dependent OD Matrices Exploiting Bluetooth Traffic Data Collection , 2012 .

[22]  N. Zhang,et al.  Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics , 2010 .

[23]  Yuan Qi,et al.  Predictive automatic relevance determination by expectation propagation , 2004, ICML.

[24]  Peter Spirtes,et al.  Graphical Models for the Identification of Causal Structures in Multivariate Time Series Models , 2006, JCIS.

[25]  C. Granger Testing for causality: a personal viewpoint , 1980 .

[26]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[27]  Yan Liu,et al.  Granger Causality Analysis in Irregular Time Series , 2012, SDM.

[28]  Nir Friedman,et al.  Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm , 1999, UAI.