Large-Vector Autoregression for Multilayer Spatially Correlated Time Series

One of the most commonly used methods for modeling multivariate time series is the vector autoregressive model (VAR). VAR is generally used to identify lead, lag, and contemporaneous relationships describing Granger causality within and between time series. In this article, we investigate the VAR methodology for analyzing data consisting of multilayer time series that are spatially interdependent. When modeling VAR relationships for such data, the dependence between time series is both a curse and a blessing. The former because it requires modeling the between time-series correlation or the contemporaneous relationships which may be challenging when using likelihood-based methods. The latter because the spatial correlation structure can be used to specify the lead–lag relationships within and between time series, within and between layers. To address these challenges, we propose an L1\L2 regularized likelihood estimation method. The lead, lag, and contemporaneous relationships are estimated using an efficient algorithm that exploits sparsity in the VAR structure, accounts for the spatial dependence, and models the error dependence. We consider a case study to illustrate the applicability of our method. In the supplementary materials available online, we assess the performance of the proposed VAR model and compare it with existing methods within a simulation study.

[1]  Sara van de Geer,et al.  Statistics for High-Dimensional Data: Methods, Theory and Applications , 2011 .

[2]  Robert Kohn,et al.  Variable Selection and Covariance Selection in Multivariate Regression Models , 2005 .

[3]  Michael I. Jordan,et al.  Union support recovery in high-dimensional multivariate regression , 2008, 2008 46th Annual Allerton Conference on Communication, Control, and Computing.

[4]  Ji Zhu,et al.  Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer. , 2008, The annals of applied statistics.

[5]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[6]  Stephen J. Wright,et al.  Simultaneous Variable Selection , 2005, Technometrics.

[7]  Adam J Rothman,et al.  Sparse Multivariate Regression With Covariance Estimation , 2010, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[8]  G. Reinsel,et al.  Multivariate Reduced-Rank Regression: Theory and Applications , 1998 .

[9]  T. W. Anderson Estimating Linear Restrictions on Regression Coefficients for Multivariate Normal Distributions , 1951 .

[10]  Helmut Ltkepohl,et al.  New Introduction to Multiple Time Series Analysis , 2007 .

[11]  A. Izenman Reduced-rank regression for the multivariate linear model , 1975 .

[12]  R. Tibshirani,et al.  A note on the group lasso and a sparse group lasso , 2010, 1001.0736.

[13]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[14]  M. Yuan,et al.  Dimension reduction and coefficient estimation in multivariate linear regression , 2007 .

[15]  Richard Terrell,et al.  ON THE RECURSIVE FITTING OF SUBSET AUTOREGRESSIONS , 1982 .

[16]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[17]  Chih-Ling Tsai,et al.  MODEL SELECTION FOR MULTIVARIATE REGRESSION IN SMALL SAMPLES , 1994 .

[18]  S. Pandey,et al.  What Are Degrees of Freedom , 2008 .

[19]  Shifeng Xiong,et al.  Better subset regression , 2012, 1212.0634.

[20]  Y. Fujikoshi,et al.  Modified AIC and Cp in multivariate linear regression , 1997 .

[21]  Nan-Jung Hsu,et al.  Subset selection for vector autoregressive processes using Lasso , 2008, Comput. Stat. Data Anal..

[22]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[23]  R. Tibshirani,et al.  On the “degrees of freedom” of the lasso , 2007, 0712.0881.

[24]  秀俊 松井,et al.  Statistics for High-Dimensional Data: Methods, Theory and Applications , 2014 .

[25]  Yufeng Liu,et al.  Simultaneous multiple response regression and inverse covariance matrix estimation via penalized Gaussian maximum likelihood , 2012, J. Multivar. Anal..

[26]  Alan J. Izenman,et al.  Multivariate Reduced-Rank Regression , 2011, International Encyclopedia of Statistical Science.

[27]  Sara van de Geer,et al.  Statistics for High-Dimensional Data , 2011 .

[28]  Neil H. Timm,et al.  Multivariate Reduced-Rank Regression , 1999, Technometrics.

[29]  D. Giannone,et al.  Large Bayesian vector auto regressions , 2010 .

[30]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[31]  C. Sims MACROECONOMICS AND REALITY , 1977 .

[32]  Alfred O. Hero,et al.  $l_{0}$ Sparse Inverse Covariance Estimation , 2014, IEEE Transactions on Signal Processing.

[33]  Ellen B. Roecker,et al.  Prediction error and its estimation for subset-selected models , 1991 .

[34]  David F. Hendry,et al.  Computer Automation of General-to-Specific Model Selection Procedures , 2001 .

[35]  F. Breidt,et al.  Spatial Lasso With Applications to GIS Model Selection , 2010 .

[36]  Sushant Sachdeva,et al.  Dimension Reduction , 2008, Encyclopedia of GIS.

[37]  Yan Liu,et al.  Spatial-temporal causal modeling for climate change attribution , 2009, KDD.