Non-Asymptotic Guarantees for Reliable Identification of Granger Causality via the LASSO

Granger causality is among the widely used data-driven approaches for causal analysis of time series data with applications in various areas including economics, molecular biology, and neuroscience. Two of the main challenges of this methodology are: 1) over-fitting as a result of limited data duration, and 2) correlated process noise as a confounding factor, both leading to errors in identifying the causal influences. Sparse estimation via the LASSO has successfully addressed these challenges for parameter estimation. However, the classical statistical tests for Granger causality resort to asymptotic analysis of ordinary least squares, which require long data durations to be useful and are not immune to confounding effects. In this work, we close this gap by introducing a LASSO-based statistic and studying its non-asymptotic properties under the assumption that the true models admit sparse autoregressive representations. We establish that the sufficient conditions of LASSO also suffice for robust identification of Granger causal influences. We also characterize the false positive error probability of a simple thresholding rule for identifying Granger causal effects. We present simulation studies and application to real data to compare the performance of the ordinary least squares and LASSO in detecting Granger causal influences, which corroborate our theoretical results.

[1]  Trevor Hastie,et al.  Statistical Learning with Sparsity: The Lasso and Generalizations , 2015 .

[2]  Jianfeng Feng,et al.  Granger causality vs. dynamic Bayesian network inference: a comparative study , 2009, BMC Bioinformatics.

[3]  Kam Chung Wong,et al.  LASSO GUARANTEES FOR β-MIXING HEAVY TAILED TIME SERIES ∗ By , 2019 .

[4]  Dino Sejdinovic,et al.  Detecting and quantifying causal associations in large nonlinear time series datasets , 2017, Science Advances.

[5]  Maxim Volgushev,et al.  Properties of Slow Oscillation during Slow-Wave Sleep and Anesthesia in Cats , 2011, The Journal of Neuroscience.

[6]  Emery N. Brown,et al.  A Granger Causality Measure for Point Process Models of Ensemble Neural Spiking Activity , 2011, PLoS Comput. Biol..

[7]  Adel Javanmard,et al.  Debiasing the lasso: Optimal sample size for Gaussian designs , 2015, The Annals of Statistics.

[8]  H. Akaike A new look at the statistical model identification , 1974 .

[9]  George Michailidis,et al.  Low Rank and Structured Modeling of High-Dimensional Vector Autoregressions , 2018, IEEE Transactions on Signal Processing.

[10]  A. Seth,et al.  Granger Causality Analysis in Neuroscience and Neuroimaging , 2015, The Journal of Neuroscience.

[11]  Ali Shojaie,et al.  Discovering graphical Granger causality using the truncating lasso penalty , 2010, Bioinform..

[12]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[13]  Ali Shojaie,et al.  Network granger causality with inherent grouping structure , 2012, J. Mach. Learn. Res..

[14]  Po-Ling Loh,et al.  High-dimensional regression with noisy and missing data: Provable guarantees with non-convexity , 2011, NIPS.

[15]  Jie Ding,et al.  Model Selection Techniques: An Overview , 2018, IEEE Signal Processing Magazine.

[16]  Xinsheng Zhang,et al.  Model Selection for Vector Autoregressive Processes via Adaptive Lasso , 2013 .

[17]  A. Wald Tests of statistical hypotheses concerning several parameters when the number of observations is large , 1943 .

[18]  Xinsheng Zhang,et al.  Subset selection for vector autoregressive processes via adaptive Lasso , 2010 .

[19]  Gene H. Golub,et al.  Tikhonov Regularization and Total Least Squares , 1999, SIAM J. Matrix Anal. Appl..

[20]  Yan Liu,et al.  Sparse-GEV: Sparse Latent Space Model for Multivariate Extreme Value Time Serie Modeling , 2012, ICML.

[21]  Jian Huang,et al.  SCAD-penalized regression in high-dimensional partially linear models , 2009, 0903.5474.

[22]  Sara van de Geer,et al.  Statistics for High-Dimensional Data: Methods, Theory and Applications , 2011 .

[23]  Jürgen Kurths,et al.  Identifying causal gateways and mediators in complex spatio-temporal systems , 2015, Nature Communications.

[24]  Adel Javanmard,et al.  False Discovery Rate Control via Debiased Lasso , 2018, Electronic Journal of Statistics.

[25]  Emery N. Brown,et al.  Local cortical dynamics of burst suppression in the anaesthetized brain , 2013, Brain : a journal of neurology.

[26]  G. Michailidis,et al.  Autoregressive models for gene regulatory network inference: sparsity, stability and causality issues. , 2013, Mathematical biosciences.

[27]  Laura D. Lewis,et al.  Rapid fragmentation of neuronal networks at the onset of propofol-induced unconsciousness , 2012, Proceedings of the National Academy of Sciences.

[28]  Peng Zhao,et al.  On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[29]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[30]  A. Zeevi,et al.  NONASYMPTOTIC BOUNDS FOR AUTOREGRESSIVE TIME SERIES MODELING , 2001 .

[31]  Martin J. Wainwright,et al.  A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers , 2009, NIPS.

[32]  F. Natterer Error bounds for tikhonov regularization in hilbert scales , 1984 .

[33]  Nan-Jung Hsu,et al.  Subset selection for vector autoregressive processes using Lasso , 2008, Comput. Stat. Data Anal..

[34]  Tom Goldstein,et al.  The Split Bregman Method for L1-Regularized Problems , 2009, SIAM J. Imaging Sci..

[35]  Daniel Levenstein,et al.  Network Homeostasis and State Dynamics of Neocortical Sleep , 2016, Neuron.

[36]  Steven L. Bressler,et al.  Wiener–Granger Causality: A well established methodology , 2011, NeuroImage.

[37]  P. Mitra,et al.  Analysis of dynamic brain imaging data. , 1998, Biophysical journal.

[38]  S. Geer,et al.  On the asymptotic variance of the debiased Lasso , 2019, Electronic Journal of Statistics.

[39]  Yan Liu,et al.  Temporal causal modeling with graphical granger methods , 2007, KDD '07.

[40]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[41]  Axel Kowald,et al.  Systems Biology in Practice: Concepts, Implementation and Application , 2005 .

[42]  Chih-Ling Tsai,et al.  Regression coefficient and autoregressive order shrinkage and selection via the lasso , 2007 .

[43]  Fang Han,et al.  Transition Matrix Estimation in High Dimensional Time Series , 2013, ICML.

[44]  Min Wu,et al.  Sampling Requirements for Stable Autoregressive Estimation , 2016, IEEE Transactions on Signal Processing.

[45]  Naoki Abe,et al.  Grouped graphical Granger modeling for gene expression regulatory networks discovery , 2009, Bioinform..

[46]  Anil K. Seth,et al.  A MATLAB toolbox for Granger causal connectivity analysis , 2010, Journal of Neuroscience Methods.

[47]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[48]  Yan Liu,et al.  An Examination of Practical Granger Causality Inference , 2013, SDM.

[49]  J. Geweke,et al.  Measures of Conditional Linear Dependence and Feedback between Time Series , 1984 .

[50]  A. Skripnikov,et al.  Regularized joint estimation of related vector autoregressive models , 2019, Comput. Stat. Data Anal..

[51]  Y. Nardi,et al.  Autoregressive process modeling via the Lasso procedure , 2008, J. Multivar. Anal..

[52]  Karl J. Friston,et al.  Analysing connectivity with Granger causality and dynamic causal modelling , 2013, Current Opinion in Neurobiology.

[53]  E. Fox,et al.  Neural Granger Causality for Nonlinear Time Series , 2018, 1802.05842.

[54]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[55]  Maurizio Corbetta,et al.  Measuring Granger Causality between Cortical Regions from Voxelwise fMRI BOLD Signals with LASSO , 2012, PLoS Comput. Biol..

[56]  W. E. Lever,et al.  THE LIMITING DISTRIBUTION OF THE LIKELIHOOD RATIO STATISTIC UNDER A CLASS OF LOCAL ALTERNATIVES. , 1967 .

[57]  Konrad Paul Kording,et al.  Quasi-experimental causality in neuroscience and behavioural research , 2018, Nature Human Behaviour.

[58]  Bernhard Schölkopf,et al.  Inferring causation from time series in Earth system sciences , 2019, Nature Communications.

[59]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[60]  J. Geweke,et al.  Measurement of Linear Dependence and Feedback between Multiple Time Series , 1982 .

[61]  G. Michailidis,et al.  Regularized estimation in sparse high-dimensional time series models , 2013, 1311.4175.

[62]  Dimitris Kugiumtzis,et al.  Non-uniform state space reconstruction and coupling detection , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  Helmut Ltkepohl,et al.  New Introduction to Multiple Time Series Analysis , 2007 .

[64]  Adel Javanmard,et al.  Confidence intervals and hypothesis testing for high-dimensional regression , 2013, J. Mach. Learn. Res..

[65]  Mingzhou Ding,et al.  Analyzing information flow in brain networks with nonparametric Granger causality , 2008, NeuroImage.