Independent innovation analysis for nonlinear vector autoregressive process

The nonlinear vector autoregressive (NVAR) model provides an appealing framework to analyze multivariate time series obtained from a nonlinear dynamical system. However, the innovation (or error), which plays a key role by driving the dynamics, is almost always assumed to be additive. Additivity greatly limits the generality of the model, hindering analysis of general NVAR process which have nonlinear interactions between the innovations. Here, we propose a new general framework called independent innovation analysis (IIA), which estimates the innovations from completely general NVAR. We assume mutual independence of the innovations as well as their modulation by a fully observable auxiliary variable (which is often taken as the time index and simply interpreted as nonstationarity). We show that IIA guarantees the identifiability of the innovations with arbitrary nonlinearities, up to a permutation and component-wise invertible nonlinearities. We propose two practical estimation methods, both of which can be easily implemented by ordinary neural network training. We thus provide the first rigorous identifiability result for general NVAR, as well as very general tools for learning such models.

[1]  Simon Hanslmayr,et al.  Across-subjects classification of stimulus modality from human MEG high frequency activity , 2017, bioRxiv.

[2]  Seunghoon Hong,et al.  Decomposing Motion and Content for Natural Video Sequence Prediction , 2017, ICLR.

[3]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[4]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[5]  Aapo Hyvarinen,et al.  Hidden Markov Nonlinear ICA: Unsupervised Learning from Nonstationary Time Series , 2020, UAI.

[6]  Georgios B. Giannakis,et al.  Nonlinear Structural Vector Autoregressive Models With Application to Directed Brain Networks , 2019, IEEE Transactions on Signal Processing.

[7]  Giorgio E. Primiceri Time Varying Structural Vector Autoregressions and Monetary Policy , 2002 .

[8]  Sergey Levine,et al.  Unsupervised Learning for Physical Interaction through Video Prediction , 2016, NIPS.

[9]  Jim E. Griffin,et al.  Bayesian Nonparametric Vector Autoregressive Models , 2017 .

[10]  H. Holzmann,et al.  Nonparametric identification of hidden Markov models , 2014 .

[11]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[12]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[13]  Ivan Jeliazkov,et al.  Nonparametric Vector Autoregressions: Specification, Estimation, and Inference , 2013 .

[14]  Luigi Gresele,et al.  Relative gradient optimization of the Jacobian term in unsupervised deep learning , 2020, NeurIPS.

[15]  Karen O. Egiazarian,et al.  Measuring directional coupling between EEG sources , 2008, NeuroImage.

[16]  C. Sims MACROECONOMICS AND REALITY , 1977 .

[17]  G. Koop,et al.  Bayesian Multivariate Time Series Methods for Empirical Macroeconomics , 2009 .

[18]  Gabriel Kreiman,et al.  Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning , 2016, ICLR.

[19]  Nitish Srivastava,et al.  Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.

[20]  Aapo Hyvärinen,et al.  Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..

[21]  Aapo Hyvärinen,et al.  Density Estimation in Infinite Dimensional Exponential Families , 2013, J. Mach. Learn. Res..

[22]  Oyer,et al.  Causal Inference by Independent Component Analysis: Theory and Applications∗ , 2012 .

[23]  Aapo Hyvärinen,et al.  Blind source separation by nonstationarity of variance: a cumulant-based approach , 2001, IEEE Trans. Neural Networks.

[24]  Aapo Hyvärinen,et al.  Nonlinear independent component analysis: Existence and uniqueness results , 1999, Neural Networks.

[25]  R. Tsay Testing and modeling multivariate threshold models , 1998 .

[26]  P. Saikkonen,et al.  Identification and estimation of non-Gaussian structural vector autoregressions , 2015 .

[27]  Aapo Hyvärinen,et al.  Unsupervised Feature Extraction by Time-Contrastive Learning and Nonlinear ICA , 2016, NIPS.

[28]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[29]  Aapo Hyvärinen,et al.  Nonlinear ICA Using Auxiliary Variables and Generalized Contrastive Learning , 2018, AISTATS.

[30]  T. Teräsvirta Specification, Estimation, and Evaluation of Smooth Transition Autoregressive Models , 1994 .

[31]  Honglak Lee,et al.  Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.

[32]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[33]  Aapo Hyvärinen,et al.  Independent Component Analysis for Time-dependent Stochastic Processes , 1998 .

[34]  C. Matias,et al.  Identifiability of parameters in latent structure models with many observed variables , 2008, 0809.5032.

[35]  Aapo Hyvärinen,et al.  Estimation of a Structural Vector Autoregression Model Using Non-Gaussianity , 2010, J. Mach. Learn. Res..

[36]  Aapo Hyvärinen,et al.  Variational Autoencoders and Nonlinear ICA: A Unifying Framework , 2019, AISTATS.

[37]  Ruben Villegas,et al.  Hierarchical Long-term Video Prediction without Supervision , 2018, ICML.