Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA

We introduce a new general identifiable framework for principled disentanglement referred to as Structured Nonlinear Independent Component Analysis (SNICA). Our contribution is to extend the identifiability theory of deep generative models for a very broad class of structured models. While previous works have shown identifiability for specific classes of time-series models, our theorems extend this to more general temporal structures as well as to models with more complex structures such as spatial dependencies. In particular, we establish the major result that identifiability for this framework holds even in the presence of noise of unknown distribution. Finally, as an example of our framework’s flexibility, we introduce the first nonlinear ICA model for time-series that combines the following very useful properties: it accounts for both nonstationarity and autocorrelation in a fully unsupervised setting; performs dimensionality reduction; models hidden states; and enables principled estimation and inference by variational maximum-likelihood.

[1]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[2]  Stephen M. Smith,et al.  Brain network dynamics are hierarchically organized in time , 2017, Proceedings of the National Academy of Sciences.

[3]  Luc Leh'ericy,et al.  Consistent order estimation for nonparametric hidden Markov models , 2019, Bernoulli.

[4]  Stephen M Smith,et al.  Fast transient networks in spontaneous human brain activity , 2014, eLife.

[5]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[6]  Geoffrey E. Hinton,et al.  Variational Learning for Switching State-Space Models , 2000, Neural Computation.

[7]  Aapo Hyvärinen,et al.  Nonlinear ICA Using Auxiliary Variables and Generalized Contrastive Learning , 2018, AISTATS.

[8]  Xue-Xin Wei,et al.  Learning identifiable and interpretable latent models of high-dimensional neural activity using pi-VAE , 2020, NeurIPS.

[9]  Stéphane Robin,et al.  Inference in finite state space non parametric Hidden Markov Models and applications , 2016, Stat. Comput..

[10]  Andrew Zisserman,et al.  Look, Listen and Learn , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11]  Aapo Hyvärinen,et al.  Unsupervised Feature Extraction by Time-Contrastive Learning and Nonlinear ICA , 2016, NIPS.

[12]  Alexander D'Amour,et al.  Underspecification Presents Challenges for Credibility in Modern Machine Learning , 2020, J. Mach. Learn. Res..

[13]  David M. Blei,et al.  Frequentist Consistency of Variational Bayes , 2017, Journal of the American Statistical Association.

[14]  Ryan P. Adams,et al.  Composing graphical models with neural networks for structured representations and fast inference , 2016, NIPS.

[15]  Cam-CAN Group,et al.  The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample , 2017, NeuroImage.

[16]  K. Ito,et al.  On State Estimation in Switching Environments , 1970 .

[17]  Aapo Hyvärinen,et al.  Causal Discovery with General Non-Linear Relationships using Non-Linear ICA , 2019, UAI.

[18]  Aapo Hyvarinen,et al.  Hidden Markov Nonlinear ICA: Unsupervised Learning from Nonstationary Time Series , 2020, UAI.

[19]  Aapo Hyvärinen,et al.  Independent innovation analysis for nonlinear vector autoregressive process , 2020, AISTATS.

[20]  Darren Price,et al.  Investigating the electrophysiological basis of resting state networks using magnetoencephalography , 2011, Proceedings of the National Academy of Sciences.

[21]  Aapo Hyvärinen,et al.  Nonlinear independent component analysis: Existence and uniqueness results , 1999, Neural Networks.

[22]  Christopher Burgess,et al.  beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[23]  Sylvain Le Corff,et al.  Deconvolution with unknown noise distribution is possible for multivariate signals , 2020, The Annals of Statistics.

[24]  James D. Hamilton Analysis of time series subject to changes in regime , 1990 .

[25]  Diederik P. Kingma,et al.  ICE-BeeM: Identifiable Conditional Energy-Based Deep Models , 2020, NeurIPS.

[26]  Matthias Bethge,et al.  Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding , 2020, ICLR.

[27]  Ben Poole,et al.  Weakly-Supervised Disentanglement Without Compromises , 2020, ICML.

[28]  Aapo Hyvärinen,et al.  Nonlinear ICA of fMRI reveals primitive temporal structures linked to rest, task, and behavioral traits , 2020, NeuroImage.

[29]  Elisabeth Gassiat,et al.  Identifiability and Consistent Estimation of Nonparametric Translation Hidden Markov Models with General State Space , 2020, J. Mach. Learn. Res..

[30]  Harald Oberhauser,et al.  Nonlinear Independent Component Analysis for Continuous-Time Signals , 2021, ArXiv.

[31]  Martin Luessi,et al.  MEG and EEG data analysis with MNE-Python , 2013, Front. Neuroinform..

[32]  Luigi Gresele,et al.  Relative gradient optimization of the Jacobian term in unsupervised deep learning , 2020, NeurIPS.

[33]  Aapo Hyvärinen,et al.  Uncovering the structure of clinical EEG signals with self-supervised learning , 2020, Journal of neural engineering.

[34]  Aapo Hyvärinen,et al.  Nonlinear ICA of Temporally Dependent Stationary Sources , 2017, AISTATS.

[35]  Laurenz Wiskott,et al.  An extension of slow feature analysis for nonlinear blind source separation , 2014, J. Mach. Learn. Res..

[36]  Bernhard Schölkopf,et al.  Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations , 2018, ICML.

[37]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[38]  K. Fukumizu,et al.  Causal Mosaic: Cause-Effect Inference via Nonlinear ICA and Ensemble Method , 2020, AISTATS.

[39]  M. Athans,et al.  State Estimation for Discrete Systems with Switching Parameters , 1978, IEEE Transactions on Aerospace and Electronic Systems.

[40]  Aapo Hyvärinen,et al.  Variational Autoencoders and Nonlinear ICA: A Unifying Framework , 2019, AISTATS.

[41]  William D. Marslen-Wilson,et al.  The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing , 2014, BMC Neurology.