We consider the bivariate causal discovery problem. While this problem has been extensively studied, the majority of current methods assume a linear causal relationship, and the few methods which consider non-linear dependencies usually make the assumption of additive noise. Here, we propose a framework through which we can perform causal discovery in the presence of general non-linear relationships. The proposed method exploits a correspondence between a piecewise stationary non-linear ICA model and non-linear causal models. We show that in the case of bivariate causal discovery, non-linear ICA can be used to infer the causal direction via a series of independence tests. A series of experiments on simulated data demonstrate the capabilities of the proposed method. Causal models play a fundamental role in modern scientific endeavor [Spirtes et al., 2000, Pearl, 2009]. While randomized control studies are the gold standard, such an approach is unfeasible or unethical in many scenarios [Spirtes and Zhang, 2016]. Furthermore, big data sets publicly available on the internet often try to be generic and thus cannot be strongly based on specific intervention. As such, it is both necessary and important to develop causal discovery methods through which to uncover causal structure from (potentially large-scale) passively observed data. Data collected without the explicit manipulation of certain variables is often termed observational data, in contrast to experimental data where certain variables are intervened upon, as in randomized controlled trials. In this work we focus on the bivariate causal discovery problem. This corresponds to recovering the causal structure using observations from two variables, which we denote by X1 and X2. While bivariate causal discovery is a (simplified) special case of the more general causal discovery problem, it remains a challenging task. To date, the identification of causal relationships between two variables has primarily been studied under the assumption of linear causal dependencies [Kano and Shimizu, 2003, Hyvärinen and Smith, 2013]. Several extensions to accommodate non-linear causal dependencies have also been proposed, however, the majority of such methods typically assume additive noise [Hoyer et al., 2009, Peters et al., 2014] Here, we propose a general method for bivariate causal discovery in the presence of general nonlinearities. The proposed method is able to uncover non-linear causal relationships without requiring assumptions such as linear causal structure or additive noise. Our approach is based on the correspondence between a piece-wise stationary non-linear ICA model and a non-linear structural equation model (SEM). The proposed method therefore shares similarities with linear-ICA based causal discovery methods [Shimizu et al., 2006]. Under the assumption that we observed bivariate data generated under a specific non-linear ICA model utilizing non-stationarity, we demonstrate that if latent sources can be recovered via non-linear ICA, then a series of independence tests can be employed to uncover causal structure.
[1]
Illtyd Trethowan.
Causality
,
1938
.
[2]
Aapo Hyvärinen,et al.
Nonlinear independent component analysis: Existence and uniqueness results
,
1999,
Neural Networks.
[3]
Tom Burr,et al.
Causation, Prediction, and Search
,
2003,
Technometrics.
[4]
Y. Kano,et al.
Causal Inference Using Nonnormality
,
2004
.
[5]
Bernhard Schölkopf,et al.
Measuring Statistical Dependence with Hilbert-Schmidt Norms
,
2005,
ALT.
[6]
Aapo Hyvärinen,et al.
A Linear Non-Gaussian Acyclic Model for Causal Discovery
,
2006,
J. Mach. Learn. Res..
[7]
Bernhard Schölkopf,et al.
Nonlinear causal discovery with additive noise models
,
2008,
NIPS.
[8]
Aapo Hyvärinen,et al.
On the Identifiability of the Post-Nonlinear Causal Model
,
2009,
UAI.
[9]
Aapo Hyvärinen,et al.
DirectLiNGAM: A Direct Method for Learning a Linear Non-Gaussian Structural Equation Model
,
2011,
J. Mach. Learn. Res..
[10]
Aapo Hyvärinen,et al.
Pairwise likelihood ratios for estimation of non-Gaussian structural equation models
,
2013,
J. Mach. Learn. Res..
[11]
Bernhard Schölkopf,et al.
Causal discovery with continuous additive noise models
,
2013,
J. Mach. Learn. Res..
[12]
Jonas Peters,et al.
Causal inference by using invariant prediction: identification and confidence intervals
,
2015,
1501.01332.
[13]
Aapo Hyvärinen,et al.
Unsupervised Feature Extraction by Time-Contrastive Learning and Nonlinear ICA
,
2016,
NIPS.
[14]
Peter Spirtes,et al.
Causal discovery and inference: concepts and recent methodological advances
,
2016,
Applied Informatics.
[15]
Bernhard Schölkopf,et al.
Causal Discovery from Nonstationary/Heterogeneous Data: Skeleton Estimation and Orientation Determination
,
2017,
IJCAI.
[16]
Aapo Hyvärinen,et al.
Nonlinear ICA of Temporally Dependent Stationary Sources
,
2017,
AISTATS.