Measuring Conditional Independence by Independent Residuals: Theoretical Results and Application in Causal Discovery

We investigate the relationship between conditional independence (CI) x ⊥ y|Z and the independence of two residuals x − E(x|Z) ⊥ y − E(y|Z), where x and y are two random variables, and Z is a set of random variables. We show that if x, y and Z are generated by following linear structural equation model and all external influences follow Gaussian distributions, then x ⊥ y|Z if and only if x − E(x|Z) ⊥ y − E(y|Z). That is, the test of x ⊥ y|Z can be relaxed to a simpler unconditional independence test of x − E(x|Z) ⊥ y − E(y|Z). Furthermore, if all these external influences follow non-Gaussian distributions and the model satisfies structural faithfulness condition, then we have x ⊥ y|Z ⇔ x− E(x|Z) ⊥ y − E(y|Z). We apply the results above to the causal discovery problem, where the causal directions are generally determined by a set of V -structures and their consistent propagations, so CI test-based methods can return a set of Markov equivalence classes. We show that in linear non-Gaussian context, x − E(x|Z) ⊥ y − E(y|Z) ⇒ x − E(x|Z) ⊥ z or y − E(y|Z) ⊥ z (∀z ∈ Z) if Z is a minimal d-separator, which implies z causes x (or y) if z directly connects to x (or y). Therefore, we conclude that CIs have useful information for distinguishing Markov equivalence classes. In summary, compared with the existing discretization-based and kernel-based CI testing methods, the proposed method provides a simpler way to measure CI, which needs only one unconditional independence test and two regression operations. When being applied to causal discovery, it can find more causal relationships, which is experimentally validated.

[1]  J. Daudin Partial association measures and an application to qualitative regression , 1980 .

[2]  H. White,et al.  A NONPARAMETRIC HELLINGER METRIC TEST FOR CONDITIONAL INDEPENDENCE , 2008, Econometric Theory.

[3]  Aapo Hyvärinen,et al.  DirectLiNGAM: A Direct Method for Learning a Linear Non-Gaussian Structural Equation Model , 2011, J. Mach. Learn. Res..

[4]  Illtyd Trethowan Causality , 1938 .

[5]  Bernhard Schölkopf,et al.  Causal Inference on Discrete Data Using Additive Noise Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Eric V. Strobl,et al.  Approximate Kernel-Based Conditional Independence Tests for Fast Non-Parametric Causal Discovery , 2017, Journal of Causal Inference.

[7]  Ruichu Cai,et al.  SADA: A General Framework to Support Robust Causation Discovery , 2013, ICML.

[8]  G. Darmois,et al.  Analyse générale des liaisons stochastiques: etude particulière de l'analyse factorielle linéaire , 1953 .

[9]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[10]  Bernhard Schölkopf,et al.  Nonlinear causal discovery with additive noise models , 2008, NIPS.

[11]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[12]  Bernhard Schölkopf,et al.  Kernel Measures of Conditional Dependence , 2007, NIPS.

[13]  Christopher Meek,et al.  Causal inference and causal explanation with background knowledge , 1995, UAI.

[14]  Joseph Ramsey,et al.  A Scalable Conditional Independence Test for Nonlinear, Non-Gaussian Data , 2014, ArXiv.

[15]  Bernhard Schölkopf,et al.  A Permutation-Based Kernel Conditional Independence Test , 2014, UAI.

[16]  Shuigeng Zhou,et al.  Causal Discovery Using Regression-Based Conditional Independence Tests , 2017, AAAI.

[17]  Peter Spirtes,et al.  Causal discovery and inference: concepts and recent methodological advances , 2016, Applied Informatics.

[18]  Bernhard Schölkopf,et al.  Kernel-based Conditional Independence Test and Application in Causal Discovery , 2011, UAI.

[19]  R. Shibata,et al.  PARTIAL CORRELATION AND CONDITIONAL CORRELATION AS MEASURES OF CONDITIONAL INDEPENDENCE , 2004 .

[20]  Alexander J. Smola,et al.  Gaussian Processes for Independence Tests with Non-iid Data in Causal Inference , 2015, ACM Trans. Intell. Syst. Technol..

[21]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[22]  Wicher P. Bergsma,et al.  Testing conditional independence for continuous random variables , 2004 .

[23]  Bernhard Schölkopf,et al.  Identification of causal relations in neuroimaging data with latent confounders: An instrumental variable approach , 2016, NeuroImage.

[24]  E. Lukács,et al.  A Property of the Normal Distribution , 1954 .

[25]  Bernhard Schölkopf,et al.  A Kernel Method for the Two-Sample-Problem , 2006, NIPS.

[26]  Aapo Hyvärinen,et al.  On the Identifiability of the Post-Nonlinear Causal Model , 2009, UAI.