Multi-domain Causal Structure Learning in Linear Systems

We study the problem of causal structure learning in linear systems from observational data given in multiple domains, across which the causal coefficients and/or the distributions of the exogenous noises may vary. Our approach rests on the principle that in a causally sufficient system, the causal modules, as well as the parameters they contain, change independently across domains. We first introduce our approach for a system of two variables and propose efficient methods for identifying the causal direction. We then generalize these methods to causal structure learning in networks of variables. Most previous work on structure learning from multi-domain data assumes that certain types of invariance hold in the causal modules across domains. Our approach unifies the ideas in those works and generalizes them to the case in which no such invariance holds across domains. Our proposed methods are generally capable of identifying the causal direction from fewer than ten domains. When the invariance property holds, two domains are generally sufficient.
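The independent-change principle can be illustrated with a small simulation. The sketch below is not the paper's method: it assumes a two-variable system X -> Y with Gaussian noise, parameter ranges chosen for illustration, and a crude dependence score (mean absolute pairwise correlation of log-parameters across domains). All function names are hypothetical. It also uses many more domains than the abstract mentions, because this simple score is noisy.

```python
import numpy as np

def direction_params(cause, effect):
    """Log-parameters of the linear model effect = coef * cause + noise
    in one domain: variance of the cause, |slope|, residual variance."""
    c = np.cov(cause, effect)               # 2x2 sample covariance matrix
    var_cause = c[0, 0]
    coef = c[0, 1] / var_cause              # least-squares slope
    resid_var = c[1, 1] - coef * c[0, 1]    # variance left after regression
    return np.log([var_cause, abs(coef), resid_var])

def dependence_score(params):
    """Mean absolute pairwise correlation between the parameter sequences
    (rows = domains, columns = parameters). An assumed, crude measure."""
    corr = np.corrcoef(params, rowvar=False)
    upper = np.triu_indices_from(corr, k=1)
    return float(np.mean(np.abs(corr[upper])))

def simulate_scores(n_domains=300, n_samples=1000, seed=0):
    """Simulate X -> Y in many domains; return scores for X->Y and Y->X."""
    rng = np.random.default_rng(seed)
    forward, backward = [], []
    for _ in range(n_domains):
        # Each module's parameter is drawn independently in every domain.
        a = rng.uniform(1.0, 3.0)           # causal coefficient (assumed range)
        var_x = rng.uniform(0.5, 2.0)       # variance of the cause's noise
        var_n = rng.uniform(0.5, 2.0)       # variance of the effect's noise
        x = rng.normal(0.0, np.sqrt(var_x), n_samples)
        y = a * x + rng.normal(0.0, np.sqrt(var_n), n_samples)
        forward.append(direction_params(x, y))
        backward.append(direction_params(y, x))
    return dependence_score(np.array(forward)), dependence_score(np.array(backward))

score_xy, score_yx = simulate_scores()
print(f"dependence score X->Y: {score_xy:.3f}")
print(f"dependence score Y->X: {score_yx:.3f}")
```

In the causal direction the three parameters are generated independently, so their estimates should be close to uncorrelated across domains. In the reverse direction each parameter is a function of all three generating quantities, so the score is expected to be larger.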
