Local Causal Discovery in Multiple Manipulated Datasets

We consider the problem of distinguishing direct causes from direct effects of a target variable of interest from multiple manipulated datasets with unknown manipulated variables and nonidentical data distributions. Recent studies have shown that datasets attained from manipulated experiments (i.e., manipulated data) contain richer causal information than observational data for causal structure learning. Thus, in this article, we propose a new algorithm, which makes full use of the interventional properties of a causal model to discover the direct causes and direct effects of a target variable from multiple datasets with different manipulations. It is more suited to real-world cases and is also a challenge to be addressed in this article. First, we apply the backward framework to learn parents and children (PC) of a given target from multiple manipulated datasets. Second, we orient some edges connected to the target in advance through the assumption that the target variable is not manipulated and then orient the remaining undirected edges by finding invariant V-structures from multiple datasets. Third, we analyze the correctness of the proposed algorithm. To the best of our knowledge, the proposed algorithm is the first that can identify the local causal structure of a given target from multiple manipulated datasets with unknown manipulated variables. Experimental results on standard Bayesian networks validate the effectiveness of our algorithm.

[1]  Hao Wang,et al.  Using Feature Selection for Local Causal Structure Learning , 2021, IEEE Transactions on Emerging Topics in Computational Intelligence.

[2]  Xindong Wu,et al.  Towards Efficient Local Causal Structure Learning , 2021, IEEE Transactions on Big Data.

[3]  Yoshua Bengio,et al.  Towards Causal Representation Learning , 2021, ArXiv.

[4]  Kui Yu,et al.  Multi-Source Causal Feature Selection , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Alexandre Lacoste,et al.  Differentiable Causal Discovery from Interventional Data , 2020, NeurIPS.

[6]  Kun Zhang,et al.  Causal Discovery from Multiple Data Sets with Non-Identical Variable Sets , 2020, AAAI.

[7]  Caroline Uhler,et al.  Permutation-Based Causal Structure Learning with Unknown Intervention Targets , 2019, UAI.

[8]  E. Xing,et al.  Learning Sparse Nonparametric DAGs , 2019, AISTATS.

[9]  L. Goldberg,et al.  The Book of Why: The New Science of Cause and Effect† , 2019, Quantitative Finance.

[10]  Bernhard Schölkopf,et al.  Causal Discovery from Heterogeneous/Nonstationary Data , 2019, J. Mach. Learn. Res..

[11]  Mélanie Frappier,et al.  The Book of Why: The New Science of Cause and Effect , 2018, Science.

[12]  Pradeep Ravikumar,et al.  DAGs with NO TEARS: Continuous Optimization for Structure Learning , 2018, NeurIPS.

[13]  Caroline Uhler,et al.  Characterizing and Learning Equivalence Classes of Causal DAGs under Interventions , 2018, ICML.

[14]  Kui Yu,et al.  Discovering Markov Blanket from Multiple interventional Datasets , 2018, ArXiv.

[15]  Bernhard Schölkopf,et al.  Causal Discovery from Nonstationary/Heterogeneous Data: Skeleton Estimation and Orientation Determination , 2017, IJCAI.

[16]  Kshitij P. Fadnis,et al.  Local-to-Global Bayesian Network Structure Learning , 2017, ICML.

[17]  Kun Zhang,et al.  Learning Causal Structures Using Regression Invariance , 2017, NIPS.

[18]  Joris M. Mooij,et al.  Joint Causal Inference from Multiple Contexts , 2016, J. Mach. Learn. Res..

[19]  Zhi Geng,et al.  Causal Network Learning from Multiple Interventions of Unknown Manipulated Targets , 2016, ArXiv.

[20]  N. Meinshausen,et al.  Methods for causal inference from gene perturbation experiments and validation , 2016, Proceedings of the National Academy of Sciences.

[21]  J. Pearl,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[22]  Garnett,et al.  Ancestral Causal Inference , 2016, NIPS.

[23]  Qiang Ji,et al.  Local Causal Discovery of Direct Causes and Effects , 2015, NIPS.

[24]  Bernhard Schölkopf,et al.  Invariant Models for Causal Transfer Learning , 2015, J. Mach. Learn. Res..

[25]  Jonas Peters,et al.  Causal inference by using invariant prediction: identification and confidence intervals , 2015, 1501.01332.

[26]  Ioannis Tsamardinos,et al.  Constraint-based causal discovery from multiple interventions over overlapping variable sets , 2014, J. Mach. Learn. Res..

[27]  Alain Hauser,et al.  Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs , 2011, J. Mach. Learn. Res..

[28]  Ping He,et al.  Partial orientation and local structural learning of causal networks for prediction , 2008, WCCI Causation and Prediction Challenge.

[29]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[30]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[31]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[32]  David Maxwell Chickering,et al.  Learning Equivalence Classes of Bayesian Network Structures , 1996, UAI.

[33]  Murat Kocaoglu,et al.  Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning , 2020, NeurIPS.

[34]  TriantafillouSofia,et al.  Constraint-based causal discovery from multiple interventions over overlapping variable sets , 2015 .