Collaborative Causal Discovery with Atomic Interventions

We introduce a new Collaborative Causal Discovery problem, through which we model a common scenario in which we have multiple independent entities each with their own causal graph, and the goal is to simultaneously learn all these causal graphs. We study this problem without the causal sufficiency assumption, using Maximal Ancestral Graphs (MAG) to model the causal graphs, and assuming that we have the ability to actively perform independent single vertex (or atomic) interventions on the entities. If the M underlying (unknown) causal graphs of the entities satisfy a natural notion of clustering, we give algorithms that leverage this property, and recovers all the causal graphs using roughly logarithmic inM number of atomic interventions per entity. These are significantly fewer than n atomic interventions per entity required to learn each causal graph separately, where n is the number of observable nodes in the causal graph. We complement our results with a lower bound and discuss various extensions of our collaborative setting.

[1]  Mathias Frisch,et al.  Causation and intervention , 2014 .

[2]  Peter Bühlmann,et al.  Causal Inference Using Graphical Models with the R Package pcalg , 2012 .

[3]  Sivaraman Balakrishnan,et al.  Minimax optimal conditional independence testing , 2021, The Annals of Statistics.

[4]  Alexandros G. Dimakis,et al.  Experimental Design for Cost-Aware Learning of Causal Graphs , 2018, NeurIPS.

[5]  Avrim Blum,et al.  Center-based clustering under perturbation stability , 2010, Inf. Process. Lett..

[6]  Karthikeyan Shanmugam,et al.  Experimental Design for Learning Causal Graphs with Latent Variables , 2017, NIPS.

[7]  AmirEmad Ghassami,et al.  Interventional Experiment Design for Causal Structure Learning , 2019, ArXiv.

[8]  Kathleen M. Gates,et al.  Group search algorithm recovers effective connectivity maps for individuals in homogeneous and heterogeneous samples , 2012, NeuroImage.

[9]  Andrew McGregor,et al.  Intervention Efficient Algorithms for Approximate Learning of Causal Graphs , 2020, ALT.

[10]  Dmitriy Katz,et al.  Size of Interventional Markov Equivalence Classes in Random DAG Models , 2019, AISTATS.

[11]  Peter Bühlmann,et al.  Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs (Abstract) , 2011, UAI.

[12]  Thomas S. Richardson,et al.  Learning high-dimensional directed acyclic graphs with latent and selection variables , 2011, 1104.5617.

[13]  Frederick Eberhardt,et al.  Experiment selection for causal discovery , 2013, J. Mach. Learn. Res..

[14]  Karthikeyan Shanmugam,et al.  Characterization and Learning of Causal Graphs with Latent Variables from Soft Interventions , 2019, NeurIPS.

[15]  Clark Glymour,et al.  On meta-analyses of imaging data and the mixture of records , 2011, NeuroImage.

[16]  Alexandros G. Dimakis,et al.  Learning Causal Graphs with Small Interventions , 2015, NIPS.

[17]  Constantinos Daskalakis,et al.  Learning and Testing Causal Models with Interventions , 2018, NeurIPS.

[18]  J. Pearl Causal diagrams for empirical research , 1995 .

[19]  Adrian Vetta,et al.  Randomized Experimental Design for Causal Graph Discovery , 2014, NIPS.

[20]  Eric Zamai,et al.  Fault diagnosis for the complex manufacturing system , 2016 .

[21]  Basil Saeed,et al.  Causal Structure Discovery from Distributions Arising from Mixtures of DAGs , 2020, ICML.

[22]  Jiji Zhang,et al.  On the completeness of orientation rules for causal discovery in the presence of latent confounders and selection bias , 2008, Artif. Intell..

[23]  M. Kerimov The theory of error-correcting codes☆ , 1980 .

[24]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[25]  Shai Ben-David,et al.  Clustering with Same-Cluster Queries , 2016, NIPS.

[26]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[27]  O. Antoine,et al.  Theory of Error-correcting Codes , 2022 .

[28]  Jiji Zhang,et al.  Causal Reasoning with Ancestral Graphs , 2008, J. Mach. Learn. Res..

[29]  Jean Honorio,et al.  Computationally and statistically efficient learning of causal Bayes nets using path queries , 2017, NeurIPS.

[30]  Andrew McGregor,et al.  Efficient Intervention Design for Causal Discovery with Latents , 2020, ICML.

[31]  Andrew Chi-Chih Yao,et al.  Probabilistic computations: Toward a unified measure of complexity , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[32]  Thomas S. Richardson,et al.  Causal Inference in the Presence of Latent Variables and Selection Bias , 1995, UAI.

[33]  Prasad Tadepalli,et al.  PAC Learning of Causal Trees with Latent Variables , 2021, AAAI.

[34]  Kristjan H. Greenewald,et al.  Sample Efficient Active Learning of Causal Trees , 2019, NeurIPS.

[35]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[36]  Murat Kocaoglu,et al.  Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning , 2020, NeurIPS.

[37]  Elias Bareinboim,et al.  Budgeted Experiment Design for Causal Structure Learning , 2017, ICML.

[38]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[39]  Paolo Vineis,et al.  Causal diagrams in systems epidemiology , 2012, Emerging Themes in Epidemiology.