Constraint-based causal discovery from multiple interventions over overlapping variable sets

Scientific practice typically involves repeatedly studying a system, each time trying to unravel a different perspective. In each study, the scientist may take measurements under different experimental conditions (interventions, manipulations, perturbations) and measure different sets of quantities (variables). The result is a collection of heterogeneous data sets coming from different data distributions. In this work, we present algorithm COmbINE, which accepts a collection of data sets over overlapping variable sets under different experimental conditions; COmbINE then outputs a summary of all causal models indicating the invariant and variant structural characteristics of all models that simultaneously fit all of the input data sets. COmbINE converts estimated dependencies and independencies in the data into path constraints on the data-generating causal model and encodes them as a SAT instance. The algorithm is sound and complete in the sample limit. To account for conflicting constraints arising from statistical errors, we introduce a general method for sorting constraints in order of confidence, computed as a function of their corresponding p-values. In our empirical evaluation, COmbINE outperforms in terms of efficiency the only pre-existing similar algorithm; the latter additionally admits feedback cycles, but does not admit conflicting constraints which hinders the applicability on real data. As a proof-of-concept, COmbINE is employed to co-analyze 4 real, mass-cytometry data sets measuring phosphorylated protein concentrations of overlapping protein sets under 3 different interventions.

[1]  M. J. Bayarri,et al.  Calibration of ρ Values for Testing Precise Null Hypotheses , 2001 .

[2]  Daphne Koller,et al.  Active Learning for Structure in Bayesian Networks , 2001, IJCAI.

[3]  James M. Robins,et al.  Nested Markov Properties for Acyclic Directed Mixed Graphs , 2012, UAI.

[4]  Gregory F. Cooper,et al.  The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[5]  Richard Scheines,et al.  A Statistical Problem for Inference to Regulatory Structure from Associations of Gene Expression Measurements with Microarrays , 2003, Bioinform..

[6]  J. Pearl,et al.  On the Identication of Causal Eects , 2003 .

[7]  Thomas S. Richardson,et al.  Learning high-dimensional directed acyclic graphs with latent and selection variables , 2011, 1104.5617.

[8]  Peter Spirtes,et al.  Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables , 2011, AISTATS.

[9]  Ioannis G. Tollis,et al.  Learning Causal Structure from Overlapping Variable Sets , 2010, AISTATS.

[10]  David Heckerman,et al.  Learning Gaussian Networks , 1994, UAI.

[11]  R. A. Fisher,et al.  Design of Experiments , 1936 .

[12]  James M. Robins,et al.  Sparse Nested Markov models with Log-linear Parameters , 2013, UAI.

[13]  Sean C. Bendall,et al.  Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum , 2011, Science.

[14]  Bernard Manderick,et al.  Learning Semi-Markovian Causal Models using Experiments , 2006, Probabilistic Graphical Models.

[15]  Jiji Zhang,et al.  Adjacency-Faithfulness and Conservative Causal Inference , 2006, UAI.

[16]  Judea Pearl,et al.  Equivalence and Synthesis of Causal Models , 1990, UAI.

[17]  David Danks,et al.  Integrating Locally Learned Causal Structures with Overlapping Variables , 2008, NIPS.

[18]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[19]  R. Fisher On the Interpretation of χ 2 from Contingency Tables , and the Calculation of P Author , 2022 .

[20]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[21]  Tom Heskes,et al.  Learning causal network structure from multiple (in)dependence models , 2010 .

[22]  Frederick Eberhardt,et al.  Discovering Cyclic Causal Models with Latent Variables: A General SAT-Based Procedure , 2013, UAI.

[23]  P. Spirtes,et al.  A Polynomial Time Algorithm for Determining DAG Equivalence in the Presence of Latent Variables and Selection Bias , 1997, AISTATS.

[24]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[25]  Thomas S. Richardson,et al.  Maximum likelihood fitting of acyclic directed mixed graphs to binary data , 2010, UAI.

[26]  Tom Heskes,et al.  Causal discovery in multiple models from different experiments , 2010, NIPS.

[27]  Jin Tian,et al.  On the Identification of Causal Effects , 2015 .

[28]  Kevin P. Murphy,et al.  Exact Bayesian structure learning from uncertain interventions , 2007, AISTATS.

[29]  Kayvan Sadeghi,et al.  Graphical representation of independence structures , 2012 .

[30]  Frederick Eberhardt,et al.  Learning linear cyclic causal models with latent variables , 2012, J. Mach. Learn. Res..

[31]  Frederick Eberhardt,et al.  Causal Discovery of Linear Cyclic Models from Multiple Experimental Data Sets with Overlapping Variables , 2012, UAI.

[32]  Gregory F. Cooper,et al.  Causal Discovery from a Mixture of Experimental and Observational Data , 1999, UAI.

[33]  R. Scheines,et al.  Interventions and Causal Inference , 2007, Philosophy of Science.

[34]  Tom Heskes,et al.  A Bayesian Approach to Constraint Based Causal Inference , 2012, UAI.

[35]  Peter Bühlmann,et al.  Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs (Abstract) , 2011, UAI.

[36]  T. Richardson Markov Properties for Acyclic Directed Mixed Graphs , 2003 .

[37]  Jiji Zhang,et al.  Causal Reasoning with Ancestral Graphs , 2008, J. Mach. Learn. Res..

[38]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Jiji Zhang,et al.  On the completeness of orientation rules for causal discovery in the presence of latent confounders and selection bias , 2008, Artif. Intell..

[40]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2018, Journal of the Royal Statistical Society Series A (Statistics in Society).

[41]  T. Richardson,et al.  Marginal log‐linear parameters for graphical Markov models , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[42]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[43]  Jiji Zhang,et al.  Causal Inference and Reasoning in Causally Insu-cient Systems , 2006 .

[44]  Niklas Sörensson,et al.  An Extensible SAT-solver , 2003, SAT.

[45]  Kevin Murphy,et al.  Active Learning of Causal Bayes Net Structure , 2006 .

[46]  Vincenzo Lagani,et al.  Towards Integrative Causal Analysis of Heterogeneous Data Sets and Studies , 2012, J. Mach. Learn. Res..

[47]  Joris M. Mooij,et al.  Cyclic Causal Discovery from Continuous Equilibrium Data , 2013, UAI.

[48]  Munther A. Dahleh,et al.  Structure learning in causal cyclic networks , 2008 .

[49]  K. Murphy,et al.  Belief net structure learning from uncertain interventions , 2007 .

[50]  Karen Sachs,et al.  Multiplexed mass cytometry profiling of cellular states perturbed by small-molecule regulators , 2012, Nature Biotechnology.

[51]  Adrian Kügel,et al.  Improved Exact Solver for the Weighted MAX-SAT Problem , 2010, POS@SAT.

[52]  Peipei Ping,et al.  p38 Kinase-dependent MAPKAPK-2 Activation Functions as 3-Phosphoinositide-dependent Kinase-2 for Akt in Human Neutrophils* , 2001, The Journal of Biological Chemistry.

[53]  Bart Selman,et al.  Heavy-Tailed Phenomena in Satisfiability and Constraint Satisfaction Problems , 2000, Journal of Automated Reasoning.

[54]  P. Spirtes,et al.  MARKOV EQUIVALENCE FOR ANCESTRAL GRAPHS , 2009, 0908.3605.