A Ladder of Causal Distances

Causal discovery, the task of automatically constructing a causal model from data, is of major significance across the sciences. Evaluating the performance of causal discovery algorithms should ideally involve comparing the inferred models to ground-truth models available for benchmark datasets, which in turn requires a notion of distance between causal models. While such distances have been proposed previously, they are limited by focusing on graphical properties of the causal models being compared. Here, we overcome this limitation by defining distances derived from the causal distributions induced by the models, rather than exclusively from their graphical structure. Pearl and Mackenzie (2018) have arranged the properties of causal models in a hierarchy called the "ladder of causation" spanning three rungs: observational, interventional, and counterfactual. Following this organization, we introduce a hierarchy of three distances, one for each rung of the ladder. Our definitions are intuitively appealing as well as efficient to compute approximately. We put our causal distances to use by benchmarking standard causal discovery systems on both synthetic and real-world datasets for which ground-truth causal models are available. Finally, we highlight the usefulness of our causal distances by briefly discussing further applications beyond the evaluation of causal discovery techniques.

[1]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[2]  Robert L. Shook The book of why , 1983 .

[3]  C. Villani Optimal Transport: Old and New , 2008 .

[4]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[5]  David Maxwell Chickering,et al.  Finding Optimal Bayesian Networks , 2002, UAI.

[6]  Judy Hall,et al.  The Book of Why , 2008 .

[7]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[8]  Qing Zhou,et al.  Concave penalized estimation of sparse Gaussian Bayesian networks , 2014, J. Mach. Learn. Res..

[9]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[10]  Constantin F. Aliferis,et al.  Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[11]  Marek J. Druzdzel,et al.  A comparison of structural distance measures for causal Bayesian network models , 2009 .

[12]  Judea Pearl,et al.  Equivalence and Synthesis of Causal Models , 1990, UAI.

[13]  Matt J. Kusner,et al.  Counterfactual Fairness , 2017, NIPS.

[14]  Bernhard Schölkopf,et al.  Elements of Causal Inference: Foundations and Learning Algorithms , 2017 .

[15]  Constantin F. Aliferis,et al.  Time and sample efficient discovery of Markov blankets and direct causal relations , 2003, KDD '03.

[16]  Matthias Bethge,et al.  A note on the evaluation of generative models , 2015, ICLR.

[17]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[18]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[19]  Bernhard Schölkopf,et al.  Hilbert Space Embeddings and Metrics on Probability Measures , 2009, J. Mach. Learn. Res..

[20]  Luis M. de Campos,et al.  Searching for Bayesian Network Structures in the Space of Restricted Acyclic Partially Directed Graphs , 2011, J. Artif. Intell. Res..

[21]  Daniel Zelterman,et al.  Bayesian Artificial Intelligence , 2005, Technometrics.

[22]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[23]  Stuart J. Russell,et al.  Adaptive Probabilistic Networks with Hidden Variables , 1997, Machine Learning.

[24]  Jean-Baptiste Denis,et al.  Bayesian Networks , 2014 .

[25]  Gautam Shroff,et al.  Comparative Benchmarking of Causal Discovery Techniques , 2017, ArXiv.

[26]  Constantinos Daskalakis,et al.  Learning and Testing Causal Models with Interventions , 2018, NeurIPS.

[27]  Amanda Gentzel,et al.  The Case for Evaluating Causal Models Using Interventional Measures and Empirical Data , 2019, NeurIPS.