论文信息 - A Ladder of Causal Distances

A Ladder of Causal Distances

Causal discovery, the task of automatically constructing a causal model from data, is of major significance across the sciences. Evaluating the performance of causal discovery algorithms should ideally involve comparing the inferred models to ground-truth models available for benchmark datasets, which in turn requires a notion of distance between causal models. While such distances have been proposed previously, they are limited by focusing on graphical properties of the causal models being compared. Here, we overcome this limitation by defining distances derived from the causal distributions induced by the models, rather than exclusively from their graphical structure. Pearl and Mackenzie (2018) have arranged the properties of causal models in a hierarchy called the "ladder of causation" spanning three rungs: observational, interventional, and counterfactual. Following this organization, we introduce a hierarchy of three distances, one for each rung of the ladder. Our definitions are intuitively appealing as well as efficient to compute approximately. We put our causal distances to use by benchmarking standard causal discovery systems on both synthetic and real-world datasets for which ground-truth causal models are available. Finally, we highlight the usefulness of our causal distances by briefly discussing further applications beyond the evaluation of causal discovery techniques.

Robert West | Maxime Peyrard

[1] K. Sachs,et al. Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[2] Robert L. Shook. The book of why , 1983 .

[3] C. Villani. Optimal Transport: Old and New , 2008 .

[4] David Maxwell Chickering,et al. Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[5] David Maxwell Chickering,et al. Finding Optimal Bayesian Networks , 2002, UAI.

[6] Judy Hall,et al. The Book of Why , 2008 .

[7] Zachary Chase Lipton. The mythos of model interpretability , 2016, ACM Queue.

[8] Qing Zhou,et al. Concave penalized estimation of sparse Gaussian Bayesian networks , 2014, J. Mach. Learn. Res..

[9] J. Pearl. Causality: Models, Reasoning and Inference , 2000 .

[10] Constantin F. Aliferis,et al. Algorithms for Large Scale Markov Blanket Discovery , 2003, FLAIRS.

[11] Marek J. Druzdzel,et al. A comparison of structural distance measures for causal Bayesian network models , 2009 .