Two Optimal Strategies for Active Learning of Causal Models from Interventions

From observational data alone, a causal DAG is in general only identifiable up to Markov equivalence. Interventional data generally improves identifiability; however, the gain of an intervention strongly depends on the intervention target, i.e., the intervened variables. We present active learning strategies calculating optimal interventions for two different learning goals. The first one is a greedy approach using single-vertex interventions that maximizes the number of edges that can be oriented after each intervention. The second one yields in polynomial time a minimum set of targets of arbitrary size that guarantees full identifiability. This second approach proves a conjecture of Eberhardt (2008) indicating the number of unbounded intervention targets which is sufficient and in the worst case necessary for full identifiability. We compare our two active learning approaches to random interventions in a simulation study.

[1]  Bernhard Schölkopf,et al.  Identifiability of Causal Graphs using Functional Models , 2011, UAI.

[2]  Judea Pearl,et al.  On the Equivalence of Causal Models , 2013, 1304.1108.

[3]  Frederick Eberhardt,et al.  Almost Optimal Intervention Sets for Causal Discovery , 2008, UAI.

[4]  V. Chvátal Perfectly Ordered Graphs , 1984 .

[5]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[6]  Mao-cheng Cai On separating systems of graphs , 1984, Discret. Math..

[7]  Peter Bühlmann,et al.  Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs (Abstract) , 2011, UAI.

[8]  Kevin B. Korb,et al.  Varieties of Causal Intervention , 2004, PRICAI.

[9]  D. Rose Triangulated graphs and the elimination process , 1970 .

[10]  J. Pearl Causal diagrams for empirical research , 1995 .

[11]  Daphne Koller,et al.  Active Learning for Structure in Bayesian Networks , 2001, IJCAI.

[12]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[13]  Robert E. Tarjan,et al.  Algorithmic Aspects of Vertex Elimination on Graphs , 1976, SIAM J. Comput..

[14]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[15]  Yangbo He,et al.  Active Learning of Causal Networks with Intervention Experiments and Optimal Designs , 2008 .

[16]  Yuichiro Kitajima,et al.  Causation and Intervention in Algebraic Quantum Field Theory , 2010 .