Active Learning of Causal Networks with Intervention Experiments and Optimal Designs

The causal discovery from data is important for various scientific investigations. Because we cannot distinguish the different directed acyclic graphs (DAGs) in a Markov equivalence class learned from observational data, we have to collect further information on causal structures from experiments with external interventions. In this paper, we propose an active learning approach for discovering causal structures in which we first find a Markov equivalence class from observational data, and then we orient undirected edges in every chain component via intervention experiments separately. In the experiments, some variables are manipulated through external interventions. We discuss two kinds of intervention experiments, randomized experiment and quasi-experiment. Furthermore, we give two optimal designs of experiments, a batch-intervention design and a sequential-intervention design, to minimize the number of manipulated variables and the set of candidate structures based on the minimax and the maximum entropy criteria. We show theoretically that structural learning can be done locally in subgraphs of chain components without need of checking illegal v-structures and cycles in the whole network and that a Markov equivalence subclass obtained after each intervention can still be depicted as a chain graph.

[1]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[2]  Judea Pearl,et al.  Equivalence and Synthesis of Causal Models , 1990, UAI.

[3]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[4]  Judea Pearl,et al.  Causal inference from indirect experiments , 1995, Artif. Intell. Medicine.

[5]  Steffen L. Lauritzen,et al.  Graphical models in R , 1996 .

[6]  D. Madigan,et al.  A characterization of Markov equivalence classes for acyclic digraphs , 1997 .

[7]  Gregory F. Cooper,et al.  Causal Discovery from a Mixture of Experimental and Observational Data , 1999, UAI.

[8]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[9]  Daphne Koller,et al.  Active Learning for Structure in Bayesian Networks , 2001, IJCAI.

[10]  Jin Tian,et al.  Causal Discovery from Changes: a Bayesian Approach , 2001, UAI 2001.

[11]  Jin Tian,et al.  Causal Discovery from Changes , 2001, UAI.

[12]  Robert Castelo,et al.  Learning Essential Graph Markov Models From Data , 2002, Probabilistic Graphical Models.

[13]  S. Lauritzen,et al.  Chain graph models and their causal interpretations , 2002 .

[14]  Constantin F. Aliferis,et al.  Causal Explorer: A Causal Probabilistic Network Learning Toolkit for Biomedical Discovery , 2003, METMBS.

[15]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[16]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[17]  Martin A. Nowak,et al.  Inferring Cellular Networks Using Probabilistic Graphical Models , 2004 .

[18]  Yang Bo He,et al.  Learning Causal Structures Based on Markov Equivalence Class , 2005, ALT.

[19]  Kevin Murphy,et al.  Active Learning of Causal Bayes Net Structure , 2006 .

[20]  D. Heckerman,et al.  A Bayesian Approach to Causal Discovery , 2006 .

[21]  Peter Bühlmann,et al.  Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm , 2007, J. Mach. Learn. Res..