Ancestor Relations in the Presence of Unobserved Variables

Bayesian networks (BNs) are an appealing model for causal and noncausal dependencies among a set of variables. Learning BNs from observational data is challenging due to the nonidentifiability of the network structure and model misspecification in the presence of unobserved (latent) variables. Here, we investigate the prospects of Bayesian learning of ancestor relations, including arcs, in the presence and absence of unobserved variables. An exact dynamic programming algorithm to compute the respective posterior probabilities is developed, under the complete data assumption. Our experimental results show that ancestor relations between observed variables, arcs in particular, can be learned with good power even when a majority of the involved variables are unobserved. For comparison, deduction of ancestor relations from single maximum a posteriori network structures or their Markov equivalence class appears somewhat inferior to Bayesian averaging. We also discuss some shortcomings of applying existing conditional independence test based methods for learning ancestor relations.

[1]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[2]  Robert Kennes,et al.  Computational aspects of the Mobius transformation of graphs , 1992, IEEE Trans. Syst. Man Cybern..

[3]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[4]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[5]  Thomas S. Richardson,et al.  Causal Inference in the Presence of Latent Variables and Selection Bias , 1995, UAI.

[6]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.

[7]  Nir Friedman,et al.  Discovering Hidden Variables: A Structure-Based Approach , 2000, NIPS.

[8]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[9]  Nir Friedman,et al.  Learning the Dimensionality of Hidden Variables , 2001, UAI.

[10]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[11]  Mikko Koivisto,et al.  Exact Bayesian Structure Discovery in Bayesian Networks , 2004, J. Mach. Learn. Res..

[12]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[13]  Nir Friedman,et al.  Learning Hidden Variable Networks: The Information Bottleneck Approach , 2005, J. Mach. Learn. Res..

[14]  Mikko Koivisto,et al.  Advances in Exact Bayesian Structure Discovery in Bayesian Networks , 2006, UAI.

[15]  Kevin P. Murphy,et al.  Bayesian structure learning using dynamic programming and MCMC , 2007, UAI.

[16]  W. Wong,et al.  Learning Causal Bayesian Network Structures From Experimental Data , 2008 .

[17]  Marco Grzegorczyk,et al.  Improving the structure MCMC sampler for Bayesian networks by introducing a new edge reversal move , 2008, Machine Learning.

[18]  Jin Tian,et al.  Computing Posterior Probabilities of Structural Features in Bayesian Networks , 2009, UAI.

[19]  Jin Tian,et al.  Bayesian model averaging using the k -best Bayesian network structures , 2010, UAI 2010.

[20]  Eleazar Eskin,et al.  Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features , 2010, AAAI.