Maximum likelihood fitting of acyclic directed mixed graphs to binary data

Acyclic directed mixed graphs, also known as semi-Markov models represent the conditional independence structure induced on an observed margin by a DAG model with latent variables. In this paper we present the first method for fitting these models to binary data using maximum likelihood estimation.

[1]  Thomas S. Richardson,et al.  Towards Characterizing Markov Equivalence Classes for Directed Acyclic Graphs with Latent Variables , 2005, UAI.

[2]  T. Richardson Markov Properties for Acyclic Directed Mixed Graphs , 2003 .

[3]  Stephen E. Fienberg,et al.  The analysis of cross-classified categorical data , 1980 .

[4]  Søren Johansen,et al.  Introduction to the theory of regular exponential families , 1979 .

[5]  Judea Pearl,et al.  Identification of Conditional Interventional Distributions , 2006, UAI.

[6]  Zoubin Ghahramani,et al.  The Hidden Life of Latent Variables: Bayesian Learning with Mixed Graph Models , 2009, J. Mach. Learn. Res..

[7]  Brendan J. Frey,et al.  Cumulative Distribution Networks and the Derivative-sum-product Algorithm: Models and Inference for Cumulative Distribution Functions on Graphs , 2008, J. Mach. Learn. Res..

[8]  Joseph B. Lang,et al.  Association-Marginal Modeling of Multivariate Categorical Responses: A Maximum Likelihood Approach , 1999 .

[9]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[10]  M. Drton Likelihood ratio tests and singularities , 2007, math/0703360.

[11]  Marco Valtorta,et al.  Pearl's Calculus of Intervention Is Complete , 2006, UAI.

[12]  T. Richardson,et al.  Binary models for marginal independence , 2007, 0707.3794.

[13]  Thomas S. Richardson,et al.  A factorization criterion for acyclic directed mixed graphs , 2009, UAI.

[14]  A. Dawid Conditional Independence in Statistical Theory , 1979 .

[15]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[16]  Thomas S. Richardson,et al.  Graphical Methods for Efficient Likelihood Inference in Gaussian Covariance Models , 2007, J. Mach. Learn. Res..

[17]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[18]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .