Graphical Markov Models

Graphical Markov models are multivariate statistical models which are currently under vigorous development and which combine two simple but most powerful notions, generating processes in single and joint response variables and conditional independences captured by graphs. The development of graphical Markov started with work by Wermuth (1976, 1980) and Darroch, Lauritzen and Speed (1980) which built on early results in 1920 to 1930 by geneticist Sewall Wright and probabilist Andrej Markov as well as on results for log-linear models by Birch (1963), Goodman (1970), Bishop, Fienberg and Holland (1973) and for covariance selection by Dempster (1972). Wright used graphs, in which nodes represent variables and arrows indicate linear dependence, to describe hypotheses about stepwise processes in single responses that could have generated his data. He developed a method, called path analysis, to estimate linear dependences and to judge whether the hypotheses are well compatible with his data which he summarized in terms of simple and partial correlations. With this approach he was far ahead of his time, since corresponding formal statistical methods for estimation and tests of goodness of fit were developed much later and graphs that capture independences even much later than tests of goodness of fit. It remains a primary objective of graphical Markov models to uncover graphical representations that lead to an understanding of data generating processes. Such processes are no longer restricted to linear relations but contain linear dependences as special cases. A probabilistic data generating process is a recursive sequence of conditional distributions in which response variables may be vector variables that contain discrete or continuous components. Thereby, each conditional distribution specifies both the dependence of response Ya, say, on an explanatory variable vector Yb and the undirected associations of the components of Ya. Graphical Markov models also generalize sequences in single responses and single explanatory variables that have been named Markov chains, after probabilist Markov. He recognized at the beginning of the 29th century that seemingly complex joint probability distributions may be radically simplified by using the notion of conditional independence and defined what are now called Markov chains. In a Markov chain of random variables Y1, . . . , Yi, . . . , Yd, the joint distribution is built up by starting with the density of fd of Ydand generating next fd−1|d. Then, conditional independence of Yd−2 from Yd given Yd−1 is taken into account with fd−2|d−1,d = fd−2|d−1. One continues such that, with fi|i+1,...d = fi|i+1, response Yi is conditionally independent of Yi+2, . . . , Yd given Yi+1, written compactly

[1]  Steffen L. Lauritzen,et al.  Independence properties of directed markov fields , 1990, Networks.

[2]  David R. Cox,et al.  PRINCIPLES OF STATISTICAL INFERENCE , 2017 .

[3]  A. Dobra Variable selection and dependency networks for genomewide data. , 2009, Biostatistics.

[4]  Nanny Wermuth,et al.  Multivariate Dependencies: Models, Analysis and Interpretation , 1996 .

[5]  J. Lingoes The Multivariate Analysis Of Qualitative Data , 1968 .

[6]  David Edwards,et al.  Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests , 2010, BMC Bioinformatics.

[7]  J. Pearl Causal inference in statistics: An overview , 2009 .

[8]  M. Drton Discrete chain graph models , 2009, 0909.0843.

[9]  N. Wermuth Analogies between Multiplicative Models in Contingency Tables and Covariance Selection , 1976 .

[10]  J. Koster Marginalizing and conditioning in graphical models , 2002 .

[11]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[12]  Jin Tian,et al.  Markov Properties for Linear Causal Models with Correlated Errors , 2009, J. Mach. Learn. Res..

[13]  N. Wermuth PROBABILITY DISTRIBUTIONS WITH SUMMARY GRAPH STRUCTURE , 2010, 1003.3259.

[14]  Michael Eichler,et al.  Computing Maximum Likelihood Estimates in Recursive Linear Models with Correlated Errors , 2009, J. Mach. Learn. Res..

[15]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[16]  M. Mouchart,et al.  Ignorable Common Information, Null sets and Basu’s First Theorem , 2005 .

[17]  N. Wermuth,et al.  Causality: a Statistical View , 2004 .

[18]  D. Madigan,et al.  Alternative Markov Properties for Chain Graphs , 2001 .

[19]  P. Holland,et al.  Discrete Multivariate Analysis. , 1976 .

[20]  Chenlei Leng,et al.  Unified LASSO Estimation by Least Squares Approximation , 2007 .

[21]  A. Dawid Some Misleading Arguments Involving Conditional Independence , 1979 .

[22]  N. Wermuth,et al.  Partial inversion for linear systems and partial closure of independence graphs , 2006 .

[23]  N. Wermuth,et al.  Joint response graphs and separation induced by triangular systems , 2004 .

[24]  N. Wermuth,et al.  Distortion of effects caused by indirect confounding , 2008 .

[25]  M. W. Birch Maximum Likelihood in Three-Way Contingency Tables , 1963 .

[26]  N. Wermuth,et al.  Linear Dependencies Represented by Chain Graphs , 1993 .

[27]  G. M. Marchetti,et al.  Chain graph models of multivariate regression type for categorical data , 2009, 0906.2098.

[28]  N. Wermuth,et al.  Matrix representations and independencies in directed acyclic graphs , 2009, 0904.0333.

[29]  N. Wermuth Linear Recursive Equations, Covariance Selection, and Path Analysis , 1980 .

[30]  C.J.H. Mann,et al.  Probabilistic Conditional Independence Structures , 2005 .

[31]  Michael I. Jordan Graphical Models , 1998 .