1 6 M ar 2 01 0 PROBABILITY DISTRIBUTIONS WITH SUMMARY GRAPH STRUCTURE

A set of independence statements may define the independence structure of interest in a family of joint probability distributions. This structure is often captured by a graph that consists of nodes representing the random variables and of edges that couple node pairs. One important class are multivariate regression chain graphs. They describe the independences of stepwise processes, in which at each step single or joint responses are generated given the relevant explanatory variables in their past. For joint densities that then result after possible marginalising or conditioning, we use summary graphs. These graphs reflect the independence structure implied by the generating process for the reduced set of variables and they preserve the implied independences after additional marginalising and conditioning. They can identify generating dependences which remain unchanged and alert to possibly severe distortions due to direct and indirect confounding. Operators for matrix representations of graphs are used to derive these properties of summary graphs and to translate them into special types of path in graphs. 1. Motivation, some previous and some of the new results. 1.1. Motivation. Graphical Markov models are probability distributions defined for a d V × 1 random vector variable Y V whose component variables may be discrete or continuous and whose joint density f V satisfies the independence statements specified directly as well as those implied by an associated graph. The set of all such statements is the independence structure captured by the graph. One such type of graph had been introduced for multivariate regression A multivariate regression chain graph consists of nodes in set, V , that represent random variables and of edges that couple node pairs such that a re-cursive order of the joint responses is reflected in the graph and each defining independence constraint respects the given ordering; see Marchetti and Lupparelli (2010). This distinguishes multivariate regression graphs from all other currently known types of chain graphs; see Drton (2009) for implications on associated discrete distributions. Because of this property, multivariate regression chain graphs are particularly well suited for studies of effects of hypothesized causes on joint responses, see Cox and Wermuth (2004), and more generally for modeling developmental processes, such as in panel studies. These provide data on a group of individuals, termed the 'panel', collected repeatedly, say over years or decades. Often one wants to compare corresponding analyses with results in other studies that have core sets of variables in common, …

[1]  E. Lehmann,et al.  Completeness, Similar Regions, and Unbiased Estimation—Part II , 2012 .

[2]  N. Wermuth,et al.  Changing Parameters by Partial Mappings , 2010 .

[3]  Jin Tian,et al.  Markov Properties for Linear Causal Models with Correlated Errors , 2009, J. Mach. Learn. Res..

[4]  M. Drton Discrete chain graph models , 2009, 0909.0843.

[5]  N. Wermuth,et al.  Matrix representations and independencies in directed acyclic graphs , 2009, 0904.0333.

[6]  Michael Eichler,et al.  Computing Maximum Likelihood Estimates in Recursive Linear Models with Correlated Errors , 2009, J. Mach. Learn. Res..

[7]  N. Wermuth,et al.  Triangular systems for symmetric binary variables , 2009 .

[8]  N. Wermuth,et al.  Distortion of effects caused by indirect confounding , 2008 .

[9]  Kayvan Sadeghi Graph Theoretical Proofs for Separation Criteria , 2008 .

[10]  M. Mouchart,et al.  On joint completeness: sampling and Bayesian versions, and their connections , 2007 .

[11]  N. Wermuth,et al.  Partial inversion for linear systems and partial closure of independence graphs , 2006 .

[12]  David R. Cox,et al.  PRINCIPLES OF STATISTICAL INFERENCE , 2017 .

[13]  M. Mouchart,et al.  Ignorable Common Information, Null sets and Basu’s First Theorem , 2005 .

[14]  N. Wermuth,et al.  Causality: a Statistical View , 2004 .

[15]  N. Wermuth,et al.  Joint response graphs and separation induced by triangular systems , 2004 .

[16]  David Madigan,et al.  A graphical characterization of lattice conditional independence models , 2004, Annals of Mathematics and Artificial Intelligence.

[17]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[18]  Arno Siebes,et al.  A characterization of moral transitive acyclic directed graph Markov models as labeled trees , 2003 .

[19]  J. Koster Marginalizing and conditioning in graphical models , 2002 .

[20]  J. Pearl,et al.  A New Identification Condition for Recursive Models With Correlated Errors , 2002 .

[21]  Nanny Wermuth,et al.  On association models defined over independence graphs , 1998 .

[22]  Michael I. Jordan Graphical Models , 1998 .

[23]  James M. Robins,et al.  Estimation of Effects of Sequential Treatments by Reparameterizing Directed Acyclic Graphs , 1997, UAI.

[24]  Nanny Wermuth,et al.  Multivariate Dependencies: Models, Analysis and Interpretation , 1996 .

[25]  G. Kauermann On a dualization of graphical Gaussian models , 1996 .

[26]  N. Wermuth,et al.  Linear Dependencies Represented by Chain Graphs , 1993 .

[27]  S. Orbom,et al.  When Can Association Graphs Admit A Causal Interpretation? , 1993 .

[28]  N. Wermuth,et al.  On Substantive Research Hypotheses, Conditional Independence Graphs and Graphical Chain Models , 1990 .

[29]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.

[30]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[31]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[32]  A. Mandelbaum,et al.  Complete and Symmetrically Complete Families of Distributions , 1987 .

[33]  P. Rosenbaum,et al.  Conditional Association and Unidimensionality in Monotone Latent Variable Models , 1985 .

[34]  L. Brown Fundamentals of statistical exponential families: with applications in statistical decision theory , 1986 .

[35]  T. Speed,et al.  Gaussian Markov Distributions over Finite Graphs , 1986 .

[36]  A. Dawid Some Misleading Arguments Involving Conditional Independence , 1979 .

[37]  G. Dirac On rigid circuit graphs , 1961 .

[38]  J. Sargan THE ESTIMATION OF ECONOMIC RELATIONSHIPS USING INSTRUMENTAL VARIABLES , 1958 .