Differential networking with path weights in Gaussian trees

Marginal and partial correlations quantify the strength of the associations represented by the edges of a graphical Gaussian model. The identification of changes in these quantities across different multivariate distributions, defined on the same vector of random variables, is often used to analyze regulatory networks in molecular biology, doing what is popularly known as differential networking, or differential coexpression analysis. However, the strength of associations along the paths of a graphical model has remained largely unexplored in this type of analysis. Here we investigate how to quantify this strength over the paths of a Gaussian tree, leading to a factorization of what we shall call path weights. We show that tree structures allow for an intuitive interpretation of path weights and that the proposed factorization conveys information that is not captured by marginal or partial correlations alone. Path weights can help to improve our understanding of a multivariate system under study and provide a new tool for differential coexpression analysis.

[1]  Vincent Y. F. Tan,et al.  Learning Latent Tree Graphical Models , 2010, J. Mach. Learn. Res..

[2]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[3]  David Edwards,et al.  Selecting high-dimensional mixed graphical models using minimal AIC or BIC forests , 2010, BMC Bioinformatics.

[4]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[5]  Robert Castelo,et al.  Mapping eQTL Networks with Mixed Graphical Markov Models , 2014, Genetics.

[6]  S. E. Ahmed,et al.  Univariate and Multivariate General Linear Models—Theory and Applications With SAS , 2008, Technometrics.

[7]  Steffen L. Lauritzen,et al.  Graphical models in R , 1996 .

[8]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[9]  William W. Rozeboom,et al.  Linear correlations between sets of variables , 1965, Psychometrika.

[10]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[11]  N. H. Timm Applied Multivariate Analysis , 2002 .

[12]  Michael A. West,et al.  Covariance decomposition in undirected Gaussian graphical models , 2005 .

[13]  S. Chatterjee,et al.  Regression Analysis by Example , 1979 .

[14]  A. G. de la Fuente From 'differential expression' to 'differential networking' - identification of dysfunctional regulatory networks in diseases. , 2010, Trends in genetics : TIG.

[15]  S. Lauritzen,et al.  Maximum likelihood estimation in Gaussian models under total positivity , 2017, The Annals of Statistics.

[16]  A. Roverato,et al.  The networked partial correlation and its application to the analysis of genetic interactions , 2015, 1510.02510.

[17]  G. Barton,et al.  How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use? , 2015, RNA.

[18]  Patrick Danaher,et al.  The joint graphical lasso for inverse covariance estimation across multiple classes , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[19]  David A. Belsley,et al.  Regression Analysis and its Application: A Data-Oriented Approach.@@@Applied Linear Regression.@@@Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1981 .

[20]  E. Levina,et al.  Joint estimation of multiple graphical models. , 2011, Biometrika.

[21]  P. Games Correlation and Causation: A Logical Snafu , 1990 .

[22]  Nanny Wermuth,et al.  Multivariate Dependencies: Models, Analysis and Interpretation , 1996 .

[23]  Piotr Zwiernik,et al.  Semialgebraic Statistics and Latent Tree Models , 2015 .