Modeling Liquid Association

In 2002, Ker-Chau Li introduced the liquid association measure to characterize three-way interactions between genes, and developed a computationally efficient estimator that can be used to screen gene expression microarray data for such interactions. That study, and others published since then, have established the biological validity of the method, and clearly demonstrated it to be a useful tool for the analysis of genomic data sets. To build on this work, we have sought a parametric family of multivariate distributions with the flexibility to model the full range of trivariate dependencies encompassed by liquid association. Such a model could situate liquid association within a formal inferential theory. In this article, we describe such a family of distributions, a trivariate, conditional normal model having Gaussian univariate marginal distributions, and in fact including the trivariate Gaussian family as a special case. Perhaps the most interesting feature of the distribution is that the parameterization naturally parses the three-way dependence structure into a number of distinct, interpretable components. One of these components is very closely aligned to liquid association, and is developed as a measure we call modified liquid association. We develop two methods for estimating this quantity, and propose statistical tests for the existence of this type of dependence. We evaluate these inferential methods in a set of simulations and illustrate their use in the analysis of publicly available experimental data.

[1]  Raymond J Carroll,et al.  Covariate Adjusted Correlation Analysis with Application to FMR1 Premutation Female Carrier Data , 2009, Biometrics.

[2]  L. Zhao,et al.  Estimating equations for parameters in means and covariances of multivariate discrete and continuous responses. , 1991, Biometrics.

[3]  Liang Chen,et al.  A statistical method for identifying differential gene-gene co-expression patterns , 2004, Bioinform..

[4]  K-C Li,et al.  A functional genomic study on NCI's anticancer drug screen , 2004, The Pharmacogenomics Journal.

[5]  N. Friedman,et al.  Structure and function of a transcriptional network activated by the MAPK Hog1 , 2008, Nature Genetics.

[6]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[7]  John A. Nelder,et al.  Generalized linear models. 2nd ed. , 1993 .

[8]  A. Kikuchi Regulation of beta-catenin signaling in the Wnt pathway. , 2000, Biochemical and biophysical research communications.

[9]  Insuk Sohn,et al.  BMC Bioinformatics BioMed Central Methodology article A copula method for modeling directional dependence of genes , 2022 .

[10]  Lars Malmström,et al.  The Yeast Resource Center Public Data Repository , 2004, Nucleic Acids Res..

[11]  R. Nelsen An Introduction to Copulas , 1998 .

[12]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[13]  Ker-Chau Li,et al.  A system for enhancing genome-wide coexpression dynamics study. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[14]  A. Kikuchi Regulation of β-Catenin Signaling in the Wnt Pathway , 2000 .

[15]  Jun Yan,et al.  Rejoinder to Franke, Kastner and Ziegler , 2004 .

[16]  Yuan Ji,et al.  Extracting three-way gene interactions from microarray data , 2007, Bioinform..

[17]  C. Stein Estimation of the Mean of a Multivariate Normal Distribution , 1981 .

[18]  Yen-Yi Ho,et al.  Statistical methods for identifying differentially expressed gene combinations. , 2007, Methods in molecular biology.

[19]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Giovanni Parmigiani,et al.  Searching for differentially expressed gene combinations , 2005, Genome Biology.

[21]  Jason Fine,et al.  Estimating equations for association structures , 2004, Statistics in medicine.

[22]  M C Paik,et al.  Parametric variance function estimation for nonnormal repeated measurement data. , 1992, Biometrics.