Integrative modeling of multiple genomic data from different types of genetic association studies.

Genome-wide association studies (GWASs) and expression-/methylation-quantitative trait loci (eQTL/mQTL) studies constitute popular approaches for investigating the association of single nucleotide polymorphisms (SNPs) with disease and expression/methylation, respectively. Here, we propose to integrate QTL studies to more powerfully test the SNP effect on disease in GWASs when they are conducted among different subjects. We propose a model for the joint effect of SNPs, methylation, and gene expression on disease risk and obtain the marginal model for SNPs by integrating out methylation and expression. We characterize all possible causal relations among SNPs, methylation, and expression and study the corresponding null hypotheses of no SNP effect in terms of the regression coefficients in the joint model. We develop a score test for variance components of regression coefficients to evaluate the genetic effect. We further propose an omnibus test to accommodate different models. We illustrate the utility of the proposed method in an asthma GWAS study, a brain tumor study, and numerical simulations.

[1]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[2]  Benjamin A. Logsdon,et al.  Simultaneously testing for marginal genetic association and gene-environment interaction. , 2012, American journal of epidemiology.

[3]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[4]  Xihong Lin,et al.  JOINT ANALYSIS OF SNP AND GENE EXPRESSION DATA IN GENETIC ASSOCIATION STUDIES OF COMPLEX DISEASES. , 2014, The annals of applied statistics.

[5]  T. Pastinen,et al.  Interaction between genetic and epigenetic variation defines gene expression patterns at the asthma-associated locus 17q12-q21 in lymphoblastoid cell lines , 2012, Human Genetics.

[6]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[7]  C. Molony,et al.  Genetic analysis of genome-wide variation in human gene expression , 2004, Nature.

[8]  C. Croce Oncogenes and cancer. , 2008, The New England journal of medicine.

[9]  D. Mackinnon Introduction to Statistical Mediation Analysis , 2008 .

[10]  Xihong Lin Variance component testing in generalised linear models with random effects , 1997 .

[11]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[12]  R. Davies The distribution of a linear combination of 2 random variables , 1980 .

[13]  P. Donnelly,et al.  A new multipoint method for genome-wide association studies by imputation of genotypes , 2007, Nature Genetics.

[14]  John K Wiencke,et al.  A novel approach to the discovery of survival biomarkers in glioblastoma using a joint analysis of DNA methylation and gene expression , 2014, Epigenetics.

[15]  R. Mirimanoff,et al.  Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. , 2005, The New England journal of medicine.

[16]  R. Pyke,et al.  Logistic disease incidence models and case-control studies , 1979 .

[17]  T. Furey,et al.  Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets. , 2011, Genome research.

[18]  Liming Liang,et al.  A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines , 2013, Genome research.

[19]  L. Liang,et al.  A genome-wide association study of global gene expression , 2007, Nature Genetics.

[20]  Rachel B. Brem,et al.  Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks , 2008, Nature Genetics.

[21]  Gonçalo R. Abecasis,et al.  Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma , 2007, Nature.

[22]  Z. Ying,et al.  A resampling method based on pivotal estimating functions , 1994 .

[23]  J. Robins,et al.  Identifiability and Exchangeability for Direct and Indirect Effects , 1992, Epidemiology.