A two‐stage mixed‐effects model approach for gene‐set analyses in candidate gene studies

In genetic association studies, a gene-set analysis can be more powerful than the separate analyses of multiple genetic variants and can offer unique insights into the genetic basis of many common human diseases. The goal of such an analysis is to study the joint effect of multiple single-nucleotide polymorphisms (SNPs) which belong to certain genes, and these genes are assumed to be involved in a common biological function. Currently, few approaches acknowledge the within-genes and between-genes correlations when testing for gene-set effects. Thus, here we propose a two-stage approach, which in the first stage uses a mixed-effects model with a general random-effects structure to capture the correlation between the SNPs and in the second stage tests for gene-set effects by using the empirical Bayes estimates of the random effects of the first stage as covariates in the model for the longitudinal phenotype. The advantage of this approach is its broad applicability because it can be used for any phenotypic outcome and any genetic model and can be implemented with standard statistical software.

[1]  Manuel A. R. Ferreira,et al.  Gene ontology analysis of GWA study data sets provides insights into the biology of bipolar disorder. , 2009, American journal of human genetics.

[2]  Wei Pan,et al.  A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants , 2010, Human Heredity.

[3]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[4]  W. Bodmer,et al.  Common and rare variants in multifactorial susceptibility to common diseases , 2008, Nature Genetics.

[5]  Lin S. Chen,et al.  Insights into colon cancer etiology via a regularized approach to gene set analysis of GWAS data. , 2010, American journal of human genetics.

[6]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[7]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[8]  Hae-Won Uh,et al.  Pathway analysis for family data using nested random-effects models , 2011, BMC proceedings.

[9]  Hae-Won Uh,et al.  Does pathway analysis make it easier for common variants to tag rare ones? , 2011, BMC proceedings.

[10]  Rachel Knevel,et al.  Predicting arthritis outcomes--what can be learned from the Leiden Early Arthritis Clinic? , 2011, Rheumatology.

[11]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[12]  Chiara Sabatti,et al.  Human genetics: Variants in common diseases , 2007, Nature.

[13]  Désirée van der Heijde,et al.  Plain X-rays in rheumatoid arthritis: overview of scoring methods, their reliability and applicability. , 1996 .

[14]  Geert Molenberghs,et al.  Shared parameter models under random effects misspecification , 2008 .

[15]  G. Church,et al.  A computational analysis of whole-genome expression data reveals chromosomal domains of gene expression , 2000, Nature Genetics.