Powerful Cocktail Methods for Detecting Genome‐Wide Gene‐Environment Interaction

Identifying gene and environment interaction (G × E) can provide insights into biological networks of complex diseases, identify novel genes that act synergistically with environmental factors, and inform risk prediction. However, despite the fact that hundreds of novel disease‐associated loci have been identified from genome‐wide association studies (GWAS), few G × Es have been discovered. One reason is that most studies are underpowered for detecting these interactions. Several new methods have been proposed to improve power for G × E analysis, but performance varies with scenario. In this article, we present a module‐based approach to integrating various methods that exploits each method's most appealing aspects. There are three modules in our approach: (1) a screening module for prioritizing Single Nucleotide Polymorphisms (SNPs); (2) a multiple comparison module for testing G × E; and (3) a G × E testing module. We combine all three of these modules and develop two novel “cocktail” methods. We demonstrate that the proposed cocktail methods maintain the type I error, and that the power tracks well with the best existing methods, despite that the best methods may be different under various scenarios and interaction models. For GWAS, where the true interaction models are unknown, methods like our “cocktail” methods that are powerful under a wide range of situations are particularly appealing. Broadly speaking, the modular approach is conceptually straightforward and computationally simple. It builds on common test statistics and is easily implemented without additional computational efforts. It also allows for an easy incorporation of new methods as they are developed. Our work provides a comprehensive and powerful tool for devising effective strategies for genome‐wide detection of gene‐environment interactions.

[1]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[2]  Juan Pablo Lewinger,et al.  Sample size requirements to detect gene‐environment interactions in genome‐wide association studies , 2011, Genetic epidemiology.

[3]  Bhramar Mukherjee,et al.  Exploiting Gene‐Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes‐Type Shrinkage Estimator to Trade‐Off between Bias and Efficiency , 2008, Biometrics.

[4]  Larry Wasserman,et al.  Genome-Wide Significance Levels and Weighted Hypothesis Testing. , 2009, Statistical science : a review journal of the Institute of Mathematical Statistics.

[5]  William Wheeler,et al.  A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci , 2010, Nature Genetics.

[6]  Iuliana Ionita-Laza,et al.  Genomewide weighted hypothesis testing in family-based association studies, with an application to a 100K scan. , 2007, American journal of human genetics.

[7]  David V Conti,et al.  Detecting gene-environment interactions using a combined case-only and case-control approach. , 2008, American journal of epidemiology.

[8]  L. Wasserman,et al.  Improving power in genome‐wide association studies: weights tip the scale , 2007, Genetic epidemiology.

[9]  Jack A. Taylor,et al.  Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. , 1994, Statistics in medicine.

[10]  P S Albert,et al.  Limitations of the case-only design for identifying gene-environment interactions. , 2001, American journal of epidemiology.

[11]  Jaeil Ahn,et al.  Testing gene-environment interaction in large-scale case-control association studies: possible choices and comparisons. , 2012, American journal of epidemiology.

[12]  M. LeBlanc,et al.  Increasing the power of identifying gene × gene interactions in genome‐wide association studies , 2008, Genetic epidemiology.

[13]  N E Day,et al.  The design of case-control studies: the influence of confounding and interaction effects. , 1984, International journal of epidemiology.

[14]  D. Thomas,et al.  Gene–environment-wide association studies: emerging approaches , 2010, Nature Reviews Genetics.

[15]  B. Burt Gerstman,et al.  Epidemiology Kept Simple: An Introduction to Traditional and Modern Epidemiology , 2013 .

[16]  Charles Kooperberg,et al.  On two-stage hypothesis testing procedures via asymptotically independent statistics , 2010 .

[17]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[18]  W. Gauderman,et al.  Gene-environment interaction in genome-wide association studies. , 2008, American journal of epidemiology.