Genome‐Wide Analysis of Gene‐Gene and Gene‐Environment Interactions Using Closed‐Form Wald Tests

Despite the successful discovery of hundreds of variants for complex human traits using genome‐wide association studies, the degree to which genes and environmental risk factors jointly affect disease risk is largely unknown. One obstacle to answering this question is that the computational effort required for testing gene‐gene and gene‐environment interactions is enormous. As a result, numerous computationally efficient tests have recently been proposed. However, the validity of these methods often relies on unrealistic assumptions such as additive main effects, main effects of only one variable, no linkage disequilibrium between the two single‐nucleotide polymorphisms (SNPs) in a pair, or gene‐environment independence. Here, we derive closed‐form and consistent estimates for interaction parameters and propose to use Wald tests for testing interactions. The Wald tests are asymptotically equivalent to the likelihood ratio tests (LRTs), which are widely considered the gold standard but are generally too computationally demanding for genome‐wide interaction analysis. Simulation studies show that the proposed Wald tests perform very similarly to the LRTs but are much more computationally efficient. Applying the proposed tests to a genome‐wide study of multiple sclerosis, we identify interactions within the major histocompatibility complex region. In this application, we find that (1) focusing on pairs where both SNPs are marginally significant yields more significant interactions than focusing on pairs where at least one SNP is marginally significant; and (2) parsimonious parameterization of interaction effects might decrease, rather than increase, statistical power.
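
To illustrate what a closed‐form Wald test for interaction can look like, the following is a minimal sketch for the simplest setting only: a binary genotype indicator, a binary exposure, and a case‐control sample analyzed with a saturated logistic model. In that setting the interaction estimate is the difference between the log cross‐product ratios in cases and in controls, and its estimated variance is the sum of the reciprocal cell counts. This is a textbook special case and is not claimed to be the estimator derived in the paper, which covers more general genotype codings. The function name `wald_interaction_test` and the table layout `counts[g][e]` are assumptions made for this illustration.

```python
import math


def wald_interaction_test(cases, controls):
    """Closed-form Wald test for a multiplicative G x E interaction.

    Assumed layout (hypothetical, for illustration only):
    cases[g][e] and controls[g][e] hold the counts for genotype
    indicator g in {0, 1} and exposure e in {0, 1}.

    Returns (estimate, variance, statistic, p_value), where the statistic
    is referred to a chi-square distribution with 1 degree of freedom.
    """
    cells = [table[g][e] for table in (cases, controls)
             for g in (0, 1) for e in (0, 1)]
    if min(cells) <= 0:
        # The closed form needs every cell to be strictly positive.
        raise ValueError("all eight cell counts must be positive")

    # Log cross-product ratio (G-E log odds ratio) within each group.
    log_or_cases = math.log(
        cases[1][1] * cases[0][0] / (cases[1][0] * cases[0][1]))
    log_or_controls = math.log(
        controls[1][1] * controls[0][0] / (controls[1][0] * controls[0][1]))

    # Interaction log odds ratio under the saturated logistic model.
    estimate = log_or_cases - log_or_controls
    # Large-sample variance: sum of reciprocal cell counts.
    variance = sum(1.0 / n for n in cells)

    statistic = estimate ** 2 / variance
    # Upper tail of a chi-square(1): P(Z^2 > x) = erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return estimate, variance, statistic, p_value


if __name__ == "__main__":
    # Made-up counts, used only to show the call.
    example_cases = [[100, 100], [100, 400]]
    example_controls = [[100, 100], [100, 100]]
    print(wald_interaction_test(example_cases, example_controls))
```

Because the estimate and its variance are simple functions of the cell counts, no iterative model fitting is needed, which is the source of the computational savings relative to an LRT that must fit two logistic models per pair.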
