Modeling interactions with known risk loci—a Bayesian model averaging approach

Genome‐wide association studies (GWAS) are now clearly established as a powerful method for detecting loci involved in the etiology of common complex diseases. Most diseases and traits studied using the GWAS approach now have several loci that have been shown to be convincingly replicated. It is generally the case that these loci have been identified using single locus association scans of genotyped or imputed SNPs and very few loci have been identified by taking interactions into account. We propose a method that assesses the evidence of association at each SNP by modeling the effect of the locus in combination with other known loci. We use a Bayesian model averaging approach that combines the evidence across several different plausible models for the way in which the loci interact. We show that the method has good power both when the association is the result of marginal effects only, and when interaction with a known locus occurs. The method is implemented as an option in the program SNPTEST.

[1]  Scott M. Williams,et al.  New strategies for identifying gene-gene interactions in hypertension , 2002, Annals of medicine.

[2]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[3]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[4]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[5]  Jason H Moore,et al.  Computational analysis of gene-gene interactions using multifactor dimensionality reduction , 2004, Expert review of molecular diagnostics.

[6]  B. Maher Personal genomes: The case of the missing heritability , 2008, Nature.

[7]  Taesung Park,et al.  Odds ratio based multifactor-dimensionality reduction method for detecting gene – gene interactions , 2006 .

[8]  M. Garrett,et al.  A genome scan for Loci associated with aerobic running capacity in rats. , 2002, Genomics.

[9]  Jonathan C. Cohen,et al.  A Common Allele on Chromosome 9 Associated with Coronary Heart Disease , 2007, Science.

[10]  M. Stephens,et al.  Imputation-Based Analysis of Association Studies: Candidate Regions and Quantitative Traits , 2007, PLoS genetics.

[11]  Juliet M Chapman,et al.  Detecting association using epistatic information , 2007, Genetic epidemiology.

[12]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[13]  G. Zubenko,et al.  D10S1423 identifies a susceptibility locus for Alzheimer's disease in a prospective, longitudinal, double-blind study of asymptomatic individuals , 2001, Molecular Psychiatry.

[14]  F. Morón,et al.  A method for detecting epistasis in genome-wide studies using case-control multi-locus association analysis , 2008, BMC Genomics.

[15]  D. Allison,et al.  Detection of gene x gene interactions in genome-wide association studies of human population data. , 2007, Human heredity.

[16]  Ayellet V. Segrè,et al.  Twelve type 2 diabetes susceptibility loci identified through large-scale association analysis , 2010, Nature Genetics.

[17]  M. Kendall,et al.  Kendall's advanced theory of statistics , 1995 .

[18]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[19]  Vincent Plagnol,et al.  Meta-analysis of genome-wide association study data identifies additional type 1 diabetes risk loci , 2008, Nature Genetics.

[20]  Jason H. Moore,et al.  Renin-angiotensin system gene polymorphisms and coronary artery disease in a large angiographic cohort: detection of high order gene-gene interaction. , 2007, Atherosclerosis.

[21]  Jun S. Liu,et al.  Bayesian inference of epistatic interactions in case-control studies , 2007, Nature Genetics.

[22]  E. Rogaev,et al.  Conversion and compensatory evolution of the gamma-crystallin genes and identification of a cataractogenic mutation that reverses the sequence of the human CRYGD gene to an ancestral state. , 2007, American journal of human genetics.

[23]  D. Clayton Prediction and Interaction in Complex Disease Genetics: Experience in Type 1 Diabetes , 2009, PLoS genetics.

[24]  Scott M. Williams,et al.  The use of animal models in the study of complex disease: all else is never equal or why do so many human studies fail to replicate animal findings? , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[25]  D. Lalo,et al.  Oligogenic combinations associated with breast cancer risk in women under 53 years of age , 2005, Human Genetics.

[26]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[27]  E. Topol,et al.  Genetic susceptibility to myocardial infarction and coronary artery disease. , 2006, Human molecular genetics.

[28]  C. Sing,et al.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. , 2001, Genome research.

[29]  S. Hunt,et al.  Identification of a congenic mouse line with obesity and body length phenotypes , 2004, Mammalian Genome.

[30]  J. Marchini,et al.  Genotype imputation for genome-wide association studies , 2010, Nature Reviews Genetics.

[31]  Anbupalam Thalamuthu,et al.  TRAF1-C5 as a risk locus for rheumatoid arthritis--a genomewide study. , 2007, The New England journal of medicine.

[32]  Bhramar Mukherjee,et al.  Exploiting Gene‐Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes‐Type Shrinkage Estimator to Trade‐Off between Bias and Efficiency , 2008, Biometrics.

[33]  M. Stephens,et al.  Bayesian statistical methods for genetic association studies , 2009, Nature Reviews Genetics.

[34]  Sven Cichon,et al.  Haplotype interaction analysis of unlinked regions , 2005, Genetic epidemiology.

[35]  J. H. Moore,et al.  Multifactor-dimensionality reduction shows a two-locus interaction associated with Type 2 diabetes mellitus , 2004, Diabetologia.

[36]  Johanna M Seddon,et al.  Variation in complement factor 3 is associated with risk of age-related macular degeneration , 2007, Nature Genetics.

[37]  Jae Hyun Kim,et al.  Genetic analysis of a new mouse model for non-insulin-dependent diabetes. , 2001, Genomics.

[38]  Daniel F. Schwarz,et al.  New susceptibility locus for coronary artery disease on chromosome 3q22.3 , 2009, Nature Genetics.

[39]  D. Clayton,et al.  Genome-wide association study and meta-analysis finds over 40 loci affect risk of type 1 diabetes , 2009, Nature Genetics.

[40]  Jacques Fellay,et al.  A Whole-Genome Association Study of Major Determinants for Host Control of HIV-1 , 2007, Science.

[41]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[42]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[43]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[44]  Jason H. Moore,et al.  The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases , 2003, Human Heredity.

[45]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[46]  M. Xiong,et al.  Test for interaction between two unlinked loci. , 2006, American journal of human genetics.

[47]  Lin He,et al.  An association study of the N-methyl-D-aspartate receptor NR1 subunit gene (GRIN1) and NR2B subunit gene (GRIN2B) in schizophrenia with universal DNA microarray , 2005, European Journal of Human Genetics.

[48]  Cheng-Chang Chang,et al.  Genetic polymorphisms of FAS and FASL (CD95/CD95L) genes in cervical carcinogenesis: An analysis of haplotype and gene-gene interaction. , 2005, Gynecologic oncology.

[49]  E R Martin,et al.  Identification of significant association and gene-gene interaction of GABA receptor subunit genes in autism. , 2005, American journal of human genetics.

[50]  G. Church,et al.  Modular epistasis in yeast metabolism , 2005, Nature Genetics.

[51]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[52]  A. O'Hagan,et al.  Kendall's Advanced Theory of Statistics, Vol. 2b: Bayesian Inference. , 1996 .

[53]  C. Sing,et al.  Genetic architecture of common multifactorial diseases. , 1996, Ciba Foundation symposium.

[54]  J. Cheverud,et al.  GENE EFFECTS ON A QUANTITATIVE TRAIT: TWO‐LOCUS EPISTATIC EFFECTS MEASURED AT MICROSATELLITE MARKERS AND AT ESTIMATED QTL , 1997, Evolution; international journal of organic evolution.

[55]  K. Taylor,et al.  Genome-Wide Association , 2007, Diabetes.

[56]  Anthony O'Hagan,et al.  Kendall's Advanced Theory of Statistics, volume 2B: Bayesian Inference, second edition , 2004 .

[57]  T. Mackay The genetic architecture of quantitative traits. , 2001, Annual review of genetics.

[58]  J. Hein,et al.  Using biological networks to search for interacting loci in genome-wide association studies , 2009, European Journal of Human Genetics.

[59]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[60]  G A Churchill,et al.  Genome-wide epistatic interaction analysis reveals complex genetic determinants of circadian behavior in mice. , 2001, Genome research.

[61]  D. Lindley,et al.  Bayes Estimates for the Linear Model , 1972 .

[62]  T. Reich,et al.  A perspective on epistasis: limits of models displaying no main effect. , 2002, American journal of human genetics.

[63]  C. Sing,et al.  Complex adaptive systems and human health: the influence of common genotypes of the apolipoprotein E (ApoE) gene polymorphism and age on the relational order within a field of lipid metabolism traits , 2000, Human Genetics.