Detecting genetic interactions for quantitative traits with U‐statistics

The genetic etiology of complex human diseases has been commonly viewed as a process that involves multiple genetic variants, environmental factors, as well as their interactions. Statistical approaches, such as the multifactor dimensionality reduction (MDR) and generalized MDR (GMDR), have recently been proposed to test the joint association of multiple genetic variants with either dichotomous or continuous traits. In this study, we propose a novel Forward U‐Test to evaluate the combined effect of multiple loci on quantitative traits with consideration of gene‐gene/gene‐environment interactions. In this new approach, a U‐Statistic‐based forward algorithm is first used to select potential disease‐susceptibility loci and then a weighted U‐statistic is used to test the joint association of the selected loci with the disease. Through a simulation study, we found the Forward U‐Test outperformed GMDR in terms of greater power. Aside from that, our approach is less computationally intensive, making it feasible for high‐dimensional gene‐gene/gene‐environment research. We illustrate our method with a real data application to nicotine dependence (ND), using three independent datasets from the Study of Addiction: Genetics and Environment. Our gene‐gene interaction analysis of 155 SNPs in 67 candidate genes identified two SNPs, rs16969968 within gene CHRNA5 and rs1122530 within gene NTRK2, jointly associated with the level of ND (P‐value = 5.31e−7). The association, which involves essential interaction, is replicated in two independent datasets with P‐values of 1.08e−5 and 0.02, respectively. Our finding suggests that joint action may exist between the two gene products. Genet. Epidemiol. 2011. © 2011 Wiley‐Liss, Inc. 35: 457‐468, 2011

[1]  Z. Ying,et al.  Detecting essential and removable interactions in genome-wide association studies. , 2009, Statistics and its interface.

[2]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[3]  S. Schuster Next-generation sequencing transforms today's biology , 2008, Nature Methods.

[4]  Mingyao Li,et al.  U‐Statistics‐based Tests for Multiple Genes in Genetic Association Studies , 2008, Annals of human genetics.

[5]  Jun Zhu,et al.  A combinatorial approach to detecting gene-gene and gene-environment interactions in family studies. , 2008, American journal of human genetics.

[6]  Wenjiang J. Fu,et al.  Mapping Haplotype-haplotype Interactions with Adaptive LASSO , 2010, BMC Genetics.

[7]  R. Elston,et al.  The Meaning of Interaction , 2010, Human Heredity.

[8]  Ming D. Li,et al.  Gene-Gene Interactions Among CHRNA4, CHRNB2, BDNF, and NTRK2 in Nicotine Dependence , 2008, Biological Psychiatry.

[9]  L. Bierut,et al.  The prognostic implications of DSM-IV abuse criteria in drinking adolescents. , 2008, Drug and Alcohol Dependence.

[10]  G. Mendel,et al.  Mendel's Principles of Heredity , 1910, Nature.

[11]  H. Wit Faculty Opinions recommendation of Alpha-5/alpha-3 nicotinic receptor subunit alleles increase risk for heavy smoking. , 2008 .

[12]  Jason H. Moore,et al.  The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases , 2003, Human Heredity.

[13]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[14]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[15]  John D. Storey,et al.  Genetic interactions between polymorphisms that affect gene expression in yeast , 2005, Nature.

[16]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[17]  Chuhsing Kate Hsiao,et al.  Regression-based association analysis with clustered haplotypes through use of genotypes. , 2006, American journal of human genetics.

[18]  A. Lacasa,et al.  Epistasis in the resistance of pepper to phytophthora stem blight (Phytophthora capsici L.) and its significance in the prediction of double cross performances , 1994, Euphytica.

[19]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[20]  John P. Rice,et al.  A Risk Allele for Nicotine Dependence in CHRNA5 Is a Protective Allele for Cocaine Dependence , 2008, Biological Psychiatry.

[21]  William Wheeler,et al.  Genome-Wide and Candidate Gene Association Study of Cigarette Smoking Behaviors , 2009, PloS one.

[22]  R. Wu,et al.  Detecting Sequence-Sequence Interactions for Complex Diseases , 2006 .

[23]  Daniel J Schaid,et al.  Nonparametric tests of association of multiple genes with human disease. , 2005, American journal of human genetics.

[24]  N. Chatterjee,et al.  Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions. , 2006, American journal of human genetics.

[25]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[26]  Qing Lu,et al.  Using the optimal receiver operating characteristic curve to design a predictive genetic test, exemplified with type 2 diabetes. , 2008, American journal of human genetics.

[27]  Ming D. Li,et al.  Association of Specific Haplotypes of Neurotrophic Tyrosine Kinase Receptor 2 Gene (NTRK2) with Vulnerability to Nicotine Dependence in African-Americans and European-Americans , 2007, Biological Psychiatry.

[28]  P. Donnelly,et al.  A new multipoint method for genome-wide association studies by imputation of genotypes , 2007, Nature Genetics.

[29]  Nicholas G Martin,et al.  Cholinergic nicotinic receptor genes implicated in a nicotine dependence association study targeting 348 candidate genes with 3713 SNPs. , 2007, Human molecular genetics.

[30]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[31]  L. Bierut,et al.  Nicotinic Receptor Gene Variants Influence Susceptibility to Heavy Smoking , 2008, Cancer Epidemiology Biomarkers & Prevention.

[32]  A. C. Collins,et al.  The CHRNA5/A3/B4 Gene Cluster Variability as an Important Determinant of Early Alcohol and Tobacco Initiation in Young Adults , 2008, Biological Psychiatry.

[33]  J. H. Moore,et al.  A novel method to identify gene–gene effects in nuclear families: the MDR‐PDT , 2006, Genetic epidemiology.

[34]  John D. Storey,et al.  Multiple Locus Linkage Analysis of Genomewide Expression in Yeast , 2005, PLoS biology.

[35]  A. Zeileis,et al.  Regression Models for Count Data in R , 2008 .

[36]  R. Wu,et al.  A General Model for Multilocus Epistatic Interactions in Case-Control Studies , 2010, PloS one.

[37]  Jun Zhu,et al.  A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. , 2007, American journal of human genetics.

[38]  Christopher I Amos,et al.  The CHRNA5-A3 region on chromosome 15q24-25.1 is a risk factor both for nicotine dependence and for lung cancer. , 2008, Journal of the National Cancer Institute.

[39]  Guifang Fu,et al.  The Bayesian lasso for genome-wide association studies , 2011, Bioinform..

[40]  Hongyu Zhao,et al.  Statistical Power of Model Selection Strategies for Genome-Wide Association Studies , 2009, PLoS genetics.

[41]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[42]  W. Dassen,et al.  Search for haplotype interactions that influence susceptibility to type 1 diabetes, through use of unphased genotype data. , 2003, American journal of human genetics.

[43]  Megan E. Piper,et al.  A Candidate Gene Approach Identifies the CHRNA5-A3-B4 Region as a Risk Factor for Age-Dependent Nicotine Addiction , 2008, PLoS genetics.

[44]  Ming D. Li,et al.  Association and interaction analysis of variants in CHRNA5/CHRNA3/CHRNB4 gene cluster with nicotine dependence in African and European Americans , 2009, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[45]  P. Muglia,et al.  α-5/α-3 nicotinic receptor subunit alleles increase risk for heavy smoking , 2008, Molecular Psychiatry.

[46]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[47]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[48]  Ming D. Li,et al.  Associations of Variants in CHRNA5/A3/B4 Gene Cluster with Smoking Behaviors in a Korean Population , 2010, PloS one.

[49]  R. Nagel,et al.  Epistasis and the genetics of human diseases. , 2005, Comptes rendus biologies.