A partial least‐square approach for modeling gene‐gene and gene‐environment interactions when multiple markers are genotyped

Genetic association studies achieve an unprecedented level of resolution in mapping disease genes by genotyping dense single nucleotype polymorphisms (SNPs) in a gene region. Meanwhile, these studies require new powerful statistical tools that can optimally handle a large amount of information provided by genotype data. A question that arises is how to model interactions between two genes. Simply modeling all possible interactions between the SNPs in two gene regions is not desirable because a greatly increased number of degrees of freedom can be involved in the test statistic. We introduce an approach to reduce the genotype dimension in modeling interactions. The genotype compression of this approach is built upon the information on both the trait and the cross‐locus gametic disequilibrium between SNPs in two interacting genes, in such a way as to parsimoniously model the interactions without loss of useful information in the process of dimension reduction. As a result, it improves power to detect association in the presence of gene‐gene interactions. This approach can be similarly applied for modeling gene‐environment interactions. We compare this method with other approaches, the corresponding test without modeling any interaction, that based on a saturated interaction model, that based on principal component analysis, and that based on Tukey's one‐degree‐of‐freedom model. Our simulations suggest that this new approach has superior power to that of the other methods. In an application to endometrial cancer case‐control data from the Women's Health Initiative, this approach detected AKT1 and AKT2 as being significantly associated with endometrial cancer susceptibility by taking into account their interactions with body mass index. Genet. Epidemiol. 2008. © 2008 Wiley‐Liss, Inc.

[1]  D. Conrad,et al.  A worldwide survey of haplotype variation and linkage disequilibrium in the human genome , 2006, Nature Genetics.

[2]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[3]  M. Xiong,et al.  Test for interaction between two unlinked loci. , 2006, American journal of human genetics.

[4]  R. Kaaks,et al.  Obesity , Endogenous Hormones , and Endometrial Cancer Risk : A Synthetic Review 1 , 2002 .

[5]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[6]  N. Chatterjee,et al.  Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions. , 2006, American journal of human genetics.

[7]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[8]  David V Conti,et al.  A testing framework for identifying susceptibility genes in the presence of epistasis. , 2006, American journal of human genetics.

[9]  David M. Evans,et al.  Two-Stage Two-Locus Models in Genome-Wide Association , 2006, PLoS genetics.

[10]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[11]  Dmitri V Zaykin,et al.  Contrasting linkage-disequilibrium patterns between cases and controls as a novel association-mapping method. , 2006, American journal of human genetics.

[12]  I. Helland Partial Least Squares Regression , 2006 .

[13]  M. Pike,et al.  Progestins and menopause: epidemiological studies of risks of endometrial and breast cancer , 2000, Steroids.

[14]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[15]  David V Conti,et al.  Testing association between disease and multiple SNPs in a candidate gene , 2007, Genetic epidemiology.

[16]  I. Helland Partial least squares regression and statistical models , 1990 .

[17]  J. Testa,et al.  Perturbations of the AKT signaling pathway in human cancer , 2005, Oncogene.

[18]  I. Helland ON THE STRUCTURE OF PARTIAL LEAST SQUARES REGRESSION , 1988 .

[19]  M. Daly,et al.  Transferability of tag SNPs in genetic association studies in multiple populations , 2006, Nature Genetics.

[20]  R. Elston,et al.  Improving power in contrasting linkage-disequilibrium patterns between cases and controls. , 2007, American journal of human genetics.

[21]  JoAnn E. Manson,et al.  Design of the Women's Health Initiative clinical trial and observational study. The Women's Health Initiative Study Group. , 1998, Controlled clinical trials.