Deciphering Genome Environment Wide Interactions Using Exposed Subjects Only

The recent successes of genome‐wide association studies (GWAS) have renewed interest in genome environment wide interaction studies (GEWIS) to discover genetic factors that modulate penetrance of environmental exposures to human diseases. Indeed, gene‐environment interactions (G × E), which have not been emphasized in the GWAS era, could be a source contributing to the missing heritability, a major bottleneck limiting continuing GWAS successes. In this manuscript, we describe a design and analytic strategy to focus on G × E using only exposed subjects, dubbed as e‐GEWIS. Operationally, an e‐GEWIS analysis is equivalent to a GWAS analysis on exposed subjects only, and it has actually been used in some earlier GWAS without being explicitly identified as such. Through both analytics and simulations, e‐GEWIS has been shown better efficiency than the usual cross‐product‐based analysis of G × E interaction with both cases and controls (cc‐GEWIS), and they have comparable efficiency to case‐only analysis of G × E (c‐GEWIS), with potentially smaller sample sizes. The formalization of e‐GEWIS here provides a theoretical basis to legitimize this framework for routine investigation of G × E, for more efficient G × E study designs, and for improvement of reproducibility in replicating GEWIS findings. As an illustration, we apply e‐GEWIS to a lung cancer GWAS data set to perform a GEWIS, focusing on gene and smoking interaction. The e‐GEWIS analysis successfully uncovered positive genetic associations on chromosome 15 among current smokers, suggesting a gene‐smoking interaction. Although this signal was detected earlier, the current finding here serves as a positive control in support of this e‐GEWIS strategy.

[1]  Peter Kraft,et al.  Gene-environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes. , 2012, American journal of epidemiology.

[2]  S Greenland,et al.  Concepts of interaction. , 1980, American journal of epidemiology.

[3]  Paolo Vineis,et al.  A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25 , 2008, Nature.

[4]  Pak Chung Sham,et al.  GWASdb: a database for human genetic variants identified by genome-wide association studies , 2011, Nucleic Acids Res..

[5]  P S Albert,et al.  Limitations of the case-only design for identifying gene-environment interactions. , 2001, American journal of epidemiology.

[6]  R. Pyke,et al.  Logistic disease incidence models and case-control studies , 1979 .

[7]  G. Mills,et al.  Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1 , 2008, Nature Genetics.

[8]  Elena Flowers,et al.  Gene-environment interactions in cardiovascular disease , 2012, European journal of cardiovascular nursing : journal of the Working Group on Cardiovascular Nursing of the European Society of Cardiology.

[9]  Wentian Li,et al.  Genome-Wide Association Scan Identifies Candidate Polymorphisms Associated with Differential Response to Anti-TNF Treatment in Rheumatoid Arthritis , 2008, Molecular medicine.

[10]  Carolyn Hutter,et al.  Powerful Cocktail Methods for Detecting Genome‐Wide Gene‐Environment Interaction , 2012, Genetic epidemiology.

[11]  C. Werning [Rheumatoid arthritis]. , 1983, Medizinische Monatsschrift fur Pharmazeuten.

[12]  H. Superko,et al.  Statins personalized. , 2012, The Medical clinics of North America.

[13]  D. Thomas,et al.  Gene–environment-wide association studies: emerging approaches , 2010, Nature Reviews Genetics.

[14]  Tyler J. VanderWeele,et al.  Estimating measures of interaction on an additive scale for preventive exposures , 2011, European Journal of Epidemiology.

[15]  P. Simpson,et al.  Statistical methods in cancer research , 2001, Journal of surgical oncology.

[16]  T. Beaty,et al.  Fundamentals of Genetic Epidemiology , 1994 .

[17]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[18]  Nilanjan Chatterjee,et al.  Design and analysis of two‐phase studies with binary outcome applied to Wilms tumour prognosis , 1999 .

[19]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[20]  Daniel F. Gudbjartsson,et al.  A variant associated with nicotine dependence, lung cancer and peripheral arterial disease , 2008, Nature.

[21]  U. Nöthlings,et al.  A case-only study of gene-environment interaction between genetic susceptibility variants in NOD2 and cigarette smoking in Crohn's disease aetiology , 2012, BMC Medical Genetics.

[22]  Xihong Lin,et al.  Design and analysis issues in gene and environment studies , 2012, Environmental Health.

[23]  Jaeil Ahn,et al.  Testing gene-environment interaction in large-scale case-control association studies: possible choices and comparisons. , 2012, American journal of epidemiology.

[24]  D. Clayton Commentary: reporting and assessing evidence for interaction: why, when and how? , 2012, International journal of epidemiology.

[25]  Peter Kraft,et al.  Gene‐environment interplay in common complex diseases: forging an integrative model—recommendations from an NIH workshop , 2011, Genetic epidemiology.

[26]  Juan Pablo Lewinger,et al.  Sample size requirements to detect gene‐environment interactions in genome‐wide association studies , 2011, Genetic epidemiology.

[27]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[28]  W. Gauderman,et al.  Gene-environment interaction in genome-wide association studies. , 2008, American journal of epidemiology.

[29]  Ying Wang,et al.  A genome-wide association study of lung cancer identifies a region of chromosome 5p15 associated with risk for adenocarcinoma. , 2009, American journal of human genetics.

[30]  A. Hannan,et al.  Nature, nurture and neurology: gene–environment interactions in neurodegenerative disease , 2005, The FEBS journal.

[31]  G. Mills,et al.  Nicotinic acetylcholine receptor region on chromosome 15q25 and lung cancer risk among African Americans: a case-control study. , 2010, Journal of the National Cancer Institute.

[32]  K F Cheng,et al.  Retrospective analysis of case‐control studies when the population is in Hardy–Weinberg equilibrium , 2005, Statistics in medicine.

[33]  Lyla M. Hernandez,et al.  Genes, Behavior, and the Social Environment:: Moving Beyond the Nature/Nurture Debate , 2006 .

[34]  M. Plummer,et al.  International agency for research on cancer. , 2020, Archives of pathology.

[35]  M. Khoury,et al.  Invited commentary: from genome-wide association studies to gene-environment-wide interaction studies--challenges and opportunities. , 2008, American journal of epidemiology.

[36]  J. Ordovás,et al.  Epigenetics and cardiovascular disease , 2010, Nature Reviews Cardiology.

[37]  Juan Pablo Lewinger,et al.  Efficient genome-wide association testing of gene-environment interaction in case-parent trios. , 2010, American journal of epidemiology.

[38]  D. Clayton Prediction and Interaction in Complex Disease Genetics: Experience in Type 1 Diabetes , 2009, PLoS genetics.

[39]  S. Manuck,et al.  Gene-environment interaction. , 2014, Annual review of psychology.

[40]  D. Thomas,et al.  Methods for investigating gene-environment interactions in candidate pathway and genome-wide association studies. , 2010, Annual review of public health.

[41]  Peter Kraft,et al.  Gene‐Environment Interactions in Cancer Epidemiology: A National Cancer Institute Think Tank Report , 2013, Genetic epidemiology.

[42]  S. Greenland,et al.  Causation and causal inference in epidemiology. , 2005, American journal of public health.

[43]  Juan Pablo Lewinger,et al.  Finding Novel Genes by Testing G × E Interactions in a Genome‐Wide Association Study , 2013, Genetic epidemiology.

[44]  Nilanjan Chatterjee,et al.  Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies , 2005 .

[45]  M. Kendall Theoretical Statistics , 1956, Nature.

[46]  M. LeBlanc,et al.  Increasing the power of identifying gene × gene interactions in genome‐wide association studies , 2008, Genetic epidemiology.

[47]  Raymond J Carroll,et al.  Exploiting gene‐environment independence in family‐based case‐control studies: Increased power for detecting associations, interactions and joint effects , 2005, Genetic epidemiology.

[48]  Jack A. Taylor,et al.  Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case-control studies. , 1994, Statistics in medicine.

[49]  D. Kang,et al.  Breast cancer prevention based on gene–environment interaction , 2011, Molecular Carcinogenesis.

[50]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[51]  D. Paul The Politics of Heredity: Essays on Eugenics, Biomedicine, and the Nature-Nurture Debate , 1998 .

[52]  R. Plomin,et al.  The Nature of Nurture: A Genomewide Association Scan for Family Chaos , 2008, Behavior genetics.

[53]  Kathryn Roeder,et al.  Next generation analytic tools for large scale genetic epidemiology studies of complex diseases , 2012, Genetic epidemiology.

[54]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[55]  Peter Kraft,et al.  Replication in genome-wide association studies. , 2009, Statistical science : a review journal of the Institute of Mathematical Statistics.

[56]  Shaun M. Purcell,et al.  Statistical power and significance testing in large-scale genetic studies , 2014, Nature Reviews Genetics.

[57]  Raymond J Carroll,et al.  Analysis of case‐control studies of genetic and environmental factors with missing genetic information and haplotype‐phase ambiguity , 2005, Genetic epidemiology.

[58]  M. Marazita,et al.  Genome-wide Association Studies , 2012, Journal of dental research.

[59]  N. Breslow,et al.  The analysis of case-control studies , 1980 .