Polygenic approaches to detect gene–environment interactions when external information is unavailable

Abstract The exploration of ‘gene–environment interactions’ (G × E) is important for disease prediction and prevention. The scientific community usually uses external information to construct a genetic risk score (GRS), and then tests the interaction between this GRS and an environmental factor (E). However, external genome-wide association studies (GWAS) are not always available, especially for non-Caucasian ethnicity. Although GRS is an analysis tool to detect G × E in GWAS, its performance remains unclear when there is no external information. Our ‘adaptive combination of Bayes factors method’ (ADABF) can aggregate G × E signals and test the significance of G × E by a polygenic test. We here explore a powerful polygenic approach for G × E when external information is unavailable, by comparing our ADABF with the GRS based on marginal effects of SNPs (GRS-M) and GRS based on SNP × E interactions (GRS-I). ADABF is the most powerful method in the absence of SNP main effects, whereas GRS-M is generally the best test when single-nucleotide polymorphisms main effects exist. GRS-I is the least powerful test due to its data-splitting strategy. Furthermore, we apply these methods to Taiwan Biobank data. ADABF and GRS-M identified gene × alcohol and gene × smoking interactions on blood pressure (BP). BP-increasing alleles elevate more BP in drinkers (smokers) than in nondrinkers (nonsmokers). This work provides guidance to choose a polygenic approach to detect G × E when external information is unavailable.

[1]  Jing Hua Zhao,et al.  Physical Activity Attenuates the Genetic Predisposition to Obesity in 20,000 Men and Women from EPIC-Norfolk Prospective Population Study , 2010, PLoS medicine.

[2]  J. Hebebrand,et al.  Polygenic Obesity in Humans , 2008, Obesity Facts.

[3]  Josée Dupuis,et al.  Incorporating Gene-Environment Interaction in Testing for Association with Rare Genetic Variants , 2014, Human Heredity.

[4]  K. Ickstadt,et al.  Detection of gene-environment interactions in the presence of linkage disequilibrium and noise by using genetic risk scores with internal weights from elastic net regression , 2017, BMC Genetics.

[5]  P. Sullivan,et al.  Effect of polygenic risk scores on depression in childhood trauma , 2014, British Journal of Psychiatry.

[6]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[7]  J. Hebebrand,et al.  From monogenic to polygenic obesity: recent advances , 2010, European Child & Adolescent Psychiatry.

[8]  Jaeil Ahn,et al.  Testing gene-environment interaction in large-scale case-control association studies: possible choices and comparisons. , 2012, American journal of epidemiology.

[9]  Chen-Yang Shen,et al.  Population structure of Han Chinese in the modern Taiwanese population based on 10,000 participants in the Taiwan Biobank project. , 2016, Human molecular genetics.

[10]  Jack Euesden,et al.  PRSice: Polygenic Risk Score software , 2014, Bioinform..

[11]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[12]  S. Mccarroll,et al.  Polygenic risk for schizophrenia and neurocognitive performance in patients with schizophrenia , 2018, Genes, brain, and behavior.

[13]  M. Stephens,et al.  Bayesian statistical methods for genetic association studies , 2009, Nature Reviews Genetics.

[14]  M. LeBlanc,et al.  Increasing the power of identifying gene × gene interactions in genome‐wide association studies , 2008, Genetic epidemiology.

[15]  P S Albert,et al.  Limitations of the case-only design for identifying gene-environment interactions. , 2001, American journal of epidemiology.

[16]  J. Chang-Claude,et al.  Gene–environment interaction and risk of breast cancer , 2016, British Journal of Cancer.

[17]  M. Jamieson,et al.  The measurement of blood pressure: sitting or supine, once or twice? , 1990, Journal of hypertension.

[18]  William S Bush,et al.  Evidence for polygenic susceptibility to multiple sclerosis--the shape of things to come. , 2010, American journal of human genetics.

[19]  Ruth Ottman Gene-environment interaction: definitions and study designs. , 1996 .

[20]  Peter Kraft,et al.  Additive interactions between susceptibility single-nucleotide polymorphisms identified in genome-wide association studies and breast cancer risk factors in the Breast and Prostate Cancer Cohort Consortium. , 2014, American journal of epidemiology.

[21]  Ellen Kampman,et al.  Genome-wide association yields new sequence variants at seven loci that associate with measures of obesity , 2009, Nature Genetics.

[22]  Wei Lu,et al.  Gene-environment interactions for breast cancer risk among Chinese women: a report from the Shanghai Breast Cancer Genetics Study. , 2013, American journal of epidemiology.

[23]  M. McCarthy,et al.  Gene-Lifestyle Interaction and Type 2 Diabetes: The EPIC InterAct Case-Cohort Study , 2014, PLoS medicine.

[24]  D. Rao,et al.  Gene-alcohol interactions identify several novel blood pressure loci including a promising locus near SLC16A9 , 2013, Front. Genet..

[25]  Stacey J Winham,et al.  Gene-environment interactions in genome-wide association studies: current approaches and new directions. , 2013, Journal of child psychology and psychiatry, and allied disciplines.

[26]  E. Rimm,et al.  Sugar-Sweetened Beverages and Genetic Risk of Obesity , 2013 .

[27]  Carolyn Hutter,et al.  Powerful Cocktail Methods for Detecting Genome‐Wide Gene‐Environment Interaction , 2012, Genetic epidemiology.

[28]  Benjamin A Goldstein,et al.  Contemporary Considerations for Constructing a Genetic Risk Score: An Empirical Approach , 2015, Genetic epidemiology.

[29]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[30]  Hai-Gwo Hwu,et al.  Adaptive combination of Bayes factors as a powerful method for the joint analysis of rare and common variants , 2017, Scientific Reports.

[31]  Peter Kraft,et al.  Inclusion of gene-gene and gene-environment interactions unlikely to dramatically improve risk prediction for complex diseases. , 2012, American journal of human genetics.

[32]  D. Rao,et al.  Gene-smoking interactions identify several novel blood pressure loci in the Framingham Heart Study. , 2015, American journal of hypertension.

[33]  Trevor J. Hastie,et al.  Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..

[34]  Subhajyoti De,et al.  Common variants near MC4R are associated with fat mass, weight and risk of obesity , 2008, Nature Genetics.

[35]  Wei Pan,et al.  Gene expression A note on using permutation-based false discovery rate estimates to compare different analysis methods for microarray data , 2005 .

[36]  Paolo Vineis,et al.  A Systematic Comparison of Linear Regression–Based Statistical Methods to Assess Exposome-Health Associations , 2016, Environmental health perspectives.

[37]  Ross M. Fraser,et al.  Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.

[38]  Audrey Y. Chu,et al.  Gene × Physical Activity Interactions in Obesity: Combined Analysis of 111,421 Individuals of European Ancestry , 2013, PLoS genetics.

[39]  Yixin Fang,et al.  Analysis of genome-wide association data by large-scale Bayesian logistic regression , 2009, BMC proceedings.

[40]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[41]  Sharon L. R. Kardia,et al.  Current Applications of Genetic Risk Scores to Cardiovascular Outcomes and Subclinical Phenotypes , 2015, Current Epidemiology Reports.

[42]  F. Dudbridge Power and Predictive Accuracy of Polygenic Risk Scores , 2013, PLoS genetics.

[43]  Lianhua Yin,et al.  Interactions between ACYP2 genetic polymorphisms and environment factors with susceptibility to ischemic stroke in a Han Chinese Population. , 2017, Oncotarget.

[44]  L. Bierut,et al.  Genome-wide polygenic scores for age at onset of alcohol dependence and association with alcohol-related measures , 2016, Translational Psychiatry.

[45]  Seunggeun Lee,et al.  Test for rare variants by environment interactions in sequencing association studies , 2016, Biometrics.

[46]  Dale R. Nyholt,et al.  SECA: SNP effect concordance analysis using genome-wide association summary results , 2014, Bioinform..

[47]  James Strait,et al.  Genome-Wide Association Scan Shows Genetic Variants in the FTO Gene Are Associated with Obesity-Related Traits , 2007, PLoS genetics.

[48]  M. Jarvelin,et al.  A Common Variant in the FTO Gene Is Associated with Body Mass Index and Predisposes to Childhood and Adult Obesity , 2007, Science.

[49]  Hugues Aschard,et al.  A perspective on interaction effects in genetic association studies , 2016, Genetic epidemiology.

[50]  Wen-Chung Lee,et al.  Incorporating prior knowledge to facilitate discoveries in a genome-wide association study on age-related macular degeneration , 2010, BMC Research Notes.

[51]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[52]  K. Liao,et al.  Association of Environmental and Genetic Factors and Gene–Environment Interactions With Risk of Developing Rheumatoid Arthritis , 2013, Arthritis care & research.

[53]  Xihong Lin,et al.  Test for interactions between a genetic marker set and environment in generalized linear models. , 2013, Biostatistics.

[54]  V. Sharma,et al.  Elevated Blood Pressure in Acute Ischemic Stroke - Treat or Leave? , 2016, Cerebrovascular Diseases.

[55]  F. Hu,et al.  Genetic predisposition, Western dietary pattern, and the risk of type 2 diabetes in men. , 2009, The American journal of clinical nutrition.

[56]  M. García-Closas,et al.  Combined associations of genetic and environmental risk factors: implications for prevention of breast cancer. , 2014, Journal of the National Cancer Institute.

[57]  M. Rask-Andersen,et al.  Gene-environment interaction study for BMI reveals interactions between genetic factors and physical activity, alcohol consumption and socioeconomic status , 2017, PLoS genetics.

[58]  Peter Sandercock,et al.  The International Stroke Trial (IST): a randomised trial of aspirin, subcutaneous heparin, both, or neither among 19 435 patients with acute ischaemic stroke , 1997, The Lancet.

[59]  Wei Pan,et al.  Testing for Polygenic Effects in Genome‐Wide Association Studies , 2015, Genetic epidemiology.

[60]  X. Hua,et al.  Winner's Curse Correction and Variable Thresholding Improve Performance of Polygenic Risk Modeling Based on Genome-Wide Association Study Summary-Level Data , 2016, bioRxiv.

[61]  Christian Gieger,et al.  Six new loci associated with body mass index highlight a neuronal influence on body weight regulation , 2009, Nature Genetics.

[62]  Jon Wakefield,et al.  A Bayesian measure of the probability of false discovery in genetic epidemiology studies. , 2007, American journal of human genetics.

[63]  W. Zheng,et al.  Interaction of cigarette smoking and carcinogen-metabolizing polymorphisms in the risk of colorectal polyps. , 2013, Carcinogenesis.

[64]  Andrew J. Saykin,et al.  Identifying significant gene‐environment interactions using a combination of screening testing and hierarchical false discovery rate control , 2016, Genetic epidemiology.

[65]  J. Potash,et al.  Polygenic interactions with environmental adversity in the aetiology of major depressive disorder , 2015, Psychological Medicine.

[66]  W. Willett,et al.  Breast Cancer Risk From Modifiable and Nonmodifiable Risk Factors Among White Women in the United States. , 2016, JAMA oncology.

[67]  Mathieu Lemire,et al.  SBERIA: Set‐Based Gene‐Environment Interaction Test for Rare and Common Variants in Complex Diseases , 2013, Genetic epidemiology.

[68]  Comparison of weighting approaches for genetic risk scores in gene-environment interaction studies , 2017, BMC Genetics.

[69]  R. Cadoret,et al.  Evidence for gene-environment interaction in the development of adolescent antisocial behavior , 1983, Behavior genetics.

[70]  Fernando Pires Hartwig,et al.  A Large-Scale Multi-ancestry Genome-wide Study Accounting for Smoking Behavior Identifies Multiple Significant Loci for Blood Pressure. , 2018, American journal of human genetics.

[71]  R Ottman,et al.  Gene-environment interaction: definitions and study designs. , 1996, Preventive medicine.

[72]  W. Gauderman,et al.  Gene-environment interaction in genome-wide association studies. , 2008, American journal of epidemiology.

[73]  Stéphane Joost,et al.  Gene–obesogenic environment interactions in the UK Biobank study , 2017, International journal of epidemiology.

[74]  Peter Kraft,et al.  Interactions between genetic variants and breast cancer risk factors in the breast and prostate cancer cohort consortium. , 2011, Journal of the National Cancer Institute.

[75]  Caroline Leigh Watkins The International Stroke Trial (IST): a randomised trial of aspirin, subcutaneous heparin, both, or neither among 19 435 patients with acute ischaemic stroke , 1997 .

[76]  K. Kohara,et al.  Association of the GNAS1 gene variant with hypertension is dependent on alcohol consumption. , 2003, Hypertension research : official journal of the Japanese Society of Hypertension.

[77]  A. Linneberg,et al.  The association of ADH and ALDH gene variants with alcohol drinking habits and cardiovascular disease risk factors. , 2008, Alcoholism, clinical and experimental research.

[78]  E. Vassos,et al.  Prospects for using risk scores in polygenic medicine , 2017, Genome Medicine.

[79]  Andres Metspalu,et al.  Personalized risk prediction for type 2 diabetes: the potential of genetic risk scores , 2016, Genetics in Medicine.

[80]  Peter Kraft,et al.  Gene‐Environment Interactions in Cancer Epidemiology: A National Cancer Institute Think Tank Report , 2013, Genetic epidemiology.

[81]  James Y Dai,et al.  Two-stage testing procedures with independent filtering for genome-wide gene-environment interaction. , 2012, Biometrika.

[82]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[83]  N. Wray,et al.  A mega-analysis of genome-wide association studies for major depressive disorder , 2013, Molecular Psychiatry.

[84]  H. Yamashita,et al.  Gene–environment interactions in obesity: implication for future applications in preventive medicine , 2015, Journal of Human Genetics.

[85]  Michael S. Reidy,et al.  Genetic Modulation of Lipid Profiles following Lifestyle Modification or Metformin Treatment: The Diabetes Prevention Program , 2012, PLoS genetics.

[86]  Juan Pablo Lewinger,et al.  Sample size requirements to detect gene‐environment interactions in genome‐wide association studies , 2011, Genetic epidemiology.

[87]  P. Visscher,et al.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits , 2012, Nature Genetics.

[88]  Zheng-Ming Chen,et al.  CAST: randomised placebo-controlled trial of early aspirin use in 20 000 patients with acute ischaemic stroke , 1997, The Lancet.

[89]  Nilanjan Chatterjee,et al.  Common genetic polymorphisms modify the effect of smoking on absolute risk of bladder cancer. , 2013, Cancer research.

[90]  A. Deng Genetic basis of polygenic hypertension. , 2007, Human molecular genetics.

[91]  S. Manuck,et al.  Gene-environment interaction. , 2014, Annual review of psychology.