Inclusion of gene-gene and gene-environment interactions unlikely to dramatically improve risk prediction for complex diseases.

Genome-wide association studies have identified hundreds of common genetic variants associated with the risk of multifactorial diseases. However, their impact on discrimination and risk prediction is limited. It has been suggested that the identification of gene-gene (G-G) and gene-environment (G-E) interactions would improve disease prediction and facilitate prevention. We conducted a simulation study to explore the potential improvement in discrimination if G-G and G-E interactions exist and are known. We used three diseases (breast cancer, type 2 diabetes, and rheumatoid arthritis) as motivating examples. We show that the inclusion of G-G and G-E interaction effects in risk-prediction models is unlikely to dramatically improve the discrimination ability of these models.

[1]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[2]  W. Gauderman,et al.  Gene-environment interaction in genome-wide association studies. , 2008, American journal of epidemiology.

[3]  M. Pepe The Statistical Evaluation of Medical Tests for Classification and Prediction , 2003 .

[4]  M. Pencina,et al.  Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond , 2008, Statistics in medicine.

[5]  David M. Evans,et al.  Two-Stage Two-Locus Models in Genome-Wide Association , 2006, PLoS genetics.

[6]  Ewout W Steyerberg,et al.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers , 2011, Statistics in medicine.

[7]  G. Rose Sick individuals and sick populations. , 2001, International journal of epidemiology.

[8]  W. Willett,et al.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer , 2007, Nature Genetics.

[9]  Hon-Cheong So,et al.  A Unifying Framework for Evaluating the Predictive Power of Genetic Variants Based on the Level of Heritability Explained , 2010, PLoS genetics.

[10]  Chris S. Haley,et al.  Epistasis: too often neglected in complex trait studies? , 2004, Nature Reviews Genetics.

[11]  R. Wilkins Polygenes, risk prediction, and targeted prevention of breast cancer. , 2008, The New England journal of medicine.

[12]  Peter Kraft,et al.  Characterizing Associations and SNP-Environment Interactions for GWAS-Identified Prostate Cancer Risk Markers—Results from BPC3 , 2011, PloS one.

[13]  A. Whittemore,et al.  Assessing interactions between the associations of common genetic susceptibility variants, reproductive history and body mass index with breast cancer risk in the breast cancer association consortium: a combined case-control study , 2010, Breast Cancer Research.

[14]  P. Kraft,et al.  Cumulative association of 22 genetic variants with seropositive rheumatoid arthritis risk , 2010, Annals of the rheumatic diseases.

[15]  C I Amos,et al.  Re: Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. , 2009, Journal of the National Cancer Institute.

[16]  P. Gregersen,et al.  Gene-gene and gene-environment interactions involving HLA-DRB1, PTPN22, and smoking in two subsets of rheumatoid arthritis. , 2007, American journal of human genetics.

[17]  Margaret S Pepe,et al.  Gauging the performance of SNPs, biomarkers, and clinical factors for predicting risk of breast cancer. , 2008, Journal of the National Cancer Institute.

[18]  Beate Ritz,et al.  Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee , 2011, PLoS genetics.

[19]  Peter Kraft,et al.  Gene-environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes. , 2012, American journal of epidemiology.

[20]  E. Demerath,et al.  Physical Activity Attenuates the Influence of FTO Variants on Obesity Risk: A Meta-Analysis of 218,166 Adults and 19,268 Children , 2011, PLoS medicine.

[21]  M. Gail Abstract CN07-03: Discriminatory accuracy from Single Nucleotide Polymorphisms in models of absolute breast cancer risk , 2008 .

[22]  N. Cook Use and Misuse of the Receiver Operating Characteristic Curve in Risk Prediction , 2007, Circulation.

[23]  M. Gail,et al.  Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. , 1989, Journal of the National Cancer Institute.

[24]  L. Qi,et al.  Interactions between genetic factors that predict diabetes and dietary factors that ultimately impact on risk of diabetes , 2010, Current opinion in lipidology.

[25]  W. Willett,et al.  Moderate alcohol consumption and the risk of breast cancer. , 1987, The New England journal of medicine.

[26]  M. Khoury,et al.  Genomic profiling to promote a healthy lifestyle: not ready for prime time , 2003, Nature Genetics.

[27]  Tianxi Cai,et al.  Joint Effects of Common Genetic Variants on the Risk for Type 2 Diabetes in U.S. Men and Women of European Ancestry , 2009, Annals of Internal Medicine.

[28]  Michel Eichelbaum,et al.  Pharmacogenomics and individualized drug therapy. , 2006, Annual review of medicine.

[29]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[30]  S. Lewis,et al.  Alcohol, ALDH2, and Esophageal Cancer: A Meta-analysis Which Illustrates the Potentials and Limitations of a Mendelian Randomization Approach , 2005, Cancer Epidemiology Biomarkers & Prevention.

[31]  Peter Kraft,et al.  Genetic variants at 2q24 are associated with susceptibility to type 2 diabetes. , 2010, Human molecular genetics.

[32]  M. Pirmohamed,et al.  Cost-effectiveness analysis of HLA B*5701 genotyping in preventing abacavir hypersensitivity. , 2004, Pharmacogenetics.

[33]  Robert M. Plenge,et al.  Defining the Role of the MHC in Autoimmunity: A Review and Pooled Analysis , 2008, PLoS genetics.

[34]  M. Gail Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. , 2008, Journal of the National Cancer Institute.

[35]  S. Humphries,et al.  Utility of genetic and non-genetic risk factors in prediction of type 2 diabetes: Whitehall II prospective cohort study , 2010, BMJ : British Medical Journal.

[36]  Scott M. Williams,et al.  Epistasis and its implications for personal genetics. , 2009, American journal of human genetics.

[37]  H. Cordell Detecting gene–gene interactions that underlie human diseases , 2009, Nature Reviews Genetics.

[38]  M S Pepe,et al.  Comments on ‘Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond’ by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929) , 2008, Statistics in medicine.

[39]  L. Alfredsson,et al.  A gene-environment interaction between smoking and shared epitope genes in HLA-DR provides a high risk of seropositive rheumatoid arthritis. , 2004, Arthritis and rheumatism.

[40]  Cornelia M van Duijn,et al.  Genome-based prediction of common diseases: advances and prospects. , 2008, Human molecular genetics.

[41]  Peter Kraft,et al.  Interactions between genetic variants and breast cancer risk factors in the breast and prostate cancer cohort consortium. , 2011, Journal of the National Cancer Institute.

[42]  Godfrey Fowler,et al.  THE STRATEGY OF PREVENTIVE MEDICINE , 1992 .

[43]  M. Khoury,et al.  An epidemiologic assessment of genomic profiling for measuring susceptibility to common diseases and targeting interventions , 2004, Genetics in Medicine.

[44]  Juan Pablo Lewinger,et al.  Invited commentary: GE-Whiz! Ratcheting gene-environment studies up to the whole genome and the whole exposome. , 2012, American journal of epidemiology.

[45]  Annie E. Hill,et al.  Genetic architecture of complex traits: Large phenotypic effects and pervasive epistasis , 2008, Proceedings of the National Academy of Sciences.

[46]  J. Bousquet,et al.  Effect of 17q21 variants and smoking exposure in early-onset asthma. , 2008, The New England journal of medicine.

[47]  Michael E Goddard,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Multi-locus models of genetic risk of disease , 2009 .

[48]  Nilanjan Chatterjee,et al.  Common Genetic Variants and Central Adiposity Among Asian‐Indians , 2012, Obesity.

[49]  J. Manson,et al.  Biomarkers of inflammation and development of rheumatoid arthritis in women from two prospective cohort studies. , 2009, Arthritis and rheumatism.

[50]  Nancy R Cook,et al.  Association between a literature-based genetic risk score and cardiovascular events in women. , 2010, JAMA.

[51]  A. Zlotta NAT2 Slow Acetylation, GSTM1 Null Genotype, and Risk of Bladder Cancer: Results from the Spanish Bladder Cancer Study and Meta-Analyses , 2006 .

[52]  Hongbing Shen,et al.  Genome-wide association study identifies three new susceptibility loci for esophageal squamous-cell carcinoma in Chinese populations , 2011, Nature Genetics.

[53]  Peter Kraft,et al.  Gene‐environment interplay in common complex diseases: forging an integrative model—recommendations from an NIH workshop , 2011, Genetic epidemiology.

[54]  A. Hofman,et al.  American Journal of Epidemiology Practice of Epidemiology Improvement of Risk Prediction by Genomic Profiling: Reclassification Measures versus the Area under the Receiver Operating Characteristic Curve , 2022 .

[55]  M. Thun,et al.  Performance of Common Genetic Variants in Breast-cancer Risk Models , 2022 .

[56]  Mitchell H Gail,et al.  On criteria for evaluating models of absolute risk. , 2005, Biostatistics.

[57]  Gudmundur A. Thorisson,et al.  The International HapMap Project Web site. , 2005, Genome research.

[58]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[59]  Kristopher J. Stanya,et al.  Genetic variants at 2 q 24 are associated with susceptibility to type 2 diabetes , 2010 .

[60]  Kathryn Roeder,et al.  Next generation analytic tools for large scale genetic epidemiology studies of complex diseases , 2012, Genetic epidemiology.

[61]  F. Grodstein,et al.  Do breast-feeding and other reproductive factors influence future risk of rheumatoid arthritis? Results from the Nurses' Health Study. , 2004, Arthritis and rheumatism.

[62]  Jaeil Ahn,et al.  Testing gene-environment interaction in large-scale case-control association studies: possible choices and comparisons. , 2012, American journal of epidemiology.

[63]  David A. Hinds,et al.  Assessment of Clinical Validity of a Breast Cancer Risk Model Combining Genetic and Clinical Information , 2010, Journal of the National Cancer Institute.

[64]  W. Willett,et al.  Multiple loci identified in a genome-wide association study of prostate cancer , 2008, Nature Genetics.

[65]  E. Karlson,et al.  Smoking intensity, duration, and cessation, and the risk of rheumatoid arthritis in women. , 2006, The American journal of medicine.

[66]  D. Thomas,et al.  Biological models and statistical interactions: an example from multistage carcinogenesis. , 1981, International journal of epidemiology.

[67]  Sander Greenland,et al.  The need for reorientation toward cost‐effective prediction: Comments on ‘Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond’ by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929) , 2008, Statistics in medicine.

[68]  W. Willett,et al.  A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1) , 2009, Nature Genetics.