Estimation of effect size distribution from genome-wide association studies and implications for future discoveries

We report a set of tools to estimate the number of susceptibility loci and the distribution of their effect sizes for a trait on the basis of discoveries from existing genome-wide association studies (GWASs). We propose statistical power calculations for future GWASs using estimated distributions of effect sizes. Using reported GWAS findings for height, Crohn's disease and breast, prostate and colorectal (BPC) cancers, we determine that each of these traits is likely to harbor additional loci within the spectrum of low-penetrance common variants. These loci, which can be identified from sufficiently powerful GWASs, together could explain at least 15–20% of the known heritability of these traits. However, for BPC cancers, which have modest familial aggregation, our analysis suggests that risk models based on common variants alone will have modest discriminatory power (63.5% area under curve), even with new discoveries.

[1]  W. Willett,et al.  A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1) , 2009, Nature Genetics.

[2]  Steven Gallinger,et al.  Meta-analysis of genome-wide association data identifies four new susceptibility loci for colorectal cancer , 2008, Nature Genetics.

[3]  M. Gail Value of adding single-nucleotide polymorphism genotypes to a breast cancer risk model. , 2009, Journal of the National Cancer Institute.

[4]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[5]  J. Kaprio,et al.  Environmental and heritable factors in the causation of cancer--analyses of cohorts of twins from Sweden, Denmark, and Finland. , 2000, The New England journal of medicine.

[6]  Pauline C Ng,et al.  Power to Detect Risk Alleles Using Genome-Wide Tag SNP Panels , 2007, PLoS genetics.

[7]  Carl D Langefeld,et al.  Power for genetic association studies with random allele frequencies and genotype distributions. , 2004, American journal of human genetics.

[8]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[9]  Ali Amin Al Olama,et al.  Multiple newly identified loci associated with prostate cancer susceptibility , 2008, Nature Genetics.

[10]  Ali Amin Al Olama,et al.  Identification of seven new prostate cancer susceptibility loci through a genome-wide association study , 2009, Nature Genetics.

[11]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[12]  T. Frayling,et al.  Reaching new heights: insights into the genetics of human stature. , 2008, Trends in Genetics.

[13]  Peter Kraft,et al.  Beyond odds ratios — communicating disease risk based on genetic profiles , 2009, Nature Reviews Genetics.

[14]  W. Willett,et al.  Multiple loci identified in a genome-wide association study of prostate cancer , 2008, Nature Genetics.

[15]  David B. Goldstein,et al.  Rare Variants Create Synthetic Genome-Wide Associations , 2010, PLoS biology.

[16]  Nicholas J Schork,et al.  Power calculations for genetic association studies using estimated probability distributions. , 2002, American journal of human genetics.

[17]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[18]  Peter Kraft,et al.  Genetic risk prediction--are we there yet? , 2009, The New England journal of medicine.

[19]  R. D'Agostino,et al.  Genotype score in addition to common risk factors for prediction of type 2 diabetes. , 2008, The New England journal of medicine.

[20]  R. Prentice,et al.  Correcting “winner's curse” in odds ratios from genomewide association findings for major complex human diseases , 2009, Genetic epidemiology.

[21]  M. Thun,et al.  Performance of Common Genetic Variants in Breast-cancer Risk Models , 2022 .

[22]  Bjarni V. Halldórsson,et al.  Many sequence variants affecting diversity of adult human height , 2008, Nature Genetics.

[23]  Peter M Visscher,et al.  Sizing up human height variation , 2008, Nature Genetics.

[24]  C. Gieger,et al.  Identification of ten loci associated with height highlights new biological pathways in human growth , 2008, Nature Genetics.

[25]  David M. Evans,et al.  Genome-wide association analysis identifies 20 loci that influence adult height , 2008, Nature Genetics.

[26]  P. Donnelly,et al.  Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip , 2009, PLoS genetics.

[27]  R. Prentice,et al.  Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies. , 2008, Biostatistics.

[28]  Douglas F. Easton,et al.  Polygenic susceptibility to breast cancer and implications for prevention , 2002, Nature Genetics.

[29]  H. Grönberg,et al.  Estimation of absolute risk for prostate cancer using genetic markers and family history , 2009, The Prostate.

[30]  Fei Zou,et al.  Estimating odds ratios in genome scans: an approximate conditional likelihood approach. , 2008, American journal of human genetics.

[31]  Suzanne M. Leal,et al.  Discovery of Rare Variants via Sequencing: Implications for the Design of Complex Trait Association Studies , 2009, PLoS genetics.

[32]  M. Gail Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk. , 2008, Journal of the National Cancer Institute.

[33]  H. A. Orr,et al.  THE POPULATION GENETICS OF ADAPTATION: THE DISTRIBUTION OF FACTORS FIXED DURING ADAPTIVE EVOLUTION , 1998, Evolution; international journal of organic evolution.

[34]  D. Goldstein Common genetic variation and human traits. , 2009, The New England journal of medicine.

[35]  J. Hirschhorn Genomewide association studies--illuminating biologic pathways. , 2009, The New England journal of medicine.

[36]  Qizhai Li,et al.  Flexible design for following up positive findings. , 2007, American journal of human genetics.