Overcoming the winner's curse: estimating penetrance parameters from case-control data.

Genomewide association studies are now a widely used approach in the search for loci that affect complex traits. After detection of significant association, estimates of penetrance and allele-frequency parameters for the associated variant indicate the importance of that variant and facilitate the planning of replication studies. However, when these estimates are based on the original data used to detect the variant, the results are affected by an ascertainment bias known as the "winner's curse." The actual genetic effect is typically smaller than its estimate. This overestimation of the genetic effect may cause replication studies to fail because the necessary sample size is underestimated. Here, we present an approach that corrects for the ascertainment bias and generates an estimate of the frequency of a variant and its penetrance parameters. The method produces a point estimate and confidence region for the parameter estimates. We study the performance of this method using simulated data sets and show that it is possible to greatly reduce the bias in the parameter estimates, even when the original association study had low power. The uncertainty of the estimate decreases with increasing sample size, independent of the power of the original test for association. Finally, we show that application of the method to case-control data can improve the design of replication studies considerably.
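The ascertainment bias described above can be illustrated with a small simulation. The sketch below is not the correction method of the paper; it only demonstrates the problem, and it uses an allelic odds ratio as a simplified stand-in for the penetrance parameters. All names and parameter values (`true_or`, `control_freq`, `n_per_group`, `z_threshold`) are illustrative assumptions. It repeatedly simulates a case-control study, keeps only the replicates that pass a significance threshold, and compares the average estimated odds ratio among those "winners" with the average over all replicates.

```python
import math
import random


def case_allele_freq(control_freq, odds_ratio):
    """Allele frequency in cases implied by an allelic odds ratio (assumed model)."""
    odds = odds_ratio * control_freq / (1.0 - control_freq)
    return odds / (1.0 + odds)


def draw_allele_count(n_alleles, freq, rng):
    """Binomial draw built from Bernoulli trials, using only the standard library."""
    return sum(1 for _ in range(n_alleles) if rng.random() < freq)


def winners_curse_demo(true_or=1.3, control_freq=0.3, n_per_group=400,
                       z_threshold=3.0, reps=2000, seed=1):
    """Compare odds-ratio estimates from all replicates with those from
    replicates that reached significance (hypothetical settings)."""
    rng = random.Random(seed)
    n_alleles = 2 * n_per_group  # two alleles per individual
    p_case = case_allele_freq(control_freq, true_or)

    all_log_or, sig_log_or = [], []
    for _ in range(reps):
        a = draw_allele_count(n_alleles, p_case, rng)        # risk alleles, cases
        c = draw_allele_count(n_alleles, control_freq, rng)  # risk alleles, controls
        b, d = n_alleles - a, n_alleles - c
        # Adding 0.5 to each cell avoids division by zero in sparse tables.
        log_or = math.log((a + 0.5) * (d + 0.5) / ((b + 0.5) * (c + 0.5)))
        se = math.sqrt(1 / (a + 0.5) + 1 / (b + 0.5)
                       + 1 / (c + 0.5) + 1 / (d + 0.5))
        all_log_or.append(log_or)
        if abs(log_or / se) > z_threshold:  # two-sided Wald test
            sig_log_or.append(log_or)

    def geometric_mean_or(values):
        return math.exp(sum(values) / len(values)) if values else None

    return {
        "true_or": true_or,
        "power": len(sig_log_or) / reps,
        "mean_or_all": geometric_mean_or(all_log_or),
        "mean_or_significant": geometric_mean_or(sig_log_or),
    }


if __name__ == "__main__":
    for key, value in winners_curse_demo().items():
        print(key, value)
```

Under these assumed settings the test has low power, so the significant replicates are mostly those in which sampling noise pushed the estimate upward. Their average odds ratio therefore exceeds the true value, while the average over all replicates stays close to it. Sizing a replication study from the significant-only estimate would understate the sample size needed.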
