Enhancing the Power to Detect Low-Frequency Variants in Genome-Wide Screens

In genetic association studies a conventional test statistic is proportional to the correlation coefficient between the trait and the variant, with the result that it lacks power to detect association for low-frequency variants. Considering the link between the conventional association test statistics and the linkage disequilibrium measure r2, we propose a test statistic analogous to the standardized linkage disequilibrium D′ to increase the power of detecting association for low-frequency variants. By both simulation and real data analysis we show that the proposed D′ test is more powerful than the conventional methods for detecting association for low-frequency variants in a genome-wide setting. The optimal coding strategy for the D′ test and its asymptotic properties are also investigated. In summary, we advocate using the D′ test in a dominant model as a complementary approach to enhancing the power of detecting association for low-frequency variants with moderate to large effect sizes in case-control genome-wide association studies.

[1]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[2]  Hung-Chih Ku,et al.  A Comparison of the Likelihood Ratio Test and the Variance‐Stabilising Transformation‐Based Tests for Detecting Association of Rare Variants , 2013, Annals of human genetics.

[3]  Chang-Yun Lin,et al.  Measuring linkage disequilibrium by the partial correlation coefficient , 2012, Heredity.

[4]  Tanya M. Teslovich,et al.  Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways , 2012, Nature Genetics.

[5]  Claude Bouchard,et al.  A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance , 2012, Nature Genetics.

[6]  Chang-Yun Lin,et al.  Blindly Using Wald's Test Can Miss Rare Disease‐Causal Variants in Case‐Control Association Studies , 2012, Annals of human genetics.

[7]  T. Hansen,et al.  The effect of FOXA2 rs1209523 on glucose-related phenotypes and risk of type 2 diabetes in Danish individuals , 2012, BMC Medical Genetics.

[8]  Chang-Yun Lin,et al.  A Comparison of Approaches to Control for Confounding Factors by Regression Models , 2011, Human Heredity.

[9]  Carlos Zapata,et al.  On the Uses and Applications of the Most Commonly Used Measures of Linkage Disequilibrium from the Comparative Analysis of Their Statistical Properties , 2011, Human Heredity.

[10]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[11]  Eric Boerwinkle,et al.  A weighted false discovery rate control procedure reveals alleles at FOXA2 that influence fasting glucose levels. , 2010, American journal of human genetics.

[12]  Christian Gieger,et al.  New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk , 2010, Nature Genetics.

[13]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[14]  P. Donnelly,et al.  Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip , 2009, PLoS genetics.

[15]  B. Mangin,et al.  The Estimator of the Optimal Measure of Allelic Association: Mean, Variance and Probability Distribution When the Sample Size Tends to Infinity , 2008, Statistical applications in genetics and molecular biology.

[16]  Montgomery Slatkin,et al.  Linkage disequilibrium — understanding the evolutionary past and mapping the medical future , 2008, Nature Reviews Genetics.

[17]  M. Newton Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis , 2008 .

[18]  B. Efron Size, power and false discovery rates , 2007, 0710.2245.

[19]  N. Morton,et al.  Genome scanning by composite likelihood. , 2007, American journal of human genetics.

[20]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[21]  M. Daly,et al.  Biases and reconciliation in estimates of linkage disequilibrium in the human genome. , 2006, American journal of human genetics.

[22]  B. Efron Correlation and Large-Scale Simultaneous Significance Testing , 2007 .

[23]  W. G. Hill,et al.  Linkage disequilibrium in finite populations , 1968, Theoretical and Applied Genetics.

[24]  Jakob C. Mueller,et al.  Linkage disequilibrium for different scales and applications , 2004, Briefings Bioinform..

[25]  Ronald M Peshock,et al.  The Dallas Heart Study: a population-based probability sample for the multidisciplinary study of ethnic differences in cardiovascular health. , 2004, The American journal of cardiology.

[26]  B. Efron Large-Scale Simultaneous Hypothesis Testing , 2004 .

[27]  Juliet M Chapman,et al.  Detecting Disease Associations due to Linkage Disequilibrium Using Haplotype Tags: A Class of Tests and the Determinants of Statistical Power , 2003, Human Heredity.

[28]  S. Shete,et al.  A Note on the Optimal Measure of Allelic Association , 2003, Annals of human genetics.

[29]  Joseph L. Gastwirth,et al.  Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness , 2002, Human Heredity.

[30]  D. Easton,et al.  Sampling distribution of summary linkage disequilibrium measures , 2002 .

[31]  J. Pritchard,et al.  Linkage disequilibrium in humans: models and data. , 2001, American journal of human genetics.

[32]  N. Morton,et al.  The optimal measure of allelic association , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[33]  S. Rodríguez,et al.  Sampling variance and distribution of the D' measure of overall gametic disequilibrium between multiallelic loci. , 2001, Annals of human genetics.

[34]  A Collins,et al.  Mapping a disease locus by allelic association. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[35]  P. Sasieni From genotypes to genes: doubling the sample size. , 1997, Biometrics.

[36]  S W Guo,et al.  Linkage disequilibrium measures for fine-scale mapping: a comparison. , 1997, Human heredity.

[37]  C Zapata,et al.  Approximate variance of the standardized measure of gametic disequilibrium D'. , 1997, American journal of human genetics.

[38]  K. Roeder,et al.  Disequilibrium mapping: composite likelihood for pairwise disequilibrium. , 1996, Genomics.

[39]  N. Risch,et al.  A comparison of linkage disequilibrium measures for fine-scale mapping. , 1995, Genomics.

[40]  J. Terwilliger A powerful likelihood method for the analysis of linkage disequilibrium between trait loci and one or more polymorphic marker loci. , 1995, American journal of human genetics.

[41]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[42]  R. Lewontin,et al.  On measures of gametic disequilibrium. , 1988, Genetics.

[43]  P. Hedrick Inference of recombinational hotspots using gametic disequilibrium values , 1988, Heredity.

[44]  P. Hedrick,et al.  Gametic disequilibrium measures: proceed with caution. , 1987, Genetics.

[45]  G. Thomson,et al.  Measuring the strength of associations between HLA antigens and diseases. , 1981, Tissue antigens.

[46]  W. G. Hill,et al.  Estimation of linkage disequilibrium in randomly mating populations , 1974, Heredity.

[47]  R. Lewontin The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models. , 1964, Genetics.

[48]  A. W. Edwards The Measure of Association in a 2 × 2 Table , 1963 .

[49]  R. Lewontin,et al.  THE EVOLUTIONARY DYNAMICS OF COMPLEX POLYMORPHISMS , , , 1960 .