Power consequences of linkage disequilibrium variation between populations

We quantify the degree to which linkage disequilibrium (LD) differences exist in the human genome and investigate the consequences that variation in LD patterns between populations can have on the power of case‐control and family‐trio association studies. Although only a small proportion of SNPs show significant LD differences (0.8–5%), these differences can introduce artificial signals of association and reduce the power to detect true associations in case‐control designs, even when meta‐analytic approaches are used to account for stratification. We show that combining trios from different populations in the presence of significant LD differences can adversely affect power even though the number of trios has increased. Our results have implications for genetic studies conducted in populations with substantial population structure and show that using meta‐analytic approaches or family‐based designs to protect against Type 1 error does not prevent loss of power due to differences in LD across populations. Genet. Epidemiol. 2008. © 2008 Wiley‐Liss, Inc.
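
The loss of power from pooling populations can be illustrated with a minimal sketch. It is not the authors' analysis: it assumes a simple haplotype model in which disease status depends only on a causal allele, the controls have the population frequency of that allele, and the genotyped marker is linked to the causal allele with LD coefficient D. The function name and all numeric values are hypothetical. Under these assumptions the expected case-minus-control difference in marker allele frequency is proportional to D, so two populations with LD of opposite sign (an extreme case of an LD difference) produce signals that cancel when equal-sized samples are pooled.

```python
def marker_freq_diff(p, q, d, p_case, p_ctrl):
    """Expected case-minus-control frequency difference at a marker allele.

    p, q   : population frequencies of the causal and marker alleles
    d      : LD coefficient D between them (haplotype freq minus p*q)
    p_case : causal-allele frequency among cases (hypothetical value)
    p_ctrl : causal-allele frequency among controls (hypothetical value)
    Assumes disease status depends only on the causal allele.
    """
    # Marker-allele frequency on haplotypes with / without the causal allele.
    b_given_causal = q + d / p
    b_given_other = q - d / (1 - p)
    case = p_case * b_given_causal + (1 - p_case) * b_given_other
    ctrl = p_ctrl * b_given_causal + (1 - p_ctrl) * b_given_other
    return case - ctrl


# Two hypothetical populations: same allele frequencies, opposite-sign LD.
diff_pop1 = marker_freq_diff(p=0.3, q=0.4, d=0.05, p_case=0.36, p_ctrl=0.30)
diff_pop2 = marker_freq_diff(p=0.3, q=0.4, d=-0.05, p_case=0.36, p_ctrl=0.30)

# Pooling equal-sized samples averages the two expected differences.
pooled = (diff_pop1 + diff_pop2) / 2

print("population 1:", diff_pop1)
print("population 2:", diff_pop2)
print("pooled:", pooled)
```

Each population on its own carries a nonzero expected signal at the marker, while the pooled signal is essentially zero, even though the pooled sample is twice as large. Smaller LD differences would attenuate the pooled signal rather than cancel it completely.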
