The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures.

The rapid development of a dense single-nucleotide-polymorphism marker map has stimulated numerous studies attempting to characterize the magnitude and distribution of background linkage disequilibrium (LD) within and between human populations. Although genotyping errors are an inherent problem in all LD studies, there have been few systematic investigations documenting their consequences for estimates of background LD. Therefore, we derived simple deterministic formulas to investigate the effect that genotyping errors have on four commonly used LD measures (D', r, Q, and d) in studies of background LD. We found that genotyping error rates as small as 3% can have serious effects on these LD measures, depending on the allele frequencies and the assumed error model. Furthermore, we compared the robustness of D', r, Q, and d in the presence of genotyping errors. In general, Q and d are more robust than D' and r, although exceptions do exist. Finally, through stochastic simulations, we illustrate how genotyping errors can lead to erroneous inferences when measures of LD between two samples are compared.
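The abstract does not reproduce the formulas, so the following is only a minimal sketch of how the four measures might be computed and perturbed. It assumes the textbook definitions for two biallelic loci (D' as D scaled by its maximum, r as the haplotype correlation, Q as Yule's Q, and d as D divided by the allele-frequency product at the second locus). The error model, in which each allele is independently miscalled as the other with probability `e`, is a hypothetical illustration and not necessarily one of the error models analyzed in the study. The function names `ld_measures` and `apply_allele_errors` are likewise invented for this example.

```python
from math import sqrt


def ld_measures(p11, p12, p21, p22):
    """Return (D', r, Q, d) from haplotype frequencies AB, Ab, aB, ab.

    Assumes both loci are polymorphic, so no denominator is zero.
    """
    pA = p11 + p12                      # frequency of allele A at locus 1
    pB = p11 + p21                      # frequency of allele B at locus 2
    D = p11 * p22 - p12 * p21           # raw disequilibrium coefficient
    if D >= 0:
        d_max = min(pA * (1 - pB), (1 - pA) * pB)
    else:
        d_max = min(pA * pB, (1 - pA) * (1 - pB))
    d_prime = D / d_max                 # signed D'
    r = D / sqrt(pA * (1 - pA) * pB * (1 - pB))
    Q = D / (p11 * p22 + p12 * p21)     # Yule's Q
    d = D / (pB * (1 - pB))             # equals P(A|B) - P(A|b)
    return d_prime, r, Q, d


def apply_allele_errors(p11, p12, p21, p22, e):
    """Observed haplotype frequencies under a hypothetical error model.

    Assumption: each allele at each locus is independently miscalled
    as the other allele with probability e.
    """
    true = {(0, 0): p11, (0, 1): p12, (1, 0): p21, (1, 1): p22}
    obs = {}
    for a in (0, 1):
        for b in (0, 1):
            obs[(a, b)] = sum(
                freq
                * ((1 - e) if ta == a else e)
                * ((1 - e) if tb == b else e)
                for (ta, tb), freq in true.items()
            )
    return obs[(0, 0)], obs[(0, 1)], obs[(1, 0)], obs[(1, 1)]


if __name__ == "__main__":
    true_freqs = (0.5, 0.0, 0.0, 0.5)   # complete LD, illustrative values
    observed = apply_allele_errors(*true_freqs, e=0.03)
    print("true    :", ld_measures(*true_freqs))
    print("observed:", ld_measures(*observed))
```

Under this particular symmetric model the raw coefficient D shrinks by a factor of (1 - 2e)^2, so the measures differ only in how their denominators respond to the shifted haplotype and allele frequencies. Other error models, such as errors directed toward one allele, would generally behave differently.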
