Undetected genotyping errors cause apparent overtransmission of common alleles in the transmission/disequilibrium test.

The transmission/disequilibrium test (TDT), a family-based test of linkage and association, is a popular and intuitive statistical test for studies of complex inheritance, as it is nonparametric and robust to population stratification. We carried out a literature search and located 79 significant TDT-derived associations between a microsatellite marker allele and a disease. Among these, there were 31 (39%) in which the most common allele was found to exhibit distorted transmission to affected offspring, implying that the allele may be associated with either susceptibility to or protection from a disease. In 27 of these 31 studies (87%), the most common allele appeared to be overtransmitted to affected offspring (a risk factor), and, in the remaining 4 studies, the most common allele appeared to be undertransmitted (a protective factor). In a second literature search, we identified 92 case-control studies in which a microsatellite marker allele was found to have significantly different frequencies in case and control groups. Of these, there were 37 instances (40%) in which the most common allele was involved. In 12 of these 37 studies (32%), the most common allele was enriched in cases relative to controls (a risk factor), and, in the remaining 25 studies, the most common allele was enriched in controls (a protective factor). Thus, the most common allele appears to be a risk factor when identified through the TDT, and it appears to be protective when identified through case-control analysis. To understand this phenomenon, we incorporated an error model into the calculation of the TDT statistic. We show that undetected genotyping error can cause apparent transmission distortion at markers with alleles of unequal frequency. We demonstrate that this distortion is in the direction of overtransmission for common alleles. Therefore, we conclude that undetected genotyping errors may be contributing to an inflated false-positive rate among reported TDT-derived associations and that genotyping fidelity must be increased.

[1]  N E Morton,et al.  Error filtration, interference, and the human linkage map. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Jeanette C Papp,et al.  Detection and integration of genotyping errors in statistical genetics. , 2002, American journal of human genetics.

[3]  Derek Gordon,et al.  True Pedigree Errors More Frequent Than Apparent Errors for Single Nucleotide Polymorphisms , 1999, Human Heredity.

[4]  J. Ott,et al.  A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data. , 2001, American journal of human genetics.

[5]  D. Weeks,et al.  A tale of two genotypes: consistency between two high-throughput genotyping centers. , 2002, Genome research.

[6]  H. J. Evans,et al.  Automation of genetic linkage analysis using fluorescent microsatellite markers. , 1994, Genomics.

[7]  Michael Boehnke,et al.  Probability of detection of genotyping errors and mutations as inheritance inconsistencies in nuclear-family data. , 2002, American journal of human genetics.

[8]  A. Roter,et al.  An approach to high-throughput genotyping. , 1996, Genome research.

[9]  E S Lander,et al.  Systematic detection of errors in genetic linkage data. , 1992, Genomics.

[10]  J. Ott,et al.  Molecular and statistical approaches to the detection and correction of errors in genotype databases. , 1993, American journal of human genetics.

[11]  W. Ewens,et al.  The transmission/disequilibrium test: history, subdivision, and admixture. , 1995, American journal of human genetics.

[12]  K H Buetow,et al.  Influence of aberrant observations on high-resolution linkage analysis outcomes. , 1991, American journal of human genetics.

[13]  Joseph B. Rayman,et al.  Methods for precise sizing, automated binning of alleles, and reduction of error rates in large-scale genotyping using fluorescently labeled dinucleotide markers. FUSION (Finland-U.S. Investigation of NIDDM Genetics) Study Group. , 1997, Genome research.

[14]  R M Idury,et al.  A simple method for automated allele binning in microsatellite markers. , 1997, Genome research.

[15]  P. Jordan,et al.  PCR amplification introduces errors into mononucleotide and dinucleotide repeat sequences , 2001, Molecular pathology : MP.

[16]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[17]  F. Collins,et al.  Approach to genotyping errors caused by nontemplated nucleotide addition by Taq DNA polymerase. , 1995, Genome research.

[18]  H H Göring,et al.  Linkage analysis in the presence of errors II: marker-locus genotyping errors modeled with hypercomplex recombination fractions. , 2000, American journal of human genetics.

[19]  Jurg Ott,et al.  Assessment and management of single nucleotide polymorphism genotype errors in genetic association analysis. , 2000 .

[20]  A Chakravarti,et al.  High-throughput variation detection and genotyping using microarrays. , 2001, Genome research.

[21]  W. Ewens,et al.  Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). , 1993, American journal of human genetics.

[22]  D. Levinson,et al.  Identification and analysis of error types in high-throughput genotyping. , 2000, American journal of human genetics.

[23]  K Lange,et al.  A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data. , 2000, American journal of human genetics.

[24]  M. Xiong,et al.  The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures. , 2001, American journal of human genetics.