Incorporating Individual Error Rate into Association Test of Unmatched Case-Control Design

Objectives: Genotyping error commonly occurs and could reduce the power and bias statistical inference in genetics studies. In addition to genotypes, some automated biotechnologies also provide quality measurement of each individual genotype. We studied the relationship between the quality measurement and genotyping error rate. Furthermore, we propose two association tests incorporating the genotyping quality information with the goal to improve statistical power and inference. Methods: 50 pairs of DNA sample duplicates were typed for 232 SNPs by BeadArray technology. We used scatter plot, smoothing function and generalized additive models to investigate the relationship between genotype quality score (q) and inconsistency rate (ĩ) among duplicates. We constructed two association tests: (1) weighted contingency table test (WCT) and (2) likelihood ratio test (LRT) to incorporate individual genotype error rate (Εi), in unmatched case-control setting. Results: In the 50 duplicates, we found q and ĩ were in strong negative association, suggesting the genotypes with low quality score were more likely to be mistyped. The WCT improved the statistical power and partially corrects the bias in point estimation. The LRT offered moderate power gain, but was able to correct the bias in odds ratio estimation. The two new methods also performed favorably in some scenarios when Εi was mis-specified. Conclusions: With increasing number of genetic studies and application of automated genotyping technology, there is a growing need to adequately account for individual genotype error rate in statistical analysis. Our study represents an initial step to address this need and points out a promising direction for further research.

[1]  T C Matise,et al.  Power loss for multiallelic transmission/disequilibrium test when errors introduced: GAW11 simulated data , 1999, Genetic epidemiology.

[2]  Tianhua Niu,et al.  Incorporating genotyping uncertainty in haplotype inference for single-nucleotide polymorphisms. , 2004, American journal of human genetics.

[3]  J. Ott,et al.  An analytic solution to single nucleotide polymorphism error-detection rates in nuclear families: implications for study design. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[4]  Cheng Li,et al.  Estimation of genotype error rate using samples with pedigree information--an application on the GeneChip Mapping 10K array. , 2004, Genomics.

[5]  J. Ott,et al.  Power and Sample Size Calculations for Case-Control Genetic Association Tests when Errors Are Present: Application to Single Nucleotide Polymorphisms , 2002, Human Heredity.

[6]  Tianhua Niu,et al.  A candidate gene association study on preterm delivery: application of high-throughput genotyping technology and advanced statistical methods. , 2004, Human molecular genetics.

[7]  Michael Boehnke,et al.  Probability of detection of genotyping errors and mutations as inheritance inconsistencies in nuclear-family data. , 2002, American journal of human genetics.

[8]  Derek Gordon,et al.  Errors and Linkage Disequilibrium Interact Multiplicatively When Computing Sample Sizes for Genetic Case-Control Association Studies , 2002, Pacific Symposium on Biocomputing.

[9]  A. Oliphant,et al.  BeadArray technology: enabling an accurate, cost-effective approach to high-throughput genotyping. , 2002, BioTechniques.

[10]  I. Bross Misclassification in 2 X 2 Tables , 1954 .

[11]  Stacey S Cherny,et al.  The impact of genotyping error on family-based analysis of quantitative traits , 2001, European Journal of Human Genetics.

[12]  S. P. Fodor,et al.  Large-scale genotyping of complex DNA , 2003, Nature Biotechnology.

[13]  Derek Gordon,et al.  Tradeoff Between No-Call Reduction in Genotyping Error Rate and Loss of Sample Size for Genetic Case/Control Association Studies , 2003, Pacific Symposium on Biocomputing.

[14]  Derek Gordon,et al.  True Pedigree Errors More Frequent Than Apparent Errors for Single Nucleotide Polymorphisms , 1999, Human Heredity.

[15]  M. Xiong,et al.  The effect that genotyping errors have on the robustness of common linkage-disequilibrium measures. , 2001, American journal of human genetics.

[16]  Stephen J Finch,et al.  What SNP genotyping errors are most costly for genetic association studies? , 2004, Genetic epidemiology.

[17]  Jing Huang,et al.  Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high-density oligonucleotide array. , 2004, Genome research.

[18]  Jurg Ott,et al.  Assessment and management of single nucleotide polymorphism genotype errors in genetic association analysis. , 2000 .

[19]  Tianhua Niu,et al.  Maternal cigarette smoking, metabolic gene polymorphism, and infant birth weight. , 2002, JAMA.

[20]  K H Buetow,et al.  Influence of aberrant observations on high-resolution linkage analysis outcomes. , 1991, American journal of human genetics.

[21]  T P Speed,et al.  The effects of genotyping errors and interference on estimation of genetic distance. , 1997, Human heredity.

[22]  P. Holmans,et al.  Allowing for Genotyping Error in Analysis of Unmatched Case‐Control Studies , 2003, Annals of human genetics.

[23]  Jeanette C Papp,et al.  Detection and integration of genotyping errors in statistical genetics. , 2002, American journal of human genetics.