Haplotype Frequency Estimation in the Presence of Genotyping Errors

Several statistical methods have been proposed to estimate haplotype frequencies from unrelated individuals or from families. These estimates can yield insights into population genetics and into associations between candidate regions and diseases of interest. A limitation shared by the existing methods is their implicit assumption that there are no genotyping errors, although such errors are unavoidable in practice. Numerous methods have been developed to incorporate genotyping errors in genetic studies, but none to date has addressed haplotype inference in the presence of genotyping errors. In this article, we develop statistical methods for haplotype inference that incorporate genotyping errors. We describe how our methods can be applied to unrelated individuals as well as to nuclear families. Our simulation results show that the proposed methods perform well in the presence of genotyping errors.
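To make the idea of haplotype inference under genotyping error concrete, the following is a minimal sketch and not the method developed in the article. It assumes unrelated individuals, two biallelic SNPs, Hardy-Weinberg equilibrium, and a known error rate `eps` under which each SNP genotype is independently misread as either of the other two genotypes with probability `eps / 2`. The names `emission` and `em_haplotype_freqs` are hypothetical and chosen only for illustration. An EM iteration treats the true haplotype pair of each individual as missing data and weights every candidate pair by how well it explains the observed, possibly erroneous, genotype.

```python
from itertools import product

# The four possible haplotypes over two biallelic SNPs (alleles coded 0/1).
HAPLOTYPES = [(0, 0), (0, 1), (1, 0), (1, 1)]


def emission(obs, true, eps):
    """P(observed genotype | true genotype) under the assumed error model.

    Each SNP genotype (count of allele 1: 0, 1 or 2) is read correctly with
    probability 1 - eps and misread as either other value with eps / 2.
    """
    p = 1.0
    for o, t in zip(obs, true):
        p *= (1.0 - eps) if o == t else eps / 2.0
    return p


def em_haplotype_freqs(genotypes, eps=0.01, n_iter=200):
    """Estimate haplotype frequencies by EM, allowing for genotyping error.

    genotypes: list of (g1, g2) tuples, each g in {0, 1, 2}.
    eps: assumed known per-SNP genotyping error rate (an illustrative choice).
    """
    freqs = [0.25] * 4  # start from uniform haplotype frequencies
    for _ in range(n_iter):
        counts = [0.0] * 4
        for obs in genotypes:
            # E-step: posterior weight of every ordered haplotype pair.
            weights = {}
            for i, j in product(range(4), repeat=2):
                true = tuple(a + b for a, b in zip(HAPLOTYPES[i], HAPLOTYPES[j]))
                weights[(i, j)] = freqs[i] * freqs[j] * emission(obs, true, eps)
            total = sum(weights.values())
            for (i, j), w in weights.items():
                counts[i] += w / total
                counts[j] += w / total
        # M-step: expected haplotype counts divided by 2n chromosomes.
        freqs = [c / (2 * len(genotypes)) for c in counts]
    return freqs


if __name__ == "__main__":
    sample = [(0, 0)] * 6 + [(1, 1)] * 3 + [(2, 2)] * 1
    print(em_haplotype_freqs(sample, eps=0.01))
```

Setting `eps=0` reduces the sketch to an EM estimator that trusts every observed genotype, which is the implicit assumption the abstract attributes to existing methods. Extending the same weighting to nuclear families would additionally require restricting haplotype pairs to those consistent with Mendelian transmission, which this sketch does not attempt.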
