Error detection for genetic data, using likelihood methods.

As genetic maps become denser, the effect of laboratory typing errors becomes more serious. We review a general method for detecting errors in pedigree genotyping data that is a variant of the likelihood-ratio test statistic. It pinpoints individuals and loci with relatively unlikely genotypes. Power and significance studies using Monte Carlo methods are shown by using simulated data with pedigree structures similar to the CEPH pedigrees and a larger experimental pedigree used in the study of idiopathic dilated cardiomyopathy (DCM). The studies show the index detects errors for small values of theta with high power and an acceptable false positive rate. The method was also used to check for errors in DCM laboratory pedigree data and to estimate the error rate in CEPH-chromosome 6 data. The errors flagged by our method in the DCM pedigree were confirmed by the laboratory. The results are consistent with estimated false-positive and false-negative rates obtained using simulation.

[1]  J. Ott,et al.  Molecular and statistical approaches to the detection and correction of errors in genotype databases. , 1993, American journal of human genetics.

[2]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[3]  H. Cann,et al.  Report of the Second International Workshop on Human Chromosome 6. , 1994, Genomics.

[4]  M. Ehm,et al.  ERROR DETECTION IN GENETIC LINKAGE DATA FOR HUMAN PEDIGREES USING LIKELIHOOD RATIO METHODS , 1995 .

[5]  J. Ott,et al.  Strategies for multilocus linkage analysis in humans. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[6]  J. Ott Computer-simulation methods in human linkage analysis. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[7]  K H Buetow,et al.  Influence of aberrant observations on high-resolution linkage analysis outcomes. , 1991, American journal of human genetics.

[8]  M. Spence,et al.  Analysis of human genetic linkage , 1986 .

[9]  G. Lathrop,et al.  Easy calculations of lod scores and genetic risks on small computers. , 1984, American journal of human genetics.

[10]  E S Lander,et al.  Systematic detection of errors in genetic linkage data. , 1992, Genomics.

[11]  J. Ott,et al.  Detecting marker inconsistencies in human gene mapping. , 1993, Human heredity.

[12]  M. Kendall,et al.  Kendall's advanced theory of statistics , 1995 .

[13]  A A Schäffer,et al.  Avoiding recomputation in linkage analysis. , 1994, Human heredity.

[14]  A A Schäffer,et al.  Faster sequential genetic linkage computations. , 1993, American journal of human genetics.

[15]  R L White,et al.  The CEPH consortium linkage map of human chromosome 1. , 1991, Genomics.

[16]  G. Lathrop,et al.  Construction of human linkage maps: Likelihood calculations for multilocus linkage analysis , 1986, Genetic epidemiology.

[17]  G M Lathrop,et al.  Evaluating pedigree data. I. The estimation of pedigree error in the presence of marker mistyping. , 1983, American journal of human genetics.

[18]  N. Morton,et al.  Standard maps of chromosome 10 , 1990, Annals of human genetics.

[19]  B. Keats,et al.  Report of the committee on linkage and gene order. , 1990, Cytogenetics and cell genetics.