Evaluation of standard error and confidence interval of estimated multilocus genotype probabilities, and their implications in DNA forensics.

Multilocus genotype probabilities, estimated using the assumption of independent association of alleles within and across loci, are subject to sampling fluctuation, since allele frequencies used in such computations are derived from samples drawn from a population. We derive exact sampling variances of estimated genotype probabilities and provide simple approximation of sampling variances. Computer simulations conducted using real DNA typing data indicate that, while the sampling distribution of estimated genotype probabilities is not symmetric around the point estimate, the confidence interval of estimated (single-locus or multilocus) genotype probabilities can be obtained from the sampling of a logarithmic transformation of the estimated values. This, in turn, allows an examination of heterogeneity of estimators derived from data on different reference populations. Applications of this theory to DNA typing data at VNTR loci suggest that use of different reference population data may yield significantly different estimates. However, significant differences generally occur with rare (less than 1 in 40,000) genotype probabilities. Conservative estimates of five-locus DNA profile probabilities are always less than 1 in 1 million in an individual from the United States, irrespective of the racial/ethnic origin.

[1]  S. Daiger,et al.  Polymorphisms at VNTR loci suggest homogeneity of the white population of Utah. , 1991, Human biology.

[2]  L. A. Goodman On the Exact Variance of Products , 1960 .

[3]  K K Kidd,et al.  Drift, admixture, and selection in human evolution: a study with DNA polymorphisms. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[4]  J M Lalouel,et al.  Characterization of eight VNTR loci by agarose gel electrophoresis. , 1989, Genomics.

[5]  P. Harper Human Gene Mapping 11. The Eleventh International Workshop on Human Gene Mapping , 1993 .

[6]  K K Kidd,et al.  The utility of DNA typing in forensic work. , 1991, Science.

[7]  L Roberts,et al.  Fight erupts over DNA fingerprinting. , 1991, Science.

[8]  N. L. Johnson,et al.  Linear Statistical Inference and Its Applications , 1966 .

[9]  R. Chakraborty,et al.  Caucasian genes in American blacks: new data. , 1992, American journal of human genetics.

[10]  Calyampudi R. Rao,et al.  Linear Statistical Inference and Its Applications. , 1975 .

[11]  Y. Nakamura,et al.  Variable number of tandem repeat (VNTR) markers for human gene mapping. , 1987, Science.

[12]  B Devlin,et al.  On the probability of matching DNA fingerprints. , 1992, Science.

[13]  S. Daiger,et al.  Apparent heterozygote deficiencies observed in DNA typing data and their implications in forensic applications , 1992, Annals of human genetics.

[14]  B Budowle,et al.  Fixed-bin analysis for statistical evaluation of continuous distributions of allelic data from VNTR loci, for use in forensic comparisons. , 1991, American journal of human genetics.

[15]  M Clyne,et al.  Human population genetic studies of five hypervariable DNA loci. , 1989, American journal of human genetics.

[16]  N E Morton,et al.  Genetic structure of forensic populations. , 1992, American journal of human genetics.

[17]  R. Chakraborty,et al.  Sample size requirements for addressing the population genetic issues of forensic use of DNA typing. , 1992, Human biology.

[18]  Ching Chun Li First Course in Population Genetics , 1976 .

[19]  B S Weir,et al.  Independence of VNTR alleles defined as fixed bins. , 1992, Genetics.

[20]  Ian W. Evett,et al.  Statistical Inference in Crime Investigations Using Deoxyribonucleic Acid Profiling , 1992 .

[21]  F. Widmann The Distribution of the Human Blood Groups and Other Polymorphisms , 1977 .