Multipoint Estimation of Identity-by-Descent Probabilities at Arbitrary Positions among Marker Loci on General Pedigrees

Objectives: To describe, implement, and test an efficient algorithm to obtain multipoint identity-by-descent (IBD) probabilities at arbitrary positions among marker loci for general pedigrees. Unlike existing programs, our algorithm can analyze data sets with large numbers of people and markers. The algorithm has been implemented in the SimWalk2 computer package. Methods: Using a rigorous testing regimen containing five pedigrees of various sizes with realistic marker data, we compared several widely used IBD computation programs: Allegro, Aspex, GeneHunter, MapMaker/Sibs, Mendel, Sage, SimWalk2, and Solar. Results: The testing revealed a few discrepancies, particularly on consanguineous pedigrees, but overall excellent results in the deterministic multipoint packages. SimWalk2 was also found to be in good agreement with the deterministic multipoint programs, usually matching to two decimal places the kinship coefficient that ranges from 0 to 1. However, the packages based on single-point IBD estimation, while consistent with each other, often showed poor results, disagreeing with the multipoint kinship results by as much as 0.5. Conclusions: Our testing has clearly shown that multipoint IBD estimation is much better than single-point estimation. In addition, our testing has validated our algorithm for estimating IBD probabilities at arbitrary positions on general pedigrees.

[1]  K Lange,et al.  Descent graphs in pedigree analysis: applications to haplotyping, location scores, and marker-sharing statistics. , 1996, American journal of human genetics.

[2]  S. Heath Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. , 1997, American journal of human genetics.

[3]  M. Boehnke,et al.  Sample-size guidelines for linkage analysis of a dominant locus for a quantitative trait by the method of lod scores. , 1990, American journal of human genetics.

[4]  E A Thompson,et al.  Monte carlo markov chain methods for genome screening , 1999, Genetic epidemiology.

[5]  L Kruglyak,et al.  Exact multipoint quantitative-trait linkage analysis in pedigrees by variance components. , 2000, American journal of human genetics.

[6]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[7]  M. R. Young,et al.  Comparison of sequential and fixed-structure sampling of pedigrees in complex segregation analysis of a quantitative trait. , 1988, American journal of human genetics.

[8]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[9]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[10]  Daniel F. Gudbjartsson,et al.  Allegro, a new computer program for multipoint linkage analysis , 2000, Nature genetics.

[11]  Elizabeth A. Thompson,et al.  MCMC Estimation of Multi‐locus Genome Sharing and Multipoint Gene Location Scores , 2000 .

[12]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[13]  M. Boehnke,et al.  Selecting pedigrees for linkage analysis of a quantitative trait: the expected number of informative meioses. , 1990, American journal of human genetics.

[14]  C. W. Cotterman,et al.  A calculus for statistico-genetics , 1940 .

[15]  E. Thompson Monte Carlo Likelihood in Genetic Mapping , 1994 .

[16]  M. Boehnke,et al.  Identifying pedigrees segregating at a major locus for a quantitative trait: an efficient strategy for linkage analysis. , 1989, American journal of human genetics.

[17]  K. Lange,et al.  Programs for pedigree analysis: Mendel, Fisher, and dGene , 1988, Genetic epidemiology.

[18]  D E Weeks,et al.  Consanguinity and relative-pair methods for linkage analysis. , 1998, American journal of human genetics.

[19]  R C Elston,et al.  A faster and more general hidden Markov model algorithm for multipoint likelihood calculations. , 1997, Human heredity.

[20]  E. Génin,et al.  Consanguinity and the sib-pair method: an approach using identity by descent between and within individuals. , 1996, American journal of human genetics.

[21]  Eric S. Lander,et al.  Faster Multipoint Linkage Analysis Using Fourier Transforms , 1998, J. Comput. Biol..

[22]  M Speer,et al.  Chromosome‐based method for rapid computer simulation in human genetic linkage analysis , 1993, Genetic epidemiology.

[23]  E Sobel,et al.  Metropolis sampling in pedigree analysis , 1993, Statistical methods in medical research.

[24]  Deborah Charlesworth,et al.  The genetic structure of populations , 1976 .

[25]  A. Jacquard The Genetic Structure of Populations , 1974 .

[26]  C Cannings On the probabilities of identity states in permutable populations. , 1998, American journal of human genetics.

[27]  L. Almasy,et al.  Multipoint quantitative-trait linkage analysis in general pedigrees. , 1998, American journal of human genetics.

[28]  Freda Kemp,et al.  Mathematical and Statistical Methods for Genetic Analysis , 2003 .

[29]  E. Lander,et al.  Complete multipoint sib-pair analysis of qualitative and quantitative traits. , 1995, American journal of human genetics.