Sibship reconstruction from genetic data with typing errors.

Likelihood methods have been developed to partition individuals in a sample into full-sib and half-sib families using genetic marker data without parental information. They invariably make the critical assumption that marker data are free of genotyping errors and mutations and are thus completely reliable in inferring sibships. Unfortunately, however, this assumption is rarely tenable for virtually all kinds of genetic markers in practical use and, if violated, can severely bias sibship estimates as shown by simulations in this article. I propose a new likelihood method with simple and robust models of typing error incorporated into it. Simulations show that the new method can be used to infer full- and half-sibships accurately from marker data with a high error rate and to identify typing errors at each locus in each reconstructed sib family. The new method also improves previous ones by adopting a fresh iterative procedure for updating allele frequencies with reconstructed sibships taken into account, by allowing for the use of parental information, and by using efficient algorithms for calculating the likelihood function and searching for the maximum-likelihood configuration. It is tested extensively on simulated data with a varying number of marker loci, different rates of typing errors, and various sample sizes and family structures and applied to two empirical data sets to demonstrate its usefulness.

[1]  Davis,et al.  Detection of multiple paternity in the Kemp's ridley sea turtle with limited sampling , 1999 .

[2]  Jonathan M. Wright,et al.  DNA fingerprint based analysis of paternal and maternal effects on offspring growth and survival in communally reared rainbow trout , 1995 .

[3]  A. Kremer,et al.  Comparison of microsatellites and amplified fragment length polymorphism markers for parentage analysis , 2000, Molecular ecology.

[4]  Michael S. Blouin,et al.  DNA-based methods for pedigree reconstruction and kinship analysis in natural populations , 2003 .

[5]  Ellen M Wijsman,et al.  Relationship inference from trios of individuals, in the presence of typing error. , 2002, American journal of human genetics.

[6]  C. Chevalet,et al.  Error tolerant parent identification from a finite set of individuals , 1997 .

[7]  M P Epstein,et al.  Improved inference of relationship for pairs of individuals. , 2000, American journal of human genetics.

[8]  C Boesch,et al.  Microsatellite scoring errors associated with noninvasive genotyping based on nuclear DNA amplified from shed hair , 1997, Molecular ecology.

[9]  A. Bourke,et al.  Genetic analysis of spatial foraging patterns and resource sharing in bumble bee pollinators , 2003, Molecular ecology.

[10]  T. C. Marshall,et al.  Statistical confidence for likelihood‐based paternity inference in natural populations , 1998, Molecular ecology.

[11]  J. Cheverud Genetics and analysis of quantitative traits , 1999 .

[12]  R L Hammond,et al.  Mating frequency and mating system of the polygynous ant, Leptothorax acervorum , 2001, Molecular ecology.

[13]  A. Clark,et al.  INFERENCE OF SPERM COMPETITION FROM BROODS OF FIELD‐CAUGHT DROSOPHILA , 1998, Evolution; international journal of organic evolution.

[14]  A. Jones,et al.  Methods of parentage analysis in natural populations , 2003, Molecular ecology.

[15]  E A Thompson,et al.  The estimation of pairwise relationships , 1975, Annals of human genetics.

[16]  V. Loeschcke,et al.  Conservation Genetics , 2019, Handbook of Statistical Genomics.