Identification of distant family relationships

MOTIVATION Family relationships can be estimated from DNA marker data. Applications arise in a large number of areas including evolution and conservation research, genealogical research in human, plant and animal populations, forensic problems and genetic mapping via linkage and association analyses. Traditionally, likelihood-based approaches to relationship estimation have used unlinked genetic markers. Due to the fact that some relationships cannot be distinguished from data at unlinked markers, and given the limited number of such markers available, there are considerable constraints on the type of identification problem that can be satisfactorily addressed with such approaches. The aim of this article is to explore the potential of linked autosomal single nucleotide polymorphism markers in this context. Throughout, we will view the problem of relationship estimation as one of pedigree identification rather than identity-by-descent, and thus focus on applications where determination of the exact relationship is important. RESULTS We show that the increase in information obtained by exploiting large sets of linked markers substantially increases the number of problems that can be solved. Results are presented based on simulations as well as on real data. AVAILABILITY The R library FEST is freely available from http://folk.uio.no/thoree/FEST.

[1]  E A Thompson,et al.  The IBD process along four chromosomes. , 2008, Theoretical population biology.

[2]  Ellen M Wijsman,et al.  Multipoint linkage analysis with many multiallelic or dense diallelic markers: Markov chain-Monte Carlo provides practical approaches for genome scans on general pedigrees. , 2006, American journal of human genetics.

[3]  N J Cox,et al.  The importance of genealogy in determining genetic associations with complex traits. , 2001, American journal of human genetics.

[4]  Lon R. Cardon,et al.  GRR: graphical representation of relationship errors , 2001, Bioinform..

[5]  G. Abecasis,et al.  Handling marker-marker linkage disequilibrium: pedigree analysis with clustered markers. , 2005, American journal of human genetics.

[6]  Thore Egeland,et al.  On identification problems requiring linked autosomal markers. , 2008, Forensic science international. Genetics.

[7]  Hans-Jürgen Bandelt,et al.  Extended guidelines for mtDNA typing of population data in forensic science. , 2007, Forensic science international. Genetics.

[8]  Thore Egeland,et al.  Essen-Möller and Identification Based on DNA , 2006 .

[9]  N A Sheehan,et al.  Structured Incorporation of Prior Information in Relationship Identification Problems , 2007, Annals of human genetics.

[10]  Gonçalo R. Abecasis,et al.  PEDSTATS: descriptive statistics, graphics and quality assessment for gene mapping data , 2005, Bioinform..

[11]  E A Thompson,et al.  The estimation of pairwise relationships , 1975, Annals of human genetics.

[12]  N A Sheehan,et al.  Adjusting for Founder Relatedness in a Linkage Analysis Using Prior Information , 2007, Human Heredity.

[13]  David J Balding,et al.  Discrimination of half-siblings when maternal genotypes are known. , 2006, Forensic science international.

[14]  Michael Krawczak,et al.  Kinship testing with X-chromosomal markers: mathematical and statistical issues. , 2007, Forensic science international. Genetics.

[15]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[16]  Jeanette C Papp,et al.  Detection and integration of genotyping errors in statistical genetics. , 2002, American journal of human genetics.

[17]  Thore Egeland,et al.  Genome-wide Linkage Analysis with Clustered SNP Markers , 2009, Journal of biomolecular screening.

[18]  E. A. Thompson,et al.  Genetic linkage in the estimation of pairwise relationship , 1998, Theoretical and Applied Genetics.

[19]  Elizabeth A Thompson,et al.  Impact of parental relationships in maximum lod score affected sib‐pair method , 2002, Genetic epidemiology.

[20]  Amanda B. Hepler,et al.  Genetic relatedness analysis: modern data and new challenges , 2006, Nature Reviews Genetics.

[21]  Sascha Willuweit,et al.  Y chromosome haplotype reference database (YHRD): update. , 2007, Forensic science international. Genetics.

[22]  K. P. Donnelly,et al.  The probability that related individuals share some section of genome identical by descent. , 1983, Theoretical population biology.

[23]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[24]  T Egeland,et al.  Beyond traditional paternity and identification cases. Selecting the most probable pedigree. , 2000, Forensic science international.

[25]  Ellen M Wijsman,et al.  Relationship inference from trios of individuals, in the presence of typing error. , 2002, American journal of human genetics.

[26]  Elizabeth A. Thompson,et al.  Inference of genealogical structure , 1976 .

[27]  M P Epstein,et al.  Improved inference of relationship for pairs of individuals. , 2000, American journal of human genetics.

[28]  Sanjay Shete,et al.  Ignoring linkage disequilibrium among tightly linked markers induces false-positive evidence of linkage for affected sib pair analysis. , 2004, American journal of human genetics.

[29]  P. Donnelly,et al.  Jefferson fathered slave's last child , 1998, Nature.

[30]  Michael S. Blouin,et al.  DNA-based methods for pedigree reconstruction and kinship analysis in natural populations , 2003 .

[31]  D. Balding Weight-of-Evidence for Forensic DNA Profiles , 2005 .