A method for detecting IBD regions simultaneously in multiple individuals--with applications to disease genetics.

All individuals in a finite population are related if traced back long enough and will, therefore, share regions of their genomes identical by descent (IBD). Detection of such regions has several important applications-from answering questions about human evolution to locating regions in the human genome containing disease-causing variants. However, IBD regions can be difficult to detect, especially in the common case where no pedigree information is available. In particular, all existing non-pedigree based methods can only infer IBD sharing between two individuals. Here, we present a new Markov Chain Monte Carlo method for detection of IBD regions, which does not rely on any pedigree information. It is based on a probabilistic model applicable to unphased SNP data. It can take inbreeding, allele frequencies, genotyping errors, and genomic distances into account. And most importantly, it can simultaneously infer IBD sharing among multiple individuals. Through simulations, we show that the simultaneous modeling of multiple individuals makes the method more powerful and accurate than several other non-pedigree based methods. We illustrate the potential of the method by applying it to data from individuals with breast and/or ovarian cancer, and show that a known disease-causing mutation can be mapped to a 2.2-Mb region using SNP data from only five seemingly unrelated affected individuals. This would not be possible using classical linkage mapping or association mapping.

[1]  T. Meerman,et al.  Genomic sharing surrounding alleles identical by descent : Effects of genetic drift and population growth , 1997 .

[2]  L Kruglyak,et al.  Parametric and nonparametric linkage analysis: a unified multipoint approach. , 1996, American journal of human genetics.

[3]  A. Albrechtsen,et al.  Identification of a novel BRCA1 nucleotide 4803delCC/c.4684delCC mutation and a nucleotide 249T>A/c.130T>A (p.Cys44Ser) mutation in two Greenlandic Inuit families: implications for genetic screening of Greenlandic Inuit families with high risk for breast and/or ovarian cancer , 2010, Breast Cancer Research and Treatment.

[4]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[5]  G. T. te Meerman,et al.  Genomic sharing surrounding alleles identical by descent: Effects of genetic drift and population growth , 1997, Genetic epidemiology.

[6]  Alun Thomas,et al.  Assessment of SNP streak statistics using gene drop simulation with linkage disequilibrium , 2010, Genetic epidemiology.

[7]  Brian L. Browning,et al.  High-resolution detection of identity by descent in unrelated individuals. , 2010, American journal of human genetics.

[8]  E A Thompson,et al.  The IBD process along four chromosomes. , 2008, Theoretical population biology.

[9]  G. T. te Meerman,et al.  Haplotype sharing analysis in affected individuals from nuclear families with at least one affected offspring , 1997, Genetic epidemiology.

[10]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[11]  L. Sandkuijl,et al.  Perspectives of identity by descent (IBD) mapping in founder populations , 1995, Clinical and experimental allergy : journal of the British Society for Allergy and Clinical Immunology.

[12]  Sharon R Browning,et al.  Estimation of Pairwise Identity by Descent From Dense Genetic Marker Data in a Population Sample of Haplotypes , 2008, Genetics.

[13]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[14]  Nelson B. Freimer,et al.  Genome screening by searching for shared segments: mapping a gene for benign recurrent intrahepatic cholestasis , 1994, Nature Genetics.

[15]  E. Lander,et al.  Construction of multilocus genetic linkage maps in humans. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Kermit Ritland,et al.  Estimators for pairwise relatedness and individual inbreeding coefficients , 1996 .

[17]  D. Queller,et al.  ESTIMATING RELATEDNESS USING GENETIC MARKERS , 1989, Evolution; international journal of organic evolution.

[18]  I. Evett,et al.  Interpreting DNA Evidence: Statistical Genetics for Forensic Scientists , 1998 .

[19]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[20]  Bernard Prum,et al.  Estimation of the inbreeding coefficient through use of genomic data. , 2003, American journal of human genetics.

[21]  Anders Albrechtsen,et al.  Natural Selection and the Distribution of Identity-by-Descent in the Human Genome , 2010, Genetics.

[22]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[23]  Anders Albrechtsen,et al.  Relatedness mapping and tracts of relatedness for genome‐wide data in the presence of linkage disequilibrium , 2009, Genetic epidemiology.

[24]  A. Albrechtsen,et al.  A common Greenlandic Inuit BRCA1 RING domain founder mutation , 2009, Breast Cancer Research and Treatment.

[25]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[26]  E A Thompson,et al.  The estimation of pairwise relationships , 1975, Annals of human genetics.

[27]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[28]  BRLMM : an Improved Genotype Calling Method for the GeneChip ® Human Mapping 500 K Array Set , 2006 .

[29]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[30]  G. Malécot,et al.  Les mathématiques de l'hérédité , 1948 .

[31]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[32]  Gregory Leibon,et al.  A SNP Streak Model for the Identification of Genetic Regions Identical-by-descent , 2008, Statistical applications in genetics and molecular biology.

[33]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[34]  J. Ott Estimation of the recombination fraction in human pedigrees: efficient computation of the likelihood for human linkage studies. , 1974, American journal of human genetics.

[35]  Probability functions on complex pedigrees , 1978 .

[36]  M. Lynch,et al.  Estimation of pairwise relatedness with molecular markers. , 1999, Genetics.

[37]  J. Hilden G E N EX ‐ An algebraic approach to pedigree probability calculus , 1970 .

[38]  K Allen-Brady,et al.  Shared Genomic Segment Analysis. Mapping Disease Predisposition Genes in Extended Pedigrees Using SNP Genotype Assays , 2008, Annals of human genetics.