A renewal theory approach to IBD sharing.

A long genomic segment inherited by a pair of individuals from a single, recent common ancestor is said to be identical-by-descent (IBD). Shared IBD segments have numerous applications in genetics, from demographic inference to phasing, imputation, pedigree reconstruction, and disease mapping. Here, we provide a theoretical analysis of IBD sharing under Markovian approximations of the coalescent with recombination. We describe a general framework for the IBD process along the chromosome under the Markovian models (SMC/SMC'), as well as introduce and justify a new model, which we term the renewal approximation, under which lengths of successive segments are independent. Then, considering the infinite-chromosome limit of the IBD process, we recover previous results (for SMC) and derive new results (for SMC') for the mean number of shared segments longer than a cutoff and the fraction of the chromosome found in such segments. We then use renewal theory to derive an expression (in Laplace space) for the distribution of the number of shared segments and demonstrate implications for demographic inference. We also compute (again, in Laplace space) the distribution of the fraction of the chromosome in shared segments, from which we obtain explicit expressions for the first two moments. Finally, we generalize all results to populations with a variable effective size.

[1]  Mark Abney,et al.  Using identity by descent estimation with dense genotype data to detect positive selection , 2012, European Journal of Human Genetics.

[2]  M. D. Brown,et al.  Inferring Coancestry in Population Samples in the Presence of Linkage Disequilibrium , 2012, Genetics.

[3]  J. Kingman A FIRST COURSE IN STOCHASTIC PROCESSES , 1967 .

[4]  J. M. Luck,et al.  Statistics of the Occupation Time of Renewal Processes , 2000, cond-mat/0010428.

[5]  R. Durbin,et al.  Inference of human population history from individual whole-genome sequences. , 2011, Nature.

[6]  I. Pe’er,et al.  Length distributions of identity by descent reveal fine-scale demographic history. , 2012, American journal of human genetics.

[7]  Samuel Karlin,et al.  A First Course on Stochastic Processes , 1968 .

[8]  S. Sampling theory for neutral alleles in a varying environment , 2003 .

[9]  Jinchuan Xing,et al.  Mobile elements reveal small population size in the ancient ancestors of Homo sapiens , 2010, Proceedings of the National Academy of Sciences.

[10]  Alexander Gusev,et al.  DASH: a method for identical-by-descent haplotype mapping uncovers association with recent variation. , 2011, American journal of human genetics.

[11]  Asger Hobolth,et al.  Markovian approximation to the finite loci coalescent with recombination along multiple sequences. , 2014, Theoretical population biology.

[12]  Vladimir Vacic,et al.  The Variance of Identity-by-Descent Sharing in the Wright–Fisher Model , 2012, Genetics.

[13]  Anders Albrechtsen,et al.  A method for detecting IBD regions simultaneously in multiple individuals--with applications to disease genetics. , 2011, Genome research.

[14]  B. Berger,et al.  Reconstructing Roma History from Genome-Wide Data , 2012, PloS one.

[15]  George H. Weiss,et al.  A First Course in Stochastic Processes, 2nd sd. (Samuel Karlin and Howard M. Taylor) , 1977 .

[16]  Christopher R. Gignoux,et al.  Gene flow from North Africa contributes to differential human genetic diversity in southern Europe , 2013, Proceedings of the National Academy of Sciences.

[17]  R. J. Harrison,et al.  A General Method for Calculating Likelihoods Under the Coalescent Process , 2011, Genetics.

[18]  Itsik Pe'er,et al.  Abraham's children in the genome era: major Jewish diaspora populations comprise distinct genetic clusters with shared Middle Eastern Ancestry. , 2010, American journal of human genetics.

[19]  B. Browning,et al.  A fast, powerful method for detecting identity by descent. , 2011, American journal of human genetics.

[20]  Lubomir Brancik,et al.  Numerical Inverse Laplace Transforms for Electrical Engineering Simulation , 2011 .

[21]  E. Thompson Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations , 2013, Genetics.

[22]  Jinchuan Xing,et al.  Maximum-likelihood estimation of recent shared ancestry (ERSA). , 2011, Genome research.

[23]  Dan He,et al.  IBD-Groupon: an efficient method for detecting group-wise identity-by-descent regions simultaneously in multiple individuals based on pairwise IBD relationships , 2013, Bioinform..

[24]  John Wakeley,et al.  Gene Genealogies Within a Fixed Pedigree, and the Robustness of Kingman’s Coalescent , 2012, Genetics.

[25]  Claudia Moreau,et al.  Genome-wide patterns of identity-by-descent sharing in the French Canadian founder population , 2013, European Journal of Human Genetics.

[26]  A. N. Stokes,et al.  An Improved Method for Numerical Inversion of Laplace Transforms , 1982 .

[27]  Rui Lin,et al.  Identity-by-Descent Mapping to Detect Rare Variants Conferring Susceptibility to Multiple Sclerosis , 2013, PloS one.

[28]  Itsik Pe'er,et al.  Cryptic Distant Relatives Are Common in Both Isolated and Cosmopolitan Genetic Samples , 2012, PloS one.

[29]  S. Warren,et al.  Signatures of founder effects, admixture, and selection in the Ashkenazi Jewish population , 2010, Proceedings of the National Academy of Sciences.

[30]  R. Durbin,et al.  Identity-by-Descent-Based Phasing and Imputation in Founder Populations Using Graphical Models , 2011, Genetic epidemiology.

[31]  P. Stam,et al.  The distribution of the fraction of the genome identical by descent in finite random mating populations , 1980 .

[32]  E A Thompson,et al.  A model for the length of tracts of identity by descent in finite random mating populations. , 2003, Theoretical population biology.

[33]  Pall I. Olason,et al.  Detection of sharing by descent, long-range phasing and haplotype imputation , 2008, Nature Genetics.

[34]  R. Hudson Properties of a neutral allele model with intragenic recombination. , 1983, Theoretical population biology.

[35]  E. Thompson,et al.  Bayesian Inference of Local Trees Along Chromosomes by the Sequential Markov Coalescent , 2014, Journal of Molecular Evolution.

[36]  R. Griffiths,et al.  An ancestral recombination graph , 1997 .

[37]  R. Nielsen,et al.  Inferring Demographic History from a Spectrum of Shared Haplotype Lengths , 2013, PLoS genetics.

[38]  Larry Wasserman,et al.  All of Statistics , 2004 .

[39]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[40]  Sharon R. Browning,et al.  Detecting Rare Variant Associations by Identity-by-Descent Mapping in Case-Control Studies , 2012, Genetics.

[41]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[42]  R. Fisher,et al.  A fuller theory of “Junctions” in inbreeding , 1954, Heredity.

[43]  B. Browning,et al.  Improving the Accuracy and Efficiency of Identity-by-Descent Detection in Population Data , 2013, Genetics.

[44]  Itsik Pe'er,et al.  Inference of historical migration rates via haplotype sharing , 2013, Bioinform..

[45]  P. Marjoram,et al.  Ancestral Inference from Samples of DNA Sequences with Recombination , 1996, J. Comput. Biol..

[46]  Brian L Browning,et al.  Identity by descent between distant relatives: detection and applications. , 2012, Annual review of genetics.

[47]  Peter L. Ralph,et al.  The Geography of Recent Genetic Ancestry across Europe , 2012, PLoS biology.

[48]  Paul Marjoram,et al.  Fast "coalescent" simulation , 2006, BMC Genetics.

[49]  G. McVean,et al.  Approximating the coalescent with recombination , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[50]  J. Hein,et al.  Recombination as a point process along sequences. , 1999, Theoretical population biology.