Recursive Algorithms for Modeling Genomic Ancestral Origins in a Fixed Pedigree

The study of gene flow in pedigrees is of strong interest for the development of quantitative trait loci (QTL) mapping methods in multiparental populations. We developed a Markovian framework for modeling ancestral origins along two homologous chromosomes within individuals in fixed pedigrees. A highly beneficial property of our method is that the size of state space depends linearly or quadratically on the number of pedigree founders, whereas this increases exponentially with pedigree size in alternative methods. To calculate the parameter values of the Markov process, we describe two novel recursive algorithms that differ with respect to the pedigree founders being assumed to be exchangeable or not. Our algorithms apply equally to autosomes and sex chromosomes, another desirable feature of our approach. We tested the accuracy of the algorithms by a million simulations on a pedigree. We demonstrated two applications of the recursive algorithms in multiparental populations: design a breeding scheme for maximizing the overall density of recombination breakpoints and thus the QTL mapping resolution, and incorporate pedigree information into hidden Markov models in ancestral inference from genotypic data; the conditional probabilities and the recombination breakpoint data resulting from ancestral inference can facilitate follow-up QTL mapping. The results show that the generality of the recursive algorithms can greatly increase the application range of genetic analysis such as ancestral inference in multiparental populations.

[1]  F. V. van Eeuwijk,et al.  Accurate Genotype Imputation in Multiparental Populations from Low-Coverage Sequence , 2018, Genetics.

[2]  M. Koornneef,et al.  Fine mapping of a major QTL for awn length in barley using a multiparent mapping population , 2016, Theoretical and Applied Genetics.

[3]  Peter J. Bradbury,et al.  Construction of high-quality recombination maps with low-coverage genomic sequencing for joint linkage analysis in maize , 2015, BMC Biology.

[4]  M. Causse,et al.  Potential of a tomato MAGIC population to decipher the genetic control of quantitative traits and detect causal variants in the resequencing era. , 2015, Plant biotechnology journal.

[5]  L. A. García-Cortés A novel recursive algorithm for the calculation of the detailed identity coefficients , 2015, Genetics Selection Evolution.

[6]  Chaozhi Zheng,et al.  Modeling X-Linked Ancestral Origins in Multiparental Populations , 2015, G3: Genes, Genomes, Genetics.

[7]  B. Mathew,et al.  Multi-parent advanced generation inter-cross in barley: high-resolution quantitative trait locus mapping for flowering time as a proof of concept , 2015, Molecular Breeding.

[8]  James Cockram,et al.  An Eight-Parent Multiparent Advanced Generation Inter-Cross Population for Winter-Sown Wheat: Creation, Properties, and Validation , 2014, G3: Genes, Genomes, Genetics.

[9]  F. V. van Eeuwijk,et al.  A General Modeling Framework for Genome Ancestral Origins in Multiparental Populations , 2014, Genetics.

[10]  Shizhong Xu,et al.  Genetic Mapping and Genomic Selection Using Recombination Breakpoint Data , 2013, Genetics.

[11]  Andrew W George,et al.  A multiparent advanced generation inter-cross population for genetic analysis in wheat. , 2012, Plant biotechnology journal.

[12]  Karl W. Broman,et al.  Genotype Probabilities at Intermediate Generations in the Construction of Recombinant Inbred Lines , 2012, Genetics.

[13]  Lisa E. Gralinski,et al.  The Genome Architecture of the Collaborative Cross Mouse Genetic Reference Population , 2012, Genetics.

[14]  Leonard McMillan,et al.  Accelerating the Inbreeding of Multi-Parental Recombinant Inbred Lines Generated By Sibling Matings , 2012, G3: Genes | Genomes | Genetics.

[15]  Jin J Zhou,et al.  Quantitative Trait Loci Association Mapping by Imputation of Strain Origins in Multifounder Crosses , 2012, Genetics.

[16]  O. Martin,et al.  Distribution of Parental Genome Blocks in Recombinant Inbred Lines , 2011, Genetics.

[17]  M. Colomé-Tatché,et al.  Quantitative Epigenetics Through Epigenomic Perturbation of Isogenic Lines , 2011, Genetics.

[18]  R. Mott,et al.  Collaborative Cross mice and their power to map host susceptibility to Aspergillus fumigatus infection. , 2011, Genome research.

[19]  Qi Zhang,et al.  Efficient genome ancestry inference in complex pedigrees with inbreeding , 2010, Bioinform..

[20]  R. Mott,et al.  A Multiparent Advanced Generation Inter-Cross to Fine-Map Quantitative Traits in Arabidopsis thaliana , 2009, PLoS genetics.

[21]  K. Lange,et al.  Mixed Effects Models for Quantitative Trait Loci Mapping With Inbred Strains , 2008, Genetics.

[22]  L. Kruglyak,et al.  Breeding Designs for Recombinant Inbred Advanced Intercross Lines , 2008, Genetics.

[23]  W. G. Hill,et al.  Prediction of multi-locus inbreeding coefficients and relation to linkage disequilibrium in random mating populations. , 2007, Theoretical population biology.

[24]  F. Teuscher,et al.  Haplotype Probabilities for Multiple-Strain Recombinant Inbred Lines , 2007, Genetics.

[25]  J. Woolliams,et al.  Marker densities and the mapping of ancestral junctions. , 2005, Genetical research.

[26]  Nengjun Yi,et al.  The Collaborative Cross, a community resource for the genetic analysis of complex traits , 2004, Nature Genetics.

[27]  N. Barton,et al.  The distribution of surviving blocks of an ancestral genome. , 2003, Theoretical population biology.

[28]  E A Thompson,et al.  A model for the length of tracts of identity by descent in finite random mating populations. , 2003, Theoretical population biology.

[29]  V. Stefanov Distribution of genome shared identical by descent by two individuals in grandparent-type relationship. , 2000, Genetics.

[30]  E A Thompson,et al.  Distribution of genome shared IBD by half-sibs: approximation by the Poisson clumping heuristic. , 1996, Theoretical population biology.

[31]  E. Thompson A recursive algorithm for inferring gene origins , 1983, Annals of human genetics.

[32]  K. P. Donnelly,et al.  The probability that related individuals share some section of genome identical by descent. , 1983, Theoretical population biology.

[33]  G Karigl,et al.  A recursive algorithm for the calculation of identity coefficients , 1981, Annals of human genetics.

[34]  P. Stam,et al.  The distribution of the fraction of the genome identical by descent in finite random mating populations , 1980 .

[35]  J. Rostron On the computation of inbreeding coefficients , 1978, Annals of human genetics.

[36]  C. Denniston An extension of the probability approach to genetic relationships: one locus. , 1974, Theoretical population biology.

[37]  A. Jacquard The Genetic Structure of Populations , 1974 .

[38]  B. Weir,et al.  Descent measures for two loci with some applications. , 1973, Theoretical population biology.

[39]  R. Nadot,et al.  Apparentement et identite. Algorithme du Calcul des Coefficients D'Identite , 1973 .

[40]  B. Weir,et al.  Pedigree mating with two linked loci. , 1969, Genetics.

[41]  R. Fisher,et al.  A fuller theory of “Junctions” in inbreeding , 1954, Heredity.

[42]  J B Haldane,et al.  Inbreeding and Linkage. , 1931, Genetics.

[43]  J. Bennett Junctions in inbreeding , 2005, Genetica.

[44]  J. R. Patel,et al.  Genetic analysis in wheat. , 1990 .

[45]  E A Thompson,et al.  Two-locus and three-locus gene identity by descent in pedigrees. , 1988, IMA journal of mathematics applied in medicine and biology.

[46]  J. Bennett The Distribution of Heterogeneity Upon Inbreeding , 1954 .

[47]  R. Fisher The theory of inbreeding , 1949 .