Genetic evaluation with major genes and polygenic inheritance when some animals are not genotyped using gene content multiple-trait BLUP

BackgroundIn pedigreed populations with a major gene segregating for a quantitative trait, it is not clear how to use pedigree, genotype and phenotype information when some individuals are not genotyped. We propose to consider gene content at the major gene as a second trait correlated to the quantitative trait, in a gene content multiple-trait best linear unbiased prediction (GCMTBLUP) method.ResultsThe genetic covariance between the trait and gene content at the major gene is a function of the substitution effect of the gene. This genetic covariance can be written in a multiple-trait form that accommodates any pattern of missing values for either genotype or phenotype data. Effects of major gene alleles and the genetic covariance between genotype at the major gene and the phenotype can be estimated using standard EM-REML or Gibbs sampling. Prediction of breeding values with genotypes at the major gene can use multiple-trait BLUP software. Major genes with more than two alleles can be considered by including negative covariances between gene contents at each different allele. We simulated two scenarios: a selected and an unselected trait with heritabilities of 0.05 and 0.5, respectively. In both cases, the major gene explained half the genetic variation. Competing methods used imputed gene contents derived by the method of Gengler et al. or by iterative peeling. Imputed gene contents, in contrast to GCMTBLUP, do not consider information on the quantitative trait for genotype prediction. GCMTBLUP gave unbiased estimates of the gene effect, in contrast to the other methods, with less bias and better or equal accuracy of prediction. GCMTBLUP improved estimation of genotypes in non-genotyped individuals, in particular if these individuals had own phenotype records and the trait had a high heritability. Ignoring the major gene in genetic evaluation led to serious biases and decreased prediction accuracy.ConclusionsCGMTBLUP is the best linear predictor of additive genetic merit including pedigree, phenotype, and genotype information at major genes, since it considers missing genotypes. Simulations confirm that it is a simple, efficient and theoretically sound method for genetic evaluation of traits influenced by polygenic inheritance and one or several major genes.

[1]  R. L. Quaas,et al.  Multiple Trait Evaluation Using Relatives' Records , 1976 .

[2]  R. C. Elston,et al.  An efficient algorithm to compute the posterior genotypic distribution for every member of a pedigree without loops , 1993, Theoretical and Applied Genetics.

[3]  J. Weller,et al.  Incorporation of genotype effects into animal model evaluations when only a small fraction of the population has been genotyped. , 2009, Animal : an international journal of animal bioscience.

[4]  N. Sheehan,et al.  On a misconception about irreducibility of the single-site Gibbs sampler in a pedigree application. , 2002, Genetics.

[5]  R. Elston,et al.  A general model for the genetic analysis of pedigree data. , 1971, Human heredity.

[6]  Mehdi Sargolzaei,et al.  QMSim: a large-scale genome simulator for livestock , 2009, Bioinform..

[7]  L. Almasy,et al.  Multipoint quantitative-trait linkage analysis in general pedigrees. , 1998, American journal of human genetics.

[8]  Robert L. Wolpert,et al.  Statistical Inference , 2019, Encyclopedia of Social Network Analysis and Mining.

[9]  Estimation of effects of quantitative trait loci in large complex pedigrees. , 1997, Genetics.

[10]  J. A. Arendonk,et al.  Application of Gibbs sampling for inference in a mixed major gene-polygenic inheritance model in animal populations , 1995, Theoretical and Applied Genetics.

[11]  S. Heath Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. , 1997, American journal of human genetics.

[12]  Ignacy Misztal,et al.  BLUPF90 and related programs (BGF90) , 2002 .

[13]  R. Fernando,et al.  Deregressing estimated breeding values and weighting information for genomic regression analyses , 2009, Genetics Selection Evolution.

[14]  M. Calus,et al.  Prediction of haplotypes for ungenotyped animals and its effect on marker-assisted breeding value estimation , 2010, Genetics Selection Evolution.

[15]  K. Meyer,et al.  Estimating variances and covariances for multivariate animal models by restricted maximum likelihood , 1991, Genetics Selection Evolution.

[16]  P. VanRaden,et al.  Invited review: reliability of genomic predictions for North American Holstein bulls. , 2009, Journal of dairy science.

[17]  P M VanRaden,et al.  Derivation, calculation, and use of national animal model information. , 1991, Journal of dairy science.

[18]  I. D. Boer,et al.  Genetic evaluation methods for populations with dominance and inbreeding , 1993, Theoretical and Applied Genetics.

[19]  P. Martin,et al.  Effects of the FecL major gene in the Lacaune meat sheep population , 2014, Genetics Selection Evolution.

[20]  Michel Georges,et al.  Genetic and functional confirmation of the causality of the DGAT1 K232A quantitative trait nucleotide in affecting milk yield and composition. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Elizabeth A. Thompson,et al.  Statistical inference from genetic data on pedigrees , 2003 .

[22]  Zulma G Vitezica,et al.  Using genotype probabilities in survival analysis: a scrapie case , 2005, Genetics Selection Evolution.

[23]  L. D. Vleck,et al.  Restricted Maximum Likelihood estimates of variance components from multitrait sire models with large number of fixed effects , 1989 .

[24]  Laura J. Scott,et al.  Joint Analysis of Psychiatric Disorders Increases Accuracy of Risk Prediction for Schizophrenia, Bipolar Disorder, and Major Depressive Disorder , 2015, American journal of human genetics.

[25]  N Gengler,et al.  A simple method to approximate gene content in large pedigree populations: application to the myostatin gene in dual-purpose Belgian Blue cattle. , 2007, Animal : an international journal of animal bioscience.

[26]  I. Hoeschele,et al.  Genetic evaluation with data presenting evidence of mixed major gene and polygenic inheritance , 1988, Theoretical and Applied Genetics.

[27]  The effect of missing marker genotypes on the accuracy of gene-assisted breeding value estimation: a comparison of methods. , 2010, Animal : an international journal of animal bioscience.

[28]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[29]  R. Fernando,et al.  Genetic evaluation with autosomal and X-chromosomal inheritance , 1990, Theoretical and Applied Genetics.

[30]  Peter M Visscher,et al.  A note on the asymptotic distribution of likelihood ratio tests to test variance components. , 2006, Twin research and human genetics : the official journal of the International Society for Twin Studies.

[31]  M. Lund,et al.  Genomic prediction when some animals are not genotyped , 2010, Genetics Selection Evolution.

[32]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[33]  Rohan L Fernando,et al.  A class of Bayesian methods to combine large numbers of genotyped and non-genotyped animals for whole-genome analyses , 2014, Genetics Selection Evolution.

[34]  N Gengler,et al.  Accuracy of prediction of gene content in large animal populations and its use for candidate gene detection and genetic evaluation. , 2008, Journal of dairy science.

[35]  M Quinton,et al.  Estimation of effects of single genes on quantitative traits. , 1992, Journal of animal science.

[36]  Quality Control of Genotypes Using Heritability Estimates of Gene Content at the Marker , 2014, Genetics.

[37]  K. Liang,et al.  Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions , 1987 .

[38]  S. Fernandez,et al.  A study on the minimum number of loci required for genetic evaluation using a finite locus model , 2004, Genetics Selection Evolution.

[39]  B. Kinghorn,et al.  An efficient algorithm for segregation analysis in large populations , 1996 .

[40]  C. Cockerham,et al.  VARIANCE OF GENE FREQUENCIES , 1969, Evolution; international journal of organic evolution.

[41]  P. Monget,et al.  A novel mutation in the bone morphogenetic protein 15 gene causing defective protein secretion is associated with both increased ovulation rate and sterility in Lacaune sheep. , 2007, Endocrinology.

[42]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[43]  A. Robertson,et al.  The Association between Blood Groups and Several Production Characteristics in Three Danish Cattle Breeds , 1961 .

[44]  Karl J. Friston,et al.  Variance Components , 2003 .

[45]  Per Madsen,et al.  Residual maximum likelihood estimation of (co)variance components in multivariate mixed linear models using average information , 1997 .

[46]  W. Muir,et al.  Genome-wide association mapping including phenotypes from relatives without genotypes. , 2012, Genetics research.

[47]  I Misztal,et al.  A relationship matrix including full pedigree and genomic information. , 2009, Journal of dairy science.

[48]  I Misztal,et al.  Bias in genomic predictions for populations under selection. , 2011, Genetics research.

[49]  O. F. Christensen,et al.  Compatibility of pedigree-based and marker-based relationship matrices for single-step genetic evaluation , 2012, Genetics Selection Evolution.

[50]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[51]  M Grossman,et al.  Marker assisted selection using best linear unbiased prediction , 1989, Genetics Selection Evolution.