Genomic Prediction from Multiple-Trait Bayesian Regression Methods Using Mixture Priors

Bayesian multiple-regression methods incorporating different mixture priors for marker effects are used widely in genomic prediction. Improvements in prediction accuracy from using these methods, such as BayesB, BayesC, and BayesCπ, have been shown in single-trait analyses with both simulated and real data. These methods have been extended to multi-trait analyses, but only under the restrictive assumption that a locus simultaneously affects all the traits or none of them. This assumption is not biologically meaningful, especially in multi-trait analyses involving many traits. In this paper, we develop and implement more general multi-trait BayesCΠ and BayesB methods that allow a broader range of mixture priors. Our methods allow a locus to affect any combination of traits; e.g., in a 5-trait analysis, the “restrictive” model allows only two situations (the locus affects all five traits or none), whereas ours allows all 2^5 = 32 situations. Further, we compare our methods to single-trait methods and the “restrictive” multi-trait formulation using real and simulated data. In the real data analysis, both our new broad-based multi-trait methods and the “restrictive” formulation gave higher prediction accuracies than single-trait methods, and the two multi-trait approaches showed similar prediction accuracies to each other. In the simulated data analysis, our general multi-trait methods gave higher prediction accuracies than the “restrictive” method for intermediate training population sizes. The software tool JWAS offers open-source routines to perform these analyses.
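
The count of 32 situations in the 5-trait example can be checked by enumerating every 0/1 indicator vector over the traits. The following is a minimal illustrative sketch in Python, not code from JWAS; the function names `inclusion_patterns` and `restrictive_patterns` are hypothetical and introduced here only for illustration.

```python
from itertools import product


def inclusion_patterns(n_traits):
    """All 0/1 indicator vectors; entry t is 1 if the locus affects trait t.

    Hypothetical helper for illustration, not part of JWAS.
    """
    return list(product((0, 1), repeat=n_traits))


def restrictive_patterns(n_traits):
    """The two vectors allowed when a locus affects either no trait or every trait."""
    return [(0,) * n_traits, (1,) * n_traits]


n_traits = 5
general = inclusion_patterns(n_traits)
restrictive = restrictive_patterns(n_traits)

# The general formulation has 2**n_traits patterns; the restrictive one has 2.
print(len(general), len(restrictive))
```

Because the number of patterns doubles with each added trait, the gap between the two formulations widens quickly as more traits enter the analysis.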
