Application of Bayesian least absolute shrinkage and selection operator (LASSO) and BayesCπ methods for genomic selection in French Holstein and Montbéliarde breeds.

Recently, the amount of available single nucleotide polymorphism (SNP) marker data has considerably increased in dairy cattle breeds, both for research purposes and for application in commercial breeding and selection programs. Bayesian methods are currently used in the genomic evaluation of dairy cattle to handle very large sets of explanatory variables with a limited number of observations. In this study, we applied 2 bayesian methods, BayesCπ and bayesian least absolute shrinkage and selection operator (LASSO), to 2 genotyped and phenotyped reference populations consisting of 3,940 Holstein bulls and 1,172 Montbéliarde bulls with approximately 40,000 polymorphic SNP. We compared the accuracy of the bayesian methods for the prediction of 3 traits (milk yield, fat content, and conception rate) with pedigree-based BLUP, genomic BLUP, partial least squares (PLS) regression, and sparse PLS regression, a variable selection PLS variant. The results showed that the correlations between observed and predicted phenotypes were similar in BayesCπ (including or not pedigree information) and bayesian LASSO for most of the traits and whatever the breed. In the Holstein breed, bayesian methods led to higher correlations than other approaches for fat content and were similar to genomic BLUP for milk yield and to genomic BLUP and PLS regression for the conception rate. In the Montbéliarde breed, no method dominated the others, except BayesCπ for fat content. The better performances of the bayesian methods for fat content in Holstein and Montbéliarde breeds are probably due to the effect of the DGAT1 gene. The SNP identified by the BayesCπ, bayesian LASSO, and sparse PLS regression methods, based on their effect on the different traits of interest, were located at almost the same position on the genome. As the bayesian methods resulted in regressions of direct genomic values on daughter trait deviations closer to 1 than for the other methods tested in this study, bayesian methods are suggested for genomic evaluations of French dairy cattle.

[1]  T. Krzyzelewski,et al.  The Implementation of Genomic Evaluations in the UK , 2011 .

[2]  Ignacio González,et al.  integrOmics: an R package to unravel relationships between two omics datasets , 2009, Bioinform..

[3]  R. Fernando,et al.  Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes. , 2010, Journal of animal science.

[4]  Henk Bovenhuis,et al.  Sensitivity of methods for estimating breeding values using genetic markers to the number of QTL and distribution of QTL variance , 2010, Genetics Selection Evolution.

[5]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[6]  Jianhua Z. Huang,et al.  Sparse principal component analysis via regularized low rank matrix approximation , 2008 .

[7]  R. Spelman,et al.  Genomic selection in New Zealand and the implications for national genetic evaluation. , 2009 .

[8]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[9]  Bjørn-Helge Mevik,et al.  Mean squared error of prediction (MSEP) estimates for principal component regression (PCR) and partial least squares regression (PLSR) , 2004 .

[10]  F. Seefried,et al.  Impacts of both reference population size and inclusion of a residual polygenic effect on the accuracy of genomic prediction , 2011, Genetics Selection Evolution.

[11]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[12]  Tom Druet,et al.  A Hidden Markov Model Combining Linkage and Linkage Disequilibrium Information for Haplotype Reconstruction and Quantitative Trait Locus Fine Mapping , 2010, Genetics.

[13]  Philippe Besse,et al.  Statistical Applications in Genetics and Molecular Biology A Sparse PLS for Variable Selection when Integrating Omics Data , 2011 .

[14]  D Gianola,et al.  Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins. , 2011, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[15]  M. Goddard,et al.  A genome map of divergent artificial selection between Bos taurus dairy cattle and Bos taurus beef cattle. , 2009, Animal genetics.

[16]  P. Lichtner,et al.  The impact of genetic relationship information on genomic breeding values in German Holstein cattle , 2010, Genetics Selection Evolution.

[17]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[18]  Bruce Tier,et al.  A comparison of five methods to predict genomic breeding values of dairy bulls from genome-wide SNP markers , 2009, Genetics Selection Evolution.

[19]  D. Boichard,et al.  Fine tuning genomic evaluations in dairy cattle through SNP pre-selection with the Elastic-Net algorithm. , 2011, Genetics research.

[20]  Daniel Gianola,et al.  Additive Genetic Variability and the Bayesian Alphabet , 2009, Genetics.

[21]  John Van Sickle,et al.  ANALYZING CORRELATIONS BETWEEN STREAM AND WATERSHED ATTRIBUTES 1 , 2003 .

[22]  P M VanRaden,et al.  Derivation, calculation, and use of national animal model information. , 1991, Journal of dairy science.

[23]  Michel Georges,et al.  Genetic and functional confirmation of the causality of the DGAT1 K232A quantitative trait nucleotide in affecting milk yield and composition. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Guosheng Su,et al.  Deregressed EBV as the response variable yield more reliable genomic predictions than traditional EBV in pure-bred pigs , 2011, Genetics Selection Evolution.

[25]  P. VanRaden,et al.  Distribution and location of genetic effects for dairy traits. , 2009, Journal of dairy science.

[26]  C. Robert-Granié,et al.  Improved Lasso for genomic selection. , 2011, Genetics research.

[27]  Vincent Ducrocq,et al.  Application of PLS and Sparse PLS regression in genomic selection , 2010 .

[28]  A Legarra,et al.  Genomic selection in the French Lacaune dairy sheep breed. , 2012, Journal of dairy science.

[29]  D. Boichard,et al.  Genetic Analysis of Conception Rate in French Holstein Cattle , 1994 .

[30]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[31]  R. Fernando,et al.  Genomic breeding value prediction and QTL mapping of QTLMAS2010 data using Bayesian Methods , 2011, BMC proceedings.

[32]  D Gianola,et al.  Predictive ability of direct genomic values for lifetime net merit of Holstein sires using selected subsets of single nucleotide polymorphism markers. , 2009, Journal of dairy science.

[33]  I Misztal,et al.  Bias in genomic predictions for populations under selection. , 2011, Genetics research.

[34]  P. VanRaden,et al.  Invited review: reliability of genomic predictions for North American Holstein bulls. , 2009, Journal of dairy science.

[35]  K. G. Nirea,et al.  Genomic selection in Fleckvieh/Simmental - First results , 2009 .

[36]  Hyonho Chun,et al.  Expression Quantitative Trait Loci Mapping With Multivariate Sparse Partial Least Squares Regression , 2009, Genetics.

[37]  José Crossa,et al.  Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree , 2009, Genetics.

[38]  V Ducrocq,et al.  Evidence of biases in genetic evaluations due to genomic preselection in dairy cattle. , 2011, Journal of dairy science.

[39]  J. Woolliams,et al.  Reducing dimensionality for prediction of genome-wide breeding values , 2009, Genetics Selection Evolution.

[40]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[41]  A Legarra,et al.  A comparison of partial least squares (PLS) and sparse PLS regressions in genomic selection in French dairy cattle. , 2012, Journal of dairy science.