Cross-validation of genetic and genomic predictions of temperament in Nellore–Angus crossbreds

Abstract The objective of this study was to cross-validate predictions of genomic merit for overall temperament at weaning (TEMP) by alternative partitioning of training and validation sets in a Nellore–Angus crossbred population. Nellore–Angus F2 calves (embryo transfer and natural service reciprocal crosses), paternal half-siblings to the embryo transfer calves, and F3 calves had records for TEMP (subjectively scored on a 1 to 9 scale). Calves with DNA available were genotyped on the Illumina Bovine SNP50 version 1 and 2 assays, and, after quality control filtering, there were 34,913 SNP markers available for use. Calves with TEMP records and genotypes (n=769; mean±std of all TEMP records is 3.975±2.062) were used in this study. BayesB procedures with π =0.997 were used to obtain direct genomic values (DGV) with alternative partitioning of data into training and validation sets utilizing the family structure of this population. Training was conducted (scenarios 1–4) using the progeny of all but one of the four sire or grandsires, (5) using only F2 progeny, (6) by random assignment, and (7) using only embryo transfer F2 and natural service half-sibling cattle. In random assignment, the number of animals included in the validation population was the average of scenarios 1–5. The DGV generated in these training and validation sets were compared to traditional, pedigree-based breeding values from an animal model with equivalent training and validation sets. The training model included fixed effects of sex, birth year-season combination, and temperament scoring pen nested in birth year-season combination. Standardized accuracies were higher for DGV using BayesB procedures (0.226 on average) compared to EBV using a traditional pedigree-based animal model (0.122 average). Random allocation of individuals into training and validation groups resulted in the highest accuracies for DGV (0.503) and EBV (0.354) of the validation groups. Overall, accuracies of animals in validation groups were low (i.e., less than 0.35) using genomic predictions, which could be due to inadequate sample size, insufficient marker density, or limited relationship between individuals in the training and validation populations. The genomic method generated unique DGV among full- and half-siblings within families whereas pedigree-based prediction results in identical EBV among full-siblings. This is of particular interest for the cattle industry as genomic methods can provide more plausible estimates of genetic merit for crossbred or purebred cattle without records, even if the initial accuracy is low.

[1]  M. Goddard,et al.  Can the same genetic markers be used in multiple breeds , 2006 .

[2]  R. Fernando,et al.  Genomic selection in admixed and crossbred populations. , 2010, Journal of animal science.

[3]  D. Garrick,et al.  Accuracies of direct genomic breeding values in Hereford beef cattle using national or international training populations. , 2013, Journal of animal science.

[4]  M. Goddard,et al.  Reliability of Genomic Predictions Across Multiple Populations , 2009, Genetics.

[5]  C. Gill,et al.  Comparison of breeding value prediction for two traits in a Nellore-Angus crossbred population using different Bayesian modeling methodologies , 2014, Genetics and molecular biology.

[6]  J. Woolliams,et al.  The Accuracy of Genomic Selection in Norwegian Red Cattle Assessed by Cross-Validation , 2009, Genetics.

[7]  R. Fernando,et al.  Accuracies of genomic breeding values in American Angus beef cattle using K-means clustering for cross-validation , 2011, Genetics Selection Evolution.

[8]  R. Schnabel,et al.  Accuracy of direct genomic breeding values for nationally evaluated traits in US Limousin and Simmental beef cattle , 2012, Genetics Selection Evolution.

[9]  R. Dillon,et al.  Relationships between temperament and growth in a feedlot and commercial carcass traits of Bos indicus crossbreds , 1997 .

[10]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[11]  Optimising multistage dairy cattle breeding schemes including genomic selection using decorrelated or optimum selection indices , 2012, Genetics Selection Evolution.

[12]  C. Gill,et al.  Genetic evaluation of aspects of temperament in Nellore-Angus calves. , 2014, Journal of animal science.

[13]  S. R. Miraei-Ashtiani,et al.  The impact of information quantity and strength of relationship between training set and validation set on accuracy of genomic estimated breeding values , 2010 .

[14]  Robert D Schnabel,et al.  Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle Breeds , 2009, Science.

[15]  G. Lathrop,et al.  Genetic and Haplotypic Structure in 14 European and African Cattle Breeds , 2007, Genetics.

[16]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[17]  M J Kelly,et al.  Genomic predictions in Angus cattle: comparisons of sample size, response variables, and clustering methods for cross-validation. , 2014, Journal of animal science.

[18]  P. Lichtner,et al.  The impact of genetic relationship information on genomic breeding values in German Holstein cattle , 2010, Genetics Selection Evolution.

[19]  R. Fernando,et al.  Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes. , 2010, Journal of animal science.

[20]  C. Gill,et al.  Genome-wide association study of temperament and tenderness using different Bayesian approaches in a Nellore–Angus crossbred population , 2014 .

[21]  Robin Thompson,et al.  ASREML user guide release 1.0 , 2002 .

[22]  R. Fernando,et al.  Genomic selection of purebreds for crossbred performance , 2009, Genetics Selection Evolution.

[23]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.