Factors affecting genomic selection revealed by empirical evidence in maize

Abstract Genomic selection (GS) as a promising molecular breeding strategy has been widely implemented and evaluated for plant breeding, because it has remarkable superiority in enhancing genetic gain, reducing breeding time and expenditure, and accelerating the breeding process. In this study the factors affecting prediction accuracy (rMG) in GS were evaluated systematically, using six agronomic traits (plant height, ear height, ear length, ear diameter, grain yield per plant and hundred-kernel weight) evaluated in one natural and two biparental populations. The factors examined included marker density, population size, heritability, statistical model, population relationships and the ratio of population size between the training and testing sets, the last being revealed by resampling individuals in different proportions from a population. Prediction accuracy continuously increased as marker density and population size increased and was positively correlated with heritability; rMG showed a slight gain when the training set increased to three times as large as the testing set. Low predictive performance between unrelated populations could be attributed to different allele frequencies, and predictive ability and prediction accuracy could be improved by including more related lines in the training population. Among the seven statistical models examined, including ridge regression best linear unbiased prediction (RR-BLUP), genomic BLUP (GBLUP), BayesA, BayesB, BayesC, Bayesian least absolute shrinkage and selection operator (Bayesian LASSO), and reproducing kernel Hilbert space (RKHS), the RKHS and additive-dominance model (Add + Dom model) showed credible ability for capturing non-additive effects, particularly for complex traits with low heritability. Empirical evidence generated in this study for GS-relevant factors will help plant breeders to develop GS-assisted breeding strategies for more efficient development of varieties.

[1]  J Crossa,et al.  Genomic prediction in CIMMYT maize and wheat breeding programs , 2013, Heredity.

[2]  J. Whittaker,et al.  Marker-assisted selection using ridge regression. , 1999, Genetical research.

[3]  R. Bernardo,et al.  Prospects for genomewide selection for quantitative traits in maize , 2007 .

[4]  R. Fernando,et al.  Genomic-Assisted Prediction of Genetic Value With Semiparametric Procedures , 2006, Genetics.

[5]  Hsiao-Pei Yang,et al.  Genomic Selection in Plant Breeding: A Comparison of Models , 2012 .

[6]  Elisabeth Jonas,et al.  Does genomic selection have a future in plant breeding? , 2013, Trends in biotechnology.

[7]  M. Stephens,et al.  fastSTRUCTURE: Variational Inference of Population Structure in Large SNP Data Sets , 2014, Genetics.

[8]  Albrecht E. Melchinger,et al.  Genomic Prediction of Northern Corn Leaf Blight Resistance in Maize with Combined or Separated Training Sets for Heterotic Groups , 2013, G3: Genes | Genomes | Genetics.

[9]  M. Sorrells,et al.  Genomic Selection for Crop Improvement , 2009 .

[10]  D. Akdemir,et al.  Genomic Selection and Association Mapping in Rice (Oryza sativa): Effect of Trait Genetic Architecture, Training Population Composition, Marker Number and Statistical Model on Accuracy of Rice Genomic Selection in Elite, Tropical Rice Breeding Lines , 2015, PLoS genetics.

[11]  R. Bernardo Best linear unbiased prediction of maize single-cross performance , 1996 .

[12]  M. Olsen,et al.  Enhancing genetic gain in the era of molecular breeding , 2017, Journal of experimental botany.

[13]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[14]  L R Schaeffer,et al.  Strategy for applying genome-wide selection in dairy cattle. , 2006, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[15]  Emily Combs,et al.  Accuracy of Genomewide Selection for Different Traits with Constant Population Size, Heritability, and Number of Markers , 2013 .

[16]  Jeffrey B. Endelman,et al.  Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP , 2011 .

[17]  A. Carriquiry,et al.  Parametric and Nonparametric Statistical Methods for Genomic Selection of Traits with Additive and Epistatic Genetic Architectures , 2014, G3: Genes, Genomes, Genetics.

[18]  J. E. Cairns,et al.  Genome-enabled prediction of genetic values using radial basis function neural networks , 2012, Theoretical and Applied Genetics.

[19]  Chenwu Xu,et al.  Predicting rice hybrid performance using univariate and multivariate GBLUP models based on North Carolina mating design II , 2016, Heredity.

[20]  Robenzon E. Lorenzana,et al.  Accuracy of genotypic value predictions for marker-based selection in biparental plant populations , 2009, Theoretical and Applied Genetics.

[21]  José Crossa,et al.  Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular Markers , 2012 .

[22]  Jean-Luc Jannink,et al.  Genomic Selection Accuracy using Multifamily Prediction Models in a Wheat Breeding Program , 2011 .

[23]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[24]  José Crossa,et al.  Genetic Gains in Grain Yield Through Genomic Selection in Eight Bi-parental Maize Populations under Drought Stress , 2015 .

[25]  T. A. Martin,et al.  Accuracy of Genomic Selection Methods in a Standard Data Set of Loblolly Pine (Pinus taeda L.) , 2012, Genetics.

[26]  Prediction of malting quality traits in barley based on genome-wide marker data to assess the potential of genomic selection , 2016, Theoretical and Applied Genetics.

[27]  Yusheng Zhao,et al.  Accuracy of genomic selection in European maize elite breeding populations , 2011, Theoretical and Applied Genetics.

[28]  Zhanyou Xu,et al.  The impact of population structure on genomic prediction in stratified populations , 2014, Theoretical and Applied Genetics.

[29]  M. Lund,et al.  Estimating Additive and Non-Additive Genetic Variances and Predicting Genetic Merits Using Genome-Wide Dense Single Nucleotide Polymorphism Markers , 2012, PloS one.

[30]  N. Ranc,et al.  Usefulness of Multiparental Populations of Maize (Zea mays L.) for Genome-Based Prediction , 2014, Genetics.

[31]  Akihiro Nakaya,et al.  REVIEW: PART OF A HIGHLIGHT ON BREEDING STRATEGIES FOR FORAGE AND GRASS IMPROVEMENT Will genomic selection be a practical method for plant breeding? , 2012 .

[32]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[33]  Yanli Lu,et al.  Whole-genome strategies for marker-assisted plant breeding , 2012, Molecular Breeding.

[34]  Alfredo E Farjat,et al.  Genomic selection in maritime pine. , 2016, Plant science : an international journal of experimental plant biology.

[35]  J Crossa,et al.  Genomic-enabled prediction with classification algorithms , 2014, Heredity.

[36]  M. Calus,et al.  Reliability of direct genomic values for animals with different relationships within and to the reference population. , 2012, Journal of dairy science.

[37]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[38]  José Crossa,et al.  Genome-enabled prediction using probabilistic neural network classifiers , 2016, BMC Genomics.

[39]  Guoying Wang,et al.  Development of a maize 55 K SNP array with improved genome coverage for molecular breeding , 2017, Molecular Breeding.

[40]  José Crossa,et al.  A reaction norm model for genomic selection using high-dimensional genomic and environmental data , 2013, Theoretical and Applied Genetics.

[41]  S. Moore,et al.  Accuracy of genomic selection for age at puberty in a multi-breed population of tropically adapted beef cattle. , 2016, Animal genetics.

[42]  J. Ogutu,et al.  Genomic Selection using Multiple Populations , 2012 .

[43]  Approximated prediction of genomic selection accuracy when reference and candidate populations are related , 2016, Genetics Selection Evolution.

[44]  Y. Beyene,et al.  Effect of Trait Heritability, Training Population Size and Marker Density on Genomic Prediction Accuracy Estimation in 22 bi-parental Tropical Maize Populations , 2017, Front. Plant Sci..

[45]  D Gianola,et al.  Reproducing kernel Hilbert spaces regression: a general framework for genetic evaluation. , 2009, Journal of animal science.

[46]  Shizhong Xu,et al.  Predicting hybrid performance in rice using genomic best linear unbiased prediction , 2014, Proceedings of the National Academy of Sciences.

[47]  D. Gianola Priors in Whole-Genome Regression: The Bayesian Alphabet Returns , 2013, Genetics.

[48]  Xiaojie Xu,et al.  Development of a multiple-hybrid population for genome-wide association studies: theoretical consideration and genetic mapping of flowering traits in maize , 2017, Scientific Reports.

[49]  Jochen C Reif,et al.  Modeling Epistasis in Genomic Selection , 2015, Genetics.

[50]  Kevin P. Smith,et al.  Assessing Genomic Selection Prediction Accuracy in a Dynamic Barley Breeding Population , 2015, The plant genome.

[51]  R. Bernardo,et al.  Genomewide selection in oil palm: increasing selection gain per unit time and cost with small populations , 2008, Theoretical and Applied Genetics.

[52]  E. Buckler,et al.  Rapid Cycling Genomic Selection in a Multiparental Tropical Maize Population , 2017, G3: Genes, Genomes, Genetics.

[53]  G. Graham,et al.  Use of doubled haploids in maize breeding: implications for intellectual property protection and genetic diversity in hybrid crops , 2008, Molecular Breeding.

[54]  M. Pumphrey,et al.  Unlocking Diversity in Germplasm Collections via Genomic Selection: A Case Study Based on Quantitative Adult Plant Resistance to Stripe Rust in Spring Wheat , 2017, The plant genome.

[55]  M. Sorrells,et al.  Plant Breeding with Genomic Selection: Gain per Unit Time and Cost , 2010 .

[56]  José Crossa,et al.  Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods. , 2010, Genetics research.

[57]  G. de los Campos,et al.  Genome-Wide Regression and Prediction with the BGLR Statistical Package , 2014, Genetics.

[58]  J. Ogutu,et al.  Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions , 2012, BMC Proceedings.

[59]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[60]  J. Poland,et al.  Training set optimization under population structure in genomic selection , 2014, Theoretical and Applied Genetics.

[61]  G. Covarrubias-Pazaran Genome-Assisted Prediction of Quantitative Traits Using the R Package sommer , 2016, PloS one.

[62]  J Crossa,et al.  Genomic prediction in biparental tropical maize populations in water-stressed and well-watered environments using low-density and GBS SNPs , 2014, Heredity.

[63]  Henner Simianer,et al.  Genome-based prediction of testcross values in maize , 2011, Theoretical and Applied Genetics.

[64]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[65]  H. Buerstmayr,et al.  Genomic selection across multiple breeding cycles in applied bread wheat breeding , 2016, Theoretical and Applied Genetics.

[66]  J. Poland,et al.  Comparison of Models and Whole‐Genome Profiling Approaches for Genomic‐Enabled Prediction of Septoria Tritici Blotch, Stagonospora Nodorum Blotch, and Tan Spot Resistance in Wheat , 2017, The plant genome.

[67]  J. V. van Arendonk,et al.  Economic evaluation of progeny-testing and genomic selection schemes for small-sized nucleus dairy cattle breeding programs in developing countries. , 2017, Journal of dairy science.

[68]  Jianwei Lu,et al.  Evaluation of genome-wide selection efficiency in maize nested association mapping populations , 2011, Theoretical and Applied Genetics.

[69]  G. Charmet,et al.  Genome-wide prediction of three important traits in bread wheat , 2014, Molecular Breeding.

[70]  H. Iwata,et al.  Genomic Selection Accuracy for Grain Quality Traits in Biparental Wheat Populations , 2011 .

[71]  José Crossa,et al.  Breeding schemes for the implementation of genomic selection in wheat (Triticum spp.). , 2016, Plant science : an international journal of experimental plant biology.

[72]  Xuecai Zhang,et al.  Genome‐Wide Analysis of Tar Spot Complex Resistance in Maize Using Genotyping‐by‐Sequencing SNPs and Whole‐Genome Prediction , 2017, The plant genome.

[73]  G. de los Campos,et al.  Genomic Selection in Plant Breeding: Methods, Models, and Perspectives. , 2017, Trends in plant science.

[74]  D. Gianola,et al.  Reproducing Kernel Hilbert Spaces Regression Methods for Genomic Assisted Prediction of Quantitative Traits , 2008, Genetics.

[75]  G. de los Campos,et al.  Bayesian Genomic Prediction with Genotype × Environment Interaction Kernel Models , 2016, G3: Genes, Genomes, Genetics.