Usefulness of Multiparental Populations of Maize (Zea mays L.) for Genome-Based Prediction

The efficiency of marker-assisted prediction of phenotypes has been studied intensively for different types of plant breeding populations. However, one remaining question is how to incorporate and counterbalance information from biparental and multiparental populations into model training for genome-wide prediction. To address this question, we evaluated testcross performance of 1652 doubled-haploid maize (Zea mays L.) lines that were genotyped with 56,110 single nucleotide polymorphism markers and phenotyped for five agronomic traits in four to six European environments. The lines are arranged in two diverse half-sib panels representing two major European heterotic germplasm pools. The data set contains 10 related biparental dent families and 11 related biparental flint families generated from crosses of maize lines important for European maize breeding. With this new data set we analyzed genome-based best linear unbiased prediction in different validation schemes and compositions of estimation and test sets. Further, we theoretically and empirically investigated marker linkage phases across multiparental populations. In general, predictive abilities similar to or higher than those within biparental families could be achieved by combining several half-sib families in the estimation set. For the majority of families, 375 half-sib lines in the estimation set were sufficient to reach the same predictive performance of biomass yield as an estimation set of 50 full-sib lines. In contrast, prediction across heterotic pools was not possible for most cases. Our findings are important for experimental design in genome-based prediction as they provide guidelines for the genetic structure and required sample size of data sets used for model training.

[1]  James Cockram,et al.  An Eight-Parent Multiparent Advanced Generation Inter-Cross Population for Winter-Sown Wheat: Creation, Properties, and Validation , 2014, G3: Genes, Genomes, Genetics.

[2]  G. de los Campos,et al.  Genome-Wide Regression and Prediction with the BGLR Statistical Package , 2014, Genetics.

[3]  Hans-Peter Piepho,et al.  Genome-based prediction of maize hybrid performance across genetic groups, testers, locations, and years , 2014, Theoretical and Applied Genetics.

[4]  David J Balding,et al.  Multiple Quantitative Trait Analysis Using Bayesian Networks , 2014, Genetics.

[5]  Zhanyou Xu,et al.  The impact of population structure on genomic prediction in stratified populations , 2014, Theoretical and Applied Genetics.

[6]  J Crossa,et al.  Genomic prediction in CIMMYT maize and wheat breeding programs , 2013, Heredity.

[7]  Yu Wang,et al.  Genome-Wide Prediction of Traits with Different Genetic Architecture Through Efficient Variable Selection , 2013, Genetics.

[8]  O. Martin,et al.  Intraspecific variation of recombination rate in maize , 2013, Genome Biology.

[9]  Jochen C Reif,et al.  Genomic selection in sugar beet breeding populations , 2013, BMC Genetics.

[10]  B. Mangin,et al.  Combined linkage and linkage disequilibrium QTL mapping in multiple families of maize (Zea mays L.) line crosses highlights complementarities between models based on parental haplotype and single locus polymorphism , 2013, Theoretical and Applied Genetics.

[11]  R. Fernando,et al.  Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor , 2013, PLoS genetics.

[12]  D. Gianola Priors in Whole-Genome Regression: The Bayesian Alphabet Returns , 2013, Genetics.

[13]  Edward S. Buckler,et al.  The Genetic Architecture of Maize Stalk Strength , 2013, PloS one.

[14]  Jean-Luc Jannink,et al.  Genomic Predictability of Interconnected Biparental Maize Populations , 2013, Genetics.

[15]  Daniel Gianola,et al.  Sensitivity to prior specification in Bayesian genome-based prediction models , 2013, Statistical applications in genetics and molecular biology.

[16]  Emily Combs,et al.  Accuracy of Genomewide Selection for Different Traits with Constant Population Size, Heritability, and Number of Markers , 2013 .

[17]  Albrecht E. Melchinger,et al.  Genomic Prediction of Northern Corn Leaf Blight Resistance in Maize with Combined or Separated Training Sets for Heterotic Groups , 2013, G3: Genes | Genomes | Genetics.

[18]  M. Calus,et al.  Genomic Prediction in Animals and Plants: Simulation of Data, Validation, Reporting, and Benchmarking , 2013, Genetics.

[19]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[20]  J. Ogutu,et al.  Genomic Selection using Multiple Populations , 2012 .

[21]  Ky L. Mathews,et al.  Genomic Prediction of Genetic Values for Resistance to Wheat Rusts , 2012 .

[22]  A. Melchinger,et al.  Maximizing the Reliability of Genomic Selection by Optimizing the Calibration Set of Reference Individuals: Comparison of Methods in Two Diverse Groups of Maize Inbreds (Zea mays L.) , 2012, Genetics.

[23]  A. Melchinger,et al.  Genetic diversity analysis of elite European maize (Zea mays L.) inbred lines using AFLP, SSR, and SNP markers reveals ascertainment bias for a subset of SNPs , 2012, Theoretical and Applied Genetics.

[24]  Chris-Carolin Schön,et al.  synbreed: a framework for the analysis of genomic prediction data using R , 2012, Bioinform..

[25]  A. Melchinger,et al.  Genomic prediction of hybrid performance in maize with models incorporating dominance and population specific marker effects , 2012, Theoretical and Applied Genetics.

[26]  Katherine E. Guill,et al.  The relationship between parental genetic or phenotypic divergence and progeny variation in the maize nested association mapping population , 2011, Heredity.

[27]  T. A. Martin,et al.  Accuracy of Genomic Selection Methods in a Standard Data Set of Loblolly Pine (Pinus taeda L.) , 2012, Genetics.

[28]  M. Stitt,et al.  Genomic and metabolic prediction of complex heterotic traits in hybrid maize , 2012, Nature Genetics.

[29]  Hsiao-Pei Yang,et al.  Genomic Selection in Plant Breeding: A Comparison of Models , 2012 .

[30]  O. Martin,et al.  A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome , 2011, PloS one.

[31]  Yusheng Zhao,et al.  Accuracy of genomic selection in European maize elite breeding populations , 2011, Theoretical and Applied Genetics.

[32]  Jianwei Lu,et al.  Evaluation of genome-wide selection efficiency in maize nested association mapping populations , 2011, Theoretical and Applied Genetics.

[33]  W. Beavis,et al.  Accuracy and Training Population Design for Genomic Selection on Quantitative Traits in Elite North American Oats , 2011 .

[34]  Henner Simianer,et al.  Genome-based prediction of testcross values in maize , 2011, Theoretical and Applied Genetics.

[35]  Hans-Peter Piepho,et al.  Augmented p‐rep designs , 2011, Biometrical journal. Biometrische Zeitschrift.

[36]  Aaron J. Lorenz,et al.  Genomic Selection in Plant Breeding , 2011 .

[37]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[38]  J. Holland,et al.  Estimating and Interpreting Heritability for Plant Breeding: An Update , 2010 .

[39]  M. Goddard,et al.  Reliability of Genomic Predictions Across Multiple Populations , 2009, Genetics.

[40]  Ben J Hayes,et al.  Accuracy of genomic breeding values in multi-breed dairy cattle populations , 2009, Genetics Selection Evolution.

[41]  Robenzon E. Lorenzana,et al.  Accuracy of genotypic value predictions for marker-based selection in biparental plant populations , 2009, Theoretical and Applied Genetics.

[42]  M. McMullen,et al.  Genetic Properties of the Maize Nested Association Mapping Population , 2009, Science.

[43]  T. Meuwissen,et al.  Accuracy of breeding values of 'unrelated' individuals predicted by dense SNP genotyping , 2009, Genetics Selection Evolution.

[44]  Jean-Luc Jannink,et al.  Factors Affecting Accuracy From Genomic Selection in Populations Derived From Multiple Inbred Lines: A Barley Case Study , 2009, Genetics.

[45]  B. Browning,et al.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. , 2009, American journal of human genetics.

[46]  G. Vallad,et al.  Genetic variance, coefficient of parentage, and genetic distance of six soybean populations , 2009, Theoretical and Applied Genetics.

[47]  Hans D. Daetwyler,et al.  Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach , 2008, PloS one.

[48]  Andrés Legarra,et al.  Performance of Genomic Selection in Mice , 2008, Genetics.

[49]  M. McMullen,et al.  Genetic Design and Statistical Power of Nested Association Mapping in Maize , 2008, Genetics.

[50]  B. J. Hayes,et al.  Genomic selection: Genomic selection , 2007 .

[51]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[52]  J. Dekkers,et al.  Marker-assisted selection for commercial crossbred performance. , 2007, Journal of animal science.

[53]  M. Goddard,et al.  Genomic selection. , 2007, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[54]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[55]  B. Mangin,et al.  Connected populations for detecting quantitative trait loci and testing for epistasis: an application in maize , 2006, Theoretical and Applied Genetics.

[56]  J. Jannink,et al.  Using mating designs to uncover QTL and the genetic architecture of complex traits , 2006, Heredity.

[57]  L. Essioux,et al.  The effect of population structure on the relationship between heterosis and heterozygosity at marker loci , 1994, Theoretical and Applied Genetics.

[58]  Ahmed Rebai,et al.  Power of tests for QTL detection using replicated progenies derived from a diallel cross , 1993, Theoretical and Applied Genetics.

[59]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[60]  R Jansen,et al.  Mapping epistatic quantitative trait loci with one-dimensional genome searches. , 2001, Genetics.

[61]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[62]  Mean, genetic variance, and usefulness of selfing progenies from intra- and inter-pool crosses in faba beans (Vicia faba L.) and their prediction from parental parameters , 1999, Theoretical and Applied Genetics.

[63]  Prediction of testcross means and variances among F3 progenies of F1 crosses from testcross means and genetic distances of their parents in maize , 1998, Theoretical and Applied Genetics.

[64]  John M. Martin,et al.  Predicting progeny variance from parental divergence in hard red spring wheat , 1998 .

[65]  S. Xu,et al.  Mapping quantitative trait loci using multiple families of line crosses. , 1998, Genetics.

[66]  Hélène Muranty,et al.  Power of tests for quantitative trait loci detection using full-sib families in different schemes , 1996, Heredity.

[67]  M. Nei,et al.  Linkage disequilibrium in subdivided populations. , 1973, Genetics.

[68]  Peter H. A. Sneath,et al.  Numerical Taxonomy: The Principles and Practice of Numerical Classification , 1973 .

[69]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[70]  F. E. Grubbs Sample Criteria for Testing Outlying Observations , 1950 .