Genomic selection in multi-environment plant breeding trials using a factor analytic linear mixed model.

Genomic selection (GS) is a statistical and breeding methodology designed to improve genetic gain. It has proven to be successful in animal breeding; however, key points of difference have not been fully considered in the transfer of GS from animal to plant breeding. In plant breeding, individuals (varieties) are typically evaluated across a number of locations in multiple years (environments) in formally designed comparative experiments, called multi-environment trials (METs). The design structure of individual trials can be complex and needs to be modelled appropriately. Another key feature of MET data sets is the presence of variety by environment interaction (VEI), that is the differential response of varieties to a change in environment. In this paper, a single-step factor analytic linear mixed model is developed for plant breeding MET data sets that incorporates molecular marker data, appropriately accommodates non-genetic sources of variation within trials and models VEI. A recently developed set of selection tools, which are natural derivatives of factor analytic models, are used to facilitate GS for a motivating data set from an Australian plant breeding company. The power and versatility of these tools is demonstrated for the variety by environment and marker by environment effects.

[1]  Brian R. Cullis,et al.  Factor analytic mixed models for the provision of grower information from national crop variety testing programs , 2014, Theoretical and Applied Genetics.

[2]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[3]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[4]  B. Cullis,et al.  On the Design of Field Experiments with Correlated Treatment Effects , 2014 .

[5]  Brian R. Cullis,et al.  Residual maximum likelihood (REML) estimation of a neighbour model for field experiments , 1987 .

[6]  B. Cullis,et al.  Relative accuracy of a neighbour method for field trials , 1988, The Journal of Agricultural Science.

[7]  Yusheng Zhao,et al.  Genomic selection in a commercial winter wheat population , 2016, Theoretical and Applied Genetics.

[8]  Brian R. Cullis,et al.  On the design of early generation variety trials with correlated data , 2006 .

[9]  J. John,et al.  t‐Latinized Designs , 1998 .

[10]  M. Goddard,et al.  Using the genomic relationship matrix to predict the accuracy of genomic selection. , 2011, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[11]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[12]  J. Kiefer General Equivalence Theory for Optimum Designs (Approximate Theory) , 1974 .

[13]  B. Cullis,et al.  Joint modeling of additive and non-additive (genetic line) effects in multi-environment trials , 2007, Theoretical and Applied Genetics.

[14]  Julian Besag,et al.  Statistical Analysis of Field Experiments Using Neighbouring Plots , 1986 .

[15]  J. Mandel A New Analysis of Variance Model for Non-additive Data , 1971 .

[16]  Robin Thompson,et al.  Analyzing Variety by Environment Data Using Multiplicative Mixed Models and Adjustments for Spatial Field Trend , 2001, Biometrics.

[17]  Robin Thompson,et al.  Genomic Selection in Multi-environment Crop Trials , 2016, G3: Genes, Genomes, Genetics.

[18]  Robin Thompson,et al.  The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches , 2005, The Journal of Agricultural Science.

[19]  Felicity E. Bedford,et al.  Applying association mapping and genomic selection to the dissection of key traits in elite European wheat , 2014, Theoretical and Applied Genetics.

[20]  Ignacio Aguilar,et al.  Resource allocation optimization with multi-trait genomic prediction for bread wheat (Triticum aestivum L.) baking quality , 2018, Theoretical and Applied Genetics.

[21]  Hans-Peter Piepho,et al.  ANALYZING GENOTYPE-ENVIRONMENT DATA BY MIXED MODELS WITH MULTIPLICATIVE TERMS , 1997 .

[22]  N. Ahmadi,et al.  Genomic Prediction Accounting for Genotype by Environment Interaction Offers an Effective Framework for Breeding Simultaneously for Adaptation to an Abiotic Stress and Performance Under Normal Cropping Conditions in Rice , 2018, G3: Genes, Genomes, Genetics.

[23]  R. Fernando,et al.  Dissecting the genetic architecture of frost tolerance in Central European winter wheat , 2013, Journal of experimental botany.

[24]  B. Cullis,et al.  The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials , 2007 .

[25]  R. Kempton The use of biplots in interpreting variety by environment interactions , 1984, The Journal of Agricultural Science.

[26]  B. Cullis,et al.  A New Procedure for the Analysis of Early Generation Variety Trials , 1989 .

[27]  Brian R. Cullis,et al.  Efficiency of neighbour analysis for replicated variety trials in Australia , 1989, The Journal of Agricultural Science.

[28]  Hans-Peter Piepho,et al.  Comparisons of single-stage and two-stage approaches to genomic selection , 2012, Theoretical and Applied Genetics.

[29]  Genomic selection prediction accuracy in a perennial crop: case study of oil palm (Elaeis guineensis Jacq.) , 2015, Theoretical and Applied Genetics.

[30]  Julian Taylor,et al.  Increased genomic prediction accuracy in wheat breeding using a large Australian panel , 2017, Theoretical and Applied Genetics.

[31]  K. W. Finlay,et al.  The analysis of adaptation in a plant-breeding programme , 1963 .

[32]  A mixed model QTL analysis for sugarcane multiple-harvest-location trial data , 2011, Theoretical and Applied Genetics.

[33]  D. Garrick,et al.  Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit. , 2009, Journal of dairy science.

[34]  Deniz Akdemir,et al.  Integrating environmental covariates and crop modeling into the genomic selection framework to predict genotype by environment interactions , 2013, Theoretical and Applied Genetics.

[35]  C. Beeck,et al.  Analysis of yield and oil from a series of canola breeding trials. Part I. Fitting factor analytic mixed models with pedigree information. , 2010, Genome.

[36]  H. Piepho,et al.  Comparison of Weighting in Two‐Stage Analysis of Plant Breeding Trials , 2009 .

[37]  B. Cullis,et al.  Plant breeding selection tools built on factor analytic mixed models for multi-environment trial data , 2018, Euphytica.

[38]  Robin Thompson,et al.  Factor analytic and reduced animal models for the investigation of additive genotype-by-environment interaction in outcrossing plant species with application to a Pinus radiata breeding programme , 2014, Theoretical and Applied Genetics.

[39]  I Misztal,et al.  Hot topic: a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. , 2010, Journal of dairy science.

[40]  Robin Thompson,et al.  A Sparse Implementation of the Average Information Algorithm for Factor Analytic and Reduced Rank Variance Models , 2003 .

[41]  Ignacy Misztal,et al.  Single Step, a general approach for genomic selection , 2014 .

[42]  J. Crow,et al.  The other fly room: J. T. Patterson and Texas genetics. , 2001, Genetics.

[43]  H. D. Patterson,et al.  A new class of resolvable incomplete block designs , 1976 .

[44]  Brian R. Cullis,et al.  Accounting for natural and extraneous variation in the analysis of field experiments , 1997 .

[45]  C. T. Guimarães,et al.  Improving accuracies of genomic predictions for drought tolerance in maize by joint modeling of additive and dominance effects in multi-environment trials , 2018, Heredity.

[46]  Jochen C Reif,et al.  Genomic selection in sugar beet breeding populations , 2013, BMC Genetics.

[47]  B. Cullis,et al.  An examination of the efficiency of Australian crop variety evaluation programmes , 2000, The Journal of Agricultural Science.

[48]  José Crossa,et al.  A reaction norm model for genomic selection using high-dimensional genomic and environmental data , 2013, Theoretical and Applied Genetics.

[49]  C. A. Mooers The Agronomic Placement of Varieties 1 , 1921 .

[50]  William G. Cochran,et al.  The analysis of groups of experiments , 1938, The Journal of Agricultural Science.

[51]  Robin Thompson,et al.  A COMPARISON OF ANALYSIS METHODS FOR LATE‐STAGE VARIETY EVALUATION TRIALS , 2010 .

[52]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[53]  F. M. Thomson,et al.  The response to selection of different procedures for the analysis of early generation variety trials , 1992, The Journal of Agricultural Science.

[54]  S. Mansfield,et al.  Climatic drivers of genotype–environment interactions in lodgepole pine based on multi-environment trial data and a factor analytic model of additive covariance , 2018, Canadian Journal of Forest Research.

[55]  H. Buerstmayr,et al.  Genomic assisted selection for enhancing line breeding: merging genomic and phenotypic selection in winter wheat breeding programs with preliminary yield trials , 2016, Theoretical and Applied Genetics.

[56]  J. A. John,et al.  Construction of Row and Column Designs with Contiguous Replicates , 1989 .

[57]  H. Piepho,et al.  The importance of phenotypic data analysis for genomic prediction - a case study comparing different spatial models in rye , 2014, BMC Genomics.

[58]  M. Goddard,et al.  Invited review: Genomic selection in dairy cattle: progress and challenges. , 2009, Journal of dairy science.