A reaction norm model for genomic selection using high-dimensional genomic and environmental data

Key messageNew methods that incorporate the main and interaction effects of high-dimensional markers and of high-dimensional environmental covariates gave increased prediction accuracy of grain yield in wheat across and within environments.AbstractIn most agricultural crops the effects of genes on traits are modulated by environmental conditions, leading to genetic by environmental interaction (G × E). Modern genotyping technologies allow characterizing genomes in great detail and modern information systems can generate large volumes of environmental data. In principle, G × E can be accounted for using interactions between markers and environmental covariates (ECs). However, when genotypic and environmental information is high dimensional, modeling all possible interactions explicitly becomes infeasible. In this article we show how to model interactions between high-dimensional sets of markers and ECs using covariance functions. The model presented here consists of (random) reaction norm where the genetic and environmental gradients are described as linear functions of markers and of ECs, respectively. We assessed the proposed method using data from Arvalis, consisting of 139 wheat lines genotyped with 2,395 SNPs and evaluated for grain yield over 8 years and various locations within northern France. A total of 68 ECs, defined based on five phases of the phenology of the crop, were used in the analysis. Interaction terms accounted for a sizable proportion (16 %) of the within-environment yield variance, and the prediction accuracy of models including interaction terms was substantially higher (17–34 %) than that of models based on main effects only. Breeding for target environmental conditions has become a central priority of most breeding programs. Methods, like the one presented here, that can capitalize upon the wealth of genomic and environmental information available, will become increasingly important.

[1]  Jose Crossa,et al.  From Genotype × Environment Interaction to Gene × Environment Interaction , 2012, Current genomics.

[2]  José Crossa,et al.  Genomic Prediction of Breeding Values when Modeling Genotype × Environment Interaction using Pedigree and Dense Molecular Markers , 2012 .

[3]  José Crossa,et al.  Genomic‐Enabled Prediction Based on Molecular Markers and Pedigree Using the Bayesian Linear Regression Package in R , 2010, The plant genome.

[4]  M. Calus,et al.  Estimation of environmental sensitivity of genetic merit for milk production traits using a random regression model. , 2003, Journal of dairy science.

[5]  M. Goddard,et al.  Genomic selection. , 2007, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[6]  H. Gauch Model selection and validation for yield trials with interaction , 1988 .

[7]  José Crossa,et al.  A multi-trait multi-environment QTL mixed model with an application to drought and nitrogen stress trials in maize (Zea mays L.) , 2008, Euphytica.

[8]  F. V. van Eeuwijk,et al.  Interpreting genotype × environment interaction in tropical maize using linked molecular markers and environmental covariables , 1999, Theoretical and Applied Genetics.

[9]  Robin Thompson,et al.  The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches , 2005, The Journal of Agricultural Science.

[10]  D Stetten,et al.  The impact of genetics. , 1973, Clinical chemistry.

[11]  M. Lund,et al.  Bayesian analysis of the linear reaction norm model with unknown covariates. , 2006, Journal of animal science.

[12]  Hans-Peter Piepho,et al.  Methods for Comparing the Yield Stability of Cropping Systems , 1998 .

[13]  J. Denis,et al.  Two way analysis using covarites1 , 1988 .

[14]  O. Kempthorne,et al.  The correlation between relatives in a random mating population , 1954, Proceedings of the Royal Society of London. Series B - Biological Sciences.

[15]  José Crossa,et al.  Prediction assessment of linear mixed models for multienvironment trials , 2011 .

[16]  Hsiao-Pei Yang,et al.  Genomic Selection in Plant Breeding: A Comparison of Models , 2012 .

[17]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[18]  R. Frankham Introduction to quantitative genetics (4th edn): by Douglas S. Falconer and Trudy F.C. Mackay Longman, 1996. £24.99 pbk (xv and 464 pages) ISBN 0582 24302 5 , 1996 .

[19]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[20]  Joint analysis of genotypic and environmental effects , 1986, Theoretical and Applied Genetics.

[21]  M. Calus,et al.  Genotype x environment interaction for protein yield in Dutch dairy cattle as quantified by different models. , 2002, Journal of dairy science.

[22]  José Crossa,et al.  Using Factor Analytic Models for Joining Environments and Genotypes without Crossover Genotype × Environment Interaction , 2008 .

[23]  G. de los Campos,et al.  Genomic Selection and Prediction in Plant Breeding , 2011 .

[24]  F. V. van Eeuwijk,et al.  Interpreting Treatment × Environment Interaction in Agronomy Trials , 2001 .

[25]  C. Cockerham,et al.  An Extension of the Concept of Partitioning Hereditary Variance for Analysis of Covariances among Relatives When Epistasis Is Present. , 1954, Genetics.

[26]  F. A. van Eeuwijk,et al.  Multi-trait and multi-environment QTL analyses of yield and a set of physiological traits in pepper , 2013, Theoretical and Applied Genetics.

[27]  José Crossa,et al.  Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree , 2009, Genetics.

[28]  D. Falconer Introduction to quantitative genetics. 1. ed. , 1984 .

[29]  H. F. Gollob,et al.  A statistical model which combines features of factor analytic and analysis of variance techniques , 1968, Psychometrika.

[30]  José Crossa,et al.  Prediction of Genetic Values of Quantitative Traits in Plant Breeding Using Pedigree and Molecular Markers , 2010, Genetics.

[31]  José Crossa,et al.  Studying crossover genotype × environment interaction using linear-bilinear models and mixed models , 2004 .

[32]  K. Sayre,et al.  Using Partial Least Squares Regression, Factorial Regression, and AMMI Models for Interpreting Genotype × Environment Interaction , 1999 .

[33]  William G. Cochran,et al.  The analysis of groups of experiments , 1938, The Journal of Agricultural Science.

[34]  R. Fisher,et al.  STUDIES IN CROP VARIATION , 2009 .

[35]  P. Cornelius,et al.  Sites regression and shifted multiplicative model clustering of cultivar trial sites under heterogeneity of error variances , 1997 .

[36]  José Crossa,et al.  Semi-parametric genomic-enabled prediction of genetic values using reproducing kernel Hilbert spaces methods. , 2010, Genetics research.

[37]  Joshua A. Udall,et al.  Breeding for Quantitative Traits in Plants , 2003 .

[38]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[39]  J. Mandel Partitioning of interaction in analysis of variance , 1969 .

[40]  Pierre Sourdille,et al.  Insertion site-based polymorphism markers open new perspectives for genome saturation and marker-assisted selection in wheat. , 2010, Plant biotechnology journal.

[41]  Robin Thompson,et al.  Exploring variety - environment data using random effects AMMI models with adjustments for spatial field trend: Part 2. Applications , 2001 .

[42]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[43]  Manjit S. Kang,et al.  Incorporating Additional Information on Genotypes and Environments in Models for Two-way Genotype by Environment Tables , 1996 .

[44]  F. V. van Eeuwijk,et al.  A Mixed-Model Quantitative Trait Loci (QTL) Analysis for Multiple-Environment Trial Data Using Environmental Covariables for QTL-by-Environment Interactions, With an Example in Maize , 2007, Genetics.

[45]  P. Cornelius,et al.  Statistical Tests and Estimators of Multiplicative Models for Genotype-by-Environment Interaction , 1996 .

[46]  R. Woltereck,et al.  Weitere experimentelle untersuchungen über Artänderung, speziell über das Wesen quantitativer Artunterschiede bei Daphniden , 1913, Zeitschrift für induktive Abstammungs- und Vererbungslehre.

[47]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[48]  C. Juday Weitere experimentelle Untersuchungen über Artveränderung, speziel über das Wesen quantitativer Artuntershiede bei Daphniden , 1910 .

[49]  Jose Crossa,et al.  Mapping QTLs and QTL × environment interaction for CIMMYT maize drought stress program using factorial regression and partial least squares methods , 2006, Theoretical and Applied Genetics.

[50]  F. V. van Eeuwijk,et al.  Mixed models including environmental covariables for studying QTL by environment interaction , 2004, Euphytica.

[51]  P. Cornelius,et al.  Modeling genotype × environment interaction using additive genetic covariances of relatives for predicting breeding values of wheat genotypes , 2006 .