Marker-Based Estimation of Heritability in Immortal Populations

Heritability is a central parameter in quantitative genetics, from both an evolutionary and a breeding perspective. For plant traits heritability is traditionally estimated by comparing within- and between-genotype variability. This approach estimates broad-sense heritability and does not account for different genetic relatedness. With the availability of high-density markers there is growing interest in marker-based estimates of narrow-sense heritability, using mixed models in which genetic relatedness is estimated from genetic markers. Such estimates have received much attention in human genetics but are rarely reported for plant traits. A major obstacle is that current methodology and software assume a single phenotypic value per genotype, hence requiring genotypic means. An alternative that we propose here is to use mixed models at the individual plant or plot level. Using statistical arguments, simulations, and real data we investigate the feasibility of both approaches and how these affect genomic prediction with the best linear unbiased predictor and genome-wide association studies. Heritability estimates obtained from genotypic means had very large standard errors and were sometimes biologically unrealistic. Mixed models at the individual plant or plot level produced more realistic estimates, and for simulated traits standard errors were up to 13 times smaller. Genomic prediction was also improved by using these mixed models, with up to a 49% increase in accuracy. For genome-wide association studies on simulated traits, the use of individual plant data gave almost no increase in power. The new methodology is applicable to any complex trait where multiple replicates of individual genotypes can be scored. This includes important agronomic crops, as well as bacteria and fungi.

[1]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[2]  Mark H. Wright,et al.  Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa , 2011, Nature communications.

[3]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[4]  Shashaank Vattikuti,et al.  Heritability and Genetic Correlations Explained by Common SNPs for Metabolic Syndrome Traits , 2012, PLoS genetics.

[5]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[6]  Naomi R. Wray,et al.  Estimating Effects and Making Predictions from Genome-Wide Marker Data , 2010, 1010.4710.

[7]  Brian L Browning,et al.  Population structure can inflate SNP-based heritability estimates. , 2011, American journal of human genetics.

[8]  G. Robinson That BLUP is a Good Thing: The Estimation of Random Effects , 1991 .

[9]  B. Cullis,et al.  Joint modeling of additive and non-additive (genetic line) effects in multi-environment trials , 2007, Theoretical and Applied Genetics.

[10]  H. Piepho,et al.  Comparison of Weighting in Two‐Stage Analysis of Plant Breeding Trials , 2009 .

[11]  Robin Thompson,et al.  Analyzing Variety by Environment Data Using Multiplicative Mixed Models and Adjustments for Spatial Field Trend , 2001, Biometrics.

[12]  Joost J. B. Keurentjes,et al.  Genetic resources for quantitative trait analysis: novelty and efficiency in design from an Arabidopsis perspective. , 2014, Current opinion in plant biology.

[13]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.

[14]  M. Stitt,et al.  Genomic and metabolic prediction of complex heterotic traits in hybrid maize , 2012, Nature Genetics.

[15]  Robin Thompson,et al.  Average information REML: An efficient algorithm for variance parameter estimation in linear mixed models , 1995 .

[16]  Helena Oakey,et al.  Joint modeling of additive and non-additive genetic line effects in single field trials , 2006, Theoretical and Applied Genetics.

[17]  C. R. Henderson,et al.  Best linear unbiased estimation and prediction under a selection model. , 1975, Biometrics.

[18]  Peter M. Visscher,et al.  A General Unified Framework to Assess the Sampling Variance of Heritability Estimates Using Pedigree or Marker-Based Relationships , 2014, Genetics.

[19]  Karl J. Friston,et al.  Variance Components , 2003 .

[20]  Justin O Borevitz,et al.  Genome-wide association studies in plants: the missing heritability is in the field , 2011, Genome Biology.

[21]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[22]  W. G. Hill,et al.  Heritability in the genomics era — concepts and misconceptions , 2008, Nature Reviews Genetics.

[23]  H. Piepho,et al.  A comparison of experimental designs for selection in breeding trials with nested treatment structure , 2006, Theoretical and Applied Genetics.

[24]  Jason H. Moore,et al.  Missing heritability and strategies for finding the underlying causes of complex disease , 2010, Nature Reviews Genetics.

[25]  W. Ewens Genetics and analysis of quantitative traits , 1999 .

[26]  R. N. Kackar,et al.  Approximations for Standard Errors of Estimators of Fixed and Random Effects in Mixed Linear Models , 1984 .

[27]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[28]  Edward S. Buckler,et al.  TASSEL: software for association mapping of complex traits in diverse samples , 2007, Bioinform..

[29]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[30]  H. Piepho,et al.  On the use of multiple lattice designs and α-designs in plant breeding trials , 2006 .

[31]  R. J. Baker,et al.  Estimation of heritability and prediction of selection response in plant populations , 1991 .

[32]  Sue J. Welham,et al.  An efficient computing strategy for prediction in mixed linear models , 2004, Comput. Stat. Data Anal..

[33]  Mikko J Sillanpää,et al.  On statistical methods for estimating heritability in wild populations , 2011, Molecular ecology.

[34]  S. Wright,et al.  The Relative Importance of Heredity and Environment in Determining the Piebald Pattern of Guinea-Pigs. , 1920, Proceedings of the National Academy of Sciences of the United States of America.

[35]  J. Cheverud Genetics and analysis of quantitative traits , 1999 .

[36]  Jeffrey B. Endelman,et al.  Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP , 2011 .

[37]  Hans-Peter Piepho,et al.  A stage‐wise approach for the analysis of multi‐environment trials , 2012, Biometrical journal. Biometrische Zeitschrift.

[38]  F. Rousset,et al.  Inbreeding and relatedness coefficients: what do they measure? , 2002, Heredity.

[39]  A. Carriquiry,et al.  Parametric and Nonparametric Statistical Methods for Genomic Selection of Traits with Additive and Epistatic Genetic Architectures , 2014, G3: Genes, Genomes, Genetics.

[40]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[41]  A. Jacquard,et al.  Heritability: one word, three concepts. , 1983, Biometrics.

[42]  E. Lander,et al.  The mystery of missing heritability: Genetic interactions create phantom heritability , 2012, Proceedings of the National Academy of Sciences.

[43]  Broome,et al.  Literature cited , 1924, A Guide to the Carnivores of Central America.

[44]  Dennis C. Friedrich,et al.  A scalable, fully automated process for construction of sequence-ready human exome targeted capture libraries , 2011, Genome Biology.

[45]  Ujjwal Kumar,et al.  Comparative genomics of the pathogenic ciliate Ichthyophthirius multifiliis, its free-living relatives and a host species provide insights into adoption of a parasitic lifestyle and prospects for disease control , 2011, Genome Biology.

[46]  F. V. van Eeuwijk,et al.  Detection and use of QTL for complex traits in multiple environments. , 2010, Current opinion in plant biology.

[47]  D. Roby,et al.  Investigation of the geographical scale of adaptive phenological variation and its underlying genetics in Arabidopsis thaliana , 2013, Molecular ecology.

[48]  Manuel A. R. Ferreira,et al.  Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings , 2006, PLoS genetics.

[49]  M. Kenward,et al.  Small sample inference for fixed effects from restricted maximum likelihood. , 1997, Biometrics.

[50]  Brian R. Cullis,et al.  Prediction in linear mixed models , 2004 .

[51]  Hans-Peter Piepho,et al.  Comparison of Mixed-Model Approaches for Association Mapping , 2008, Genetics.

[52]  Kermit Ritland,et al.  A MARKER‐BASED METHOD FOR INFERENCES ABOUT QUANTITATIVE INHERITANCE IN NATURAL POPULATIONS , 1996, Evolution; international journal of organic evolution.

[53]  F. V. van Eeuwijk,et al.  A Mixed-Model Quantitative Trait Loci (QTL) Analysis for Multiple-Environment Trial Data Using Environmental Covariables for QTL-by-Environment Interactions, With an Example in Maize , 2007, Genetics.

[54]  Jean-Marcel Ribaut,et al.  The statistical analysis of multi-environment data: modeling genotype-by-environment interaction and its genetic basis , 2013, Front. Physiol..

[55]  Robin Thompson,et al.  The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches , 2005, The Journal of Agricultural Science.

[56]  I. Hellmann,et al.  Massive genomic variation and strong selection in Arabidopsis thaliana lines from Sweden , 2013, Nature Genetics.

[57]  M. Hufford,et al.  Historical genomics of North American maize , 2012, Proceedings of the National Academy of Sciences.

[58]  S. Ceccarelli,et al.  Estimation of heritability from varietal trials data , 1993, Theoretical and Applied Genetics.

[59]  Doug Speed,et al.  Improved heritability estimation from genome-wide SNPs. , 2012, American journal of human genetics.

[60]  P. Visscher,et al.  Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.

[61]  D. Gianola,et al.  Reproducing Kernel Hilbert Spaces Regression Methods for Genomic Assisted Prediction of Quantitative Traits , 2008, Genetics.

[62]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[63]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[64]  Bjarni J. Vilhjálmsson,et al.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines , 2010 .

[65]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[66]  J. Keurentjes,et al.  Development of a Near-Isogenic Line Population of Arabidopsis thaliana and Comparison of Mapping Power With a Recombinant Inbred Line Population , 2007, Genetics.

[67]  Brian R. Cullis,et al.  The analysis of QTL by simultaneous use of the full linkage map , 2007, Theoretical and Applied Genetics.

[68]  A. Auton,et al.  Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel , 2011, Nature Genetics.

[69]  Meng Li,et al.  Genetics and population analysis Advance Access publication July 13, 2012 , 2012 .

[70]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[71]  Robin Thompson,et al.  A COMPARISON OF ANALYSIS METHODS FOR LATE‐STAGE VARIETY EVALUATION TRIALS , 2010 .

[72]  Chris-Carolin Schön,et al.  synbreed: a framework for the analysis of genomic prediction data using R , 2012, Bioinform..

[73]  P. Visscher,et al.  Increased accuracy of artificial selection by using the realized relationship matrix. , 2009, Genetics research.

[74]  Rongwei Fu,et al.  Exact moment calculations for genetic models with migration, mutation, and drift. , 2003, Theoretical population biology.

[75]  Joy Bergelson,et al.  Association mapping of local climate-sensitive quantitative trait loci in Arabidopsis thaliana , 2010, Proceedings of the National Academy of Sciences.

[76]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[77]  J. Holland,et al.  Estimating and Interpreting Heritability for Plant Breeding: An Update , 2010 .