Robust Genomic Modelling Using Expert Knowledge about Additive, Dominance and Epistasis Variation

A major challenge with modelling non-additive genetic variation is that it is hard to separate non-additive variation from additive and environmental variation. In this paper, we describe how to alleviate this issue, and improve genomic modelling of additive and non-additive variation, by leveraging the ample expert knowledge available about the relative magnitude of the sources of phenotypic variation. The method is Bayesian and uses the recently introduced penalized complexity and hierarchical decomposition prior frameworks, where priors can be specified and visualized in an intuitive way, and be used to induce parsimonious modelling. We evaluate the potential impact for plant breeding through a simulated case study of a wheat breeding program. We compare different models and different priors with varying amounts of expert knowledge. The results show that the proposed priors and expert knowledge improved the robustness of the genomic modelling and the selection of the genetically best individuals in the breeding program. We observed this improvement in both variety selection on genetic values and parent selection on additive values, but the variety selection benefited the most. An improvement was not observed in the overall accuracy of estimating genetic values for all individuals and variance components. Finally, we discuss the importance of expert-knowledge priors for genomic modelling and breeding, and point to future research areas of easy-to-use and parsimonious priors in genomic modelling.

[1]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[2]  Yusheng Zhao,et al.  Genome-based establishment of a high-yielding heterotic pattern for hybrid wheat breeding , 2015, Proceedings of the National Academy of Sciences.

[3]  W. G. Hill,et al.  Data and Theory Point to Mainly Additive Genetic Variance for Complex Traits , 2008, PLoS genetics.

[4]  Haavard Rue,et al.  Bayesian Computing with INLA: A Review , 2016, 1604.00860.

[5]  D. Gianola,et al.  Genomic Heritability: What Is It? , 2014, PLoS genetics.

[6]  J. Poland,et al.  Genomic Selection in Preliminary Yield Trials in a Winter Wheat Breeding Program , 2018, G3: Genes, Genomes, Genetics.

[7]  IN BR Ief THE STATE OF FOOD SECURITY AND NUTRITION IN THE WORLD BUILDING RESILIENCE FOR PEACE AND FOOD SECURITY , 2017 .

[8]  J. Reif,et al.  Mapping QTLs with main and epistatic effects underlying grain yield and heading time in soft winter wheat , 2011, Theoretical and Applied Genetics.

[9]  Trevor W. Rife,et al.  A Field‐Based Analysis of Genetic Improvement for Grain Yield in Winter Wheat Cultivars Developed in the US Central Plains from 1992 to 2014 , 2019, Crop Science.

[10]  Helena Oakey,et al.  Joint modeling of additive and non-additive genetic line effects in single field trials , 2006, Theoretical and Applied Genetics.

[11]  H. Rue,et al.  Bayesian bivariate meta‐analysis of diagnostic test studies with interpretable priors , 2015, Statistics in medicine.

[12]  P. Müller,et al.  Determining the Effective Sample Size of a Parametric Prior , 2008, Biometrics.

[13]  K. Meyer "Bending" and beyond: Better estimates of quantitative genetic parameters? , 2019, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[14]  Daniel Gianola,et al.  Predicting genetic predisposition in humans: the promise of whole-genome markers , 2010, Nature Reviews Genetics.

[15]  H. Rue,et al.  Fractional Gaussian noise: Prior specification and model comparison , 2016, 1611.06399.

[16]  W. Ewens Genetics and analysis of quantitative traits , 1999 .

[17]  Erratum to: Contribution of an additive locus to genetic variance when inheritance is multi-factorial with implications on interpretation of GWAS , 2013, Theoretical and Applied Genetics.

[18]  G. de los Campos,et al.  Complex-Trait Prediction in the Era of Big Data. , 2018, Trends in genetics : TIG.

[19]  T. Mackay,et al.  The Genetic Architecture of Quantitative Traits Cannot Be Inferred from Variance Component Analysis , 2016, bioRxiv.

[20]  Jason H. Moore,et al.  Why epistasis is important for tackling complex human disease genetics , 2014, Genome Medicine.

[21]  Thiago G. Martins,et al.  Penalising Model Component Complexity: A Principled, Practical Approach to Constructing Priors , 2014, 1403.4630.

[22]  Ning Gao,et al.  Genomic prediction with epistasis models: on the marker-coding-dependent performance of the extended GBLUP and properties of the categorical epistasis model (CE) , 2017, BMC Bioinformatics.

[23]  P. Shewry,et al.  The contribution of wheat to human diet and health , 2015, Food and energy security.

[24]  C. Ballantine On the Hadamard product , 1968 .

[25]  W. G. Hill,et al.  Expected influence of linkage disequilibrium on genetic variance caused by dominance and epistasis on quantitative traits. , 2015, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[26]  Ky L. Mathews,et al.  Genomic selection in multi-environment plant breeding trials using a factor analytic linear mixed model. , 2019, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[27]  M. Mette,et al.  Relatedness severely impacts accuracy of marker-assisted selection for disease resistance in hybrid wheat , 2013, Heredity.

[28]  J. Woolliams,et al.  Genomic dissection of maternal, additive and non-additive genetic effects for growth and carcass traits in Nile tilapia , 2019, bioRxiv.

[29]  Ignacy Misztal,et al.  Estimation of Variance Components with Large-Scale Dominance Models , 1997 .

[30]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[31]  Rabia Begum A decade of Genome Medicine: toward precision medicine , 2019, Genome Medicine.

[32]  Daniel Gianola,et al.  Kernel-based variance component estimation and whole-genome prediction of pre-corrected phenotypes and progeny tests for dairy cow health traits , 2014, Front. Genet..

[33]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[34]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[35]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[36]  C. Whitelaw,et al.  A Strategy To Exploit Surrogate Sire Technology in Livestock Breeding Programs , 2017, G3: Genes, Genomes, Genetics.

[37]  J. Martini,et al.  On the approximation of interaction effect models by Hadamard powers of the additive genomic relationship. , 2020, Theoretical population biology.

[38]  Gregor Gorjanc,et al.  AlphaSim: Software for Breeding Program Simulation , 2016, The plant genome.

[39]  J. Nocedal Updating Quasi-Newton Matrices With Limited Storage , 1980 .

[40]  I. Steinsland,et al.  Flexible modelling of spatial variation in agricultural field trials with the R package INLA , 2019, Theoretical and Applied Genetics.

[41]  Imperfect Linkage Disequilibrium Generates Phantom Epistasis (& Perils of Big Data) , 2019, G3: Genes, Genomes, Genetics.

[42]  Andrea Riebler,et al.  An intuitive Bayesian spatial model for disease mapping that accounts for scaling , 2016, Statistical methods in medical research.

[43]  L. Varona,et al.  Non-additive Effects in Genomic Selection , 2018, Front. Genet..

[44]  K. Meyer Simple Penalties on Maximum-Likelihood Estimates of Genetic Parameters to Reduce Sampling Variation , 2015, Genetics.

[45]  W. G. Hill,et al.  Dominance genetic variation contributes little to the missing heritability for human complex traits. , 2015, American journal of human genetics.

[46]  Gustavo de Los Campos,et al.  Unraveling Additive from Nonadditive Effects Using Genomic Relationship Matrices , 2014, Genetics.

[47]  Daniel Gianola,et al.  Bayesian Methods in Animal Breeding Theory , 1986 .

[48]  Neil D. Lawrence,et al.  Deep Gaussian Processes , 2012, AISTATS.

[49]  Jin-jie Sun,et al.  Genomic Selection in Wheat , 2019, Applications of Genetic and Genomic Research in Cereals.

[50]  Kevin Fiedler,et al.  Likelihood Bayesian And Mcmc Methods In Quantitative Genetics , 2016 .

[51]  Andres Legarra,et al.  Comparing estimates of genetic variance across different relationship models. , 2016, Theoretical population biology.

[52]  C. R. Henderson Best Linear Unbiased Prediction of Nonadditive Genetic Merits in Noninbred Populations , 1985 .

[53]  D. Gianola,et al.  Contribution of an additive locus to genetic variance when inheritance is multi-factorial with implications on interpretation of GWAS , 2013, TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik.

[54]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[55]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[56]  D. Houle Comparing evolvability and variability of quantitative traits. , 1992, Genetics.

[57]  Daniel Gianola,et al.  Kernel-based whole-genome prediction of complex traits: a review , 2014, Front. Genet..

[58]  Alexander I. Young Solving the missing heritability problem , 2019, PLoS genetics.

[59]  G. Acquaah Principles of plant genetics and breeding , 2006 .

[60]  Haavard Rue,et al.  Intuitive Joint Priors for Variance Parameters , 2019, Bayesian Analysis.

[61]  L. Varona,et al.  Orthogonal Estimates of Variances for Additive, Dominance, and Epistatic Effects in Populations , 2017, Genetics.

[62]  Daniel Gianola,et al.  Additive Genetic Variability and the Bayesian Alphabet , 2009, Genetics.

[63]  N. Reinsch,et al.  Including non-additive genetic effects in Bayesian methods for the prediction of genetic values based on genome-wide markers , 2011, BMC Genetics.

[64]  Charles R. Johnson,et al.  Topics in matrix analysis: The Hadamard product , 1991 .

[65]  A. Bentley,et al.  A Two‐Part Strategy for Using Genomic Selection to Develop Inbred Lines , 2017 .

[66]  Timothy B Sackton,et al.  Genotypic Context and Epistasis in Individuals and Populations , 2016, Cell.

[67]  Jeffrey W. White,et al.  Rising Temperatures Reduce Global Wheat Production , 2015 .

[68]  M. Kirkpatrick,et al.  Penalized maximum likelihood estimates of genetic covariance matrices with shrinkage towards phenotypic dispersion , 2011 .

[69]  D. Cros,et al.  Modeling additive and non-additive effects in a hybrid population using genome-wide genotyping: prediction accuracy implications , 2015, Heredity.

[70]  M. Kirst,et al.  Genomic Prediction of Additive and Non-additive Effects Using Genetic Markers and Pedigrees , 2019, G3: Genes, Genomes, Genetics.

[71]  P. Visscher,et al.  10 Years of GWAS Discovery: Biology, Function, and Translation. , 2017, American journal of human genetics.

[72]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[73]  J. Cheverud Genetics and analysis of quantitative traits , 1999 .

[74]  Robert D. Finn,et al.  The Pfam protein families database: towards a more sustainable future , 2015, Nucleic Acids Res..

[75]  Daniel Gianola,et al.  Inferring genetic values for quantitative traits non-parametrically. , 2008, Genetics research.

[76]  M. Sorrells,et al.  Prediction of Subgenome Additive and Interaction Effects in Allohexaploid Wheat , 2018, G3: Genes, Genomes, Genetics.

[77]  Aki Vehtari,et al.  Hamiltonian Monte Carlo using an adjoint-differentiated Laplace approximation: Bayesian inference for latent Gaussian models and beyond , 2020, NeurIPS.

[78]  F. Schenkel,et al.  Estimation of additive and non-additive genetic effects for fertility and reproduction traits in North American Holstein cattle using genomic information. , 2020, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[79]  J. Garner,et al.  Reanalyses of the historical series of UK variety trials to quantify the contributions of genetic and environmental factors to trends and variability in yield over time , 2010, Theoretical and Applied Genetics.

[80]  W. G. Hill,et al.  Influence of Gene Interaction on Complex Trait Variation with Multilocus Models , 2014, Genetics.

[81]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[82]  J. Foley,et al.  Yield Trends Are Insufficient to Double Global Crop Production by 2050 , 2013, PloS one.

[83]  Michael Betancourt,et al.  Diagnosing Suboptimal Cotangent Disintegrations in Hamiltonian Monte Carlo , 2016, 1604.00695.