Accounting for Genetic Architecture Improves Sequence Based Genomic Prediction for a Drosophila Fitness Trait

The ability to predict quantitative trait phenotypes from molecular polymorphism data will revolutionize evolutionary biology, medicine and human biology, and animal and plant breeding. Efforts to map quantitative trait loci have yielded novel insights into the biology of quantitative traits, but combining individually significant quantitative trait loci typically gives low predictive ability. Utilizing all segregating variants can give good predictive ability in plant and animal breeding populations, but gives little insight into trait biology. Here, we used the Drosophila Genetic Reference Panel to perform both a genome-wide association analysis and genomic prediction for the fitness-related trait chill coma recovery time. We found substantial total genetic variation for chill coma recovery time, with a genetic architecture that differs between males and females, a small number of molecular variants with large main effects, and evidence for epistasis. Although the top additive variants explained 36% of the genetic variance among lines in females and 17% in males, the predictive ability of genomic best linear unbiased prediction with a relationship matrix built from all common segregating variants was very low for females and zero for males. We hypothesized that the low predictive ability was due to the mismatch between the infinitesimal genetic architecture assumed by the genomic best linear unbiased prediction model and the true genetic architecture of chill coma recovery time. Indeed, we found that the predictive ability of the genomic best linear unbiased prediction model is markedly improved when we combine quantitative trait locus mapping with genomic prediction by including only the top variants associated with main and epistatic effects in the relationship matrix. This trait-associated prediction approach has the advantage that it yields biologically interpretable prediction models.
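The contrast between a relationship matrix built from all variants and one built only from trait-associated variants can be illustrated with a minimal sketch on simulated data. Everything here is an assumption made for illustration and is not taken from the study: the function names (`genomic_relationship`, `top_variants`, `gblup_predict`, `demo`), the VanRaden-style scaling of the relationship matrix, the fixed heritability `h2`, the use of the training mean as the only fixed effect, the correlation-based additive variant ranking, and the simulated 0/2 genotypes for inbred lines. Epistatic effects are not modelled. Variants are ranked within the training lines only, so the held-out lines do not influence the selection.

```python
import numpy as np

def genomic_relationship(M):
    """Relationship matrix from an (n_lines x n_variants) matrix of 0/2 codes.

    Assumed VanRaden-style scaling: centre each variant by twice its allele
    frequency and divide by 2 * sum(p * (1 - p)).
    """
    p = M.mean(axis=0) / 2.0
    W = M - 2.0 * p
    return W @ W.T / (2.0 * np.sum(p * (1.0 - p)))

def top_variants(M, y, train, k):
    """Indices of the k variants most associated with y in the training lines."""
    Mt = M[train] - M[train].mean(axis=0)
    yt = y[train] - y[train].mean()
    # Absolute covariance scaled by the variant norm; proportional to |correlation|.
    score = np.abs(Mt.T @ yt) / (np.sqrt((Mt ** 2).sum(axis=0)) + 1e-12)
    return np.argsort(score)[-k:]

def gblup_predict(G, y, train, test, h2=0.5):
    """Predict phenotypes of test lines with a GBLUP-type mixed model.

    h2 is an assumed heritability; lam is the residual-to-genetic variance ratio.
    The training mean stands in for the fixed effects (a simplification).
    """
    lam = (1.0 - h2) / h2
    mu = y[train].mean()
    G_tt = G[np.ix_(train, train)]
    G_vt = G[np.ix_(test, train)]
    alpha = np.linalg.solve(G_tt + lam * np.eye(len(train)), y[train] - mu)
    return mu + G_vt @ alpha

def demo(seed=0, n=200, m=2000, n_causal=10, k=20):
    """Simulate inbred lines with a few large-effect variants and compare models."""
    rng = np.random.default_rng(seed)
    p = rng.uniform(0.05, 0.5, size=m)
    M = 2.0 * rng.binomial(1, p, size=(n, m))   # homozygous lines: codes 0 or 2
    effects = np.zeros(m)
    effects[:n_causal] = rng.normal(0.0, 1.0, n_causal)
    g = M @ effects
    y = g + rng.normal(0.0, g.std(), n)         # roughly half the variance is genetic
    idx = rng.permutation(n)
    train, test = idx[: int(0.8 * n)], idx[int(0.8 * n):]

    G_all = genomic_relationship(M)
    G_top = genomic_relationship(M[:, top_variants(M, y, train, k)])
    pred_all = gblup_predict(G_all, y, train, test)
    pred_top = gblup_predict(G_top, y, train, test)
    r_all = np.corrcoef(pred_all, y[test])[0, 1]
    r_top = np.corrcoef(pred_top, y[test])[0, 1]
    return G_all, pred_all, pred_top, r_all, r_top

if __name__ == "__main__":
    _, _, _, r_all, r_top = demo()
    print(f"predictive correlation, all variants: {r_all:.2f}")
    print(f"predictive correlation, top variants: {r_top:.2f}")
```

In practice the comparison would be repeated over cross-validation folds, with variant selection redone inside each fold, and the variance components would be estimated rather than fixed.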
