An Upper Bound for Accuracy of Prediction Using GBLUP

This study aims at characterizing the asymptotic behavior of genomic prediction R2 as the size of the reference population increases for common or rare QTL alleles through simulations. Haplotypes derived from whole-genome sequence of 85 Caucasian individuals from the 1,000 Genomes Project were used to simulate random mating in a population of 10,000 individuals for at least 100 generations to create the LD structure in humans for a large number of individuals. To reduce computational demands, only SNPs within a 0.1M region of each of the first 5 chromosomes were used in simulations, and therefore, the total genome length simulated was 0.5M. When the genome length is 30M, to get the same genomic prediction R2 as with a 0.5M genome would require a reference population 60 fold larger. Three scenarios were considered varying in minor allele frequency distributions of markers and QTL, for h2 = 0.8 resembling height in humans. Total number of markers was 4,200 and QTL were 70 for each scenario. In this study, we considered the prediction accuracy in terms of an estimability problem, and thereby provided an upper bound for reliability of prediction, and thus, for prediction R2. Genomic prediction methods GBLUP, BayesB and BayesC were compared. Our results imply that for human height variable selection methods BayesB and BayesC applied to a 30M genome have no advantage over GBLUP when the size of reference population was small (<6,000 individuals), but are superior as more individuals are included in the reference population. All methods become asymptotically equivalent in terms of prediction R2, which approaches genomic heritability when the size of the reference population reaches 480,000 individuals.

[1]  Jean-Luc Jannink,et al.  Genomic selection in plant breeding: from theory to practice. , 2010, Briefings in functional genomics.

[2]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[3]  M. Calus,et al.  Whole-Genome Regression and Prediction Methods Applied to Plant and Animal Breeding , 2013, Genetics.

[4]  Zhe Zhang,et al.  Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix , 2010, PloS one.

[5]  B. J. Hayes,et al.  Genomic selection: Genomic selection , 2007 .

[6]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[7]  M. Calus,et al.  Reliability of direct genomic values for animals with different relationships within and to the reference population. , 2012, Journal of dairy science.

[8]  Dorian Garrick,et al.  Bayesian methods applied to GWAS. , 2013, Methods in molecular biology.

[9]  J. Woolliams,et al.  The Accuracy of Genomic Selection in Norwegian Red Cattle Assessed by Cross-Validation , 2009, Genetics.

[10]  N. Takahata,et al.  Allelic genealogy and human evolution. , 1993, Molecular biology and evolution.

[11]  M P L Calus,et al.  Genomic breeding value prediction: methods and procedures. , 2010, Animal : an international journal of animal bioscience.

[12]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[13]  D. Garrick,et al.  Technical note: Derivation of equivalent computing algorithms for genomic predictions and reliabilities of animal merit. , 2009, Journal of dairy science.

[14]  Rex D. Fernando,et al.  Analysis of egg production in layer chickens using a random regression model with genomic relationships. , 2013, Poultry science.

[15]  Daniel Gianola,et al.  Using Whole-Genome Sequence Data to Predict Quantitative Trait Phenotypes in Drosophila melanogaster , 2012, PLoS genetics.

[16]  Hans D. Daetwyler,et al.  Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach , 2008, PloS one.

[17]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[18]  Rohan L. Fernando,et al.  Extension of the bayesian alphabet for genomic selection , 2011, BMC Bioinformatics.

[19]  R. Fernando,et al.  The Impact of Genetic Relationship Information on Genome-Assisted Breeding Values , 2007, Genetics.

[20]  H. Daetwyler,et al.  The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes , 2012, Genetics Selection Evolution.

[21]  A. Nejati-Javaremi,et al.  Effect of total allelic relationship on accuracy of evaluation and response to selection. , 1997, Journal of animal science.

[22]  J. Hickey,et al.  Different models of genetic variation and their effect on genomic evaluation , 2011, Genetics Selection Evolution.

[23]  R. Fernando,et al.  Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor , 2013, PLoS genetics.

[24]  C. R. Henderson Applications of linear models in animal breeding , 1984 .

[25]  T. Meuwissen,et al.  Accuracy of breeding values of 'unrelated' individuals predicted by dense SNP genotyping , 2009, Genetics Selection Evolution.

[26]  R. Bernardo,et al.  Prospects for genomewide selection for quantitative traits in maize , 2007 .

[27]  Ben J Hayes,et al.  Accuracy of genomic breeding values in multi-breed dairy cattle populations , 2009, Genetics Selection Evolution.

[28]  D. Allison,et al.  Beyond Missing Heritability: Prediction of Complex Traits , 2011, PLoS genetics.

[29]  P. Lichtner,et al.  The impact of genetic relationship information on genomic breeding values in German Holstein cattle , 2010, Genetics Selection Evolution.

[30]  R. Fernando,et al.  Implementing a QTL detection study (GWAS) using genomic prediction methodology. , 2013, Methods in molecular biology.

[31]  M. Goddard,et al.  Genomic selection. , 2007, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[32]  M. Goddard,et al.  The distribution of the effects of genes affecting quantitative traits in livestock , 2001, Genetics Selection Evolution.

[33]  R. Fernando,et al.  XSim: Simulation of Descendants from Ancestors with Sequence Data , 2015, G3: Genes, Genomes, Genetics.

[34]  Andrés Legarra,et al.  Performance of Genomic Selection in Mice , 2008, Genetics.

[35]  Jeanette J McCarthy,et al.  Genomic Medicine: A Decade of Successes, Challenges, and Opportunities , 2013, Science Translational Medicine.

[36]  R. Varshney,et al.  Genomic Selection for Crop Improvement , 2017, Springer International Publishing.

[37]  J. Woolliams,et al.  The Impact of Genetic Architecture on Genome-Wide Evaluation Methods , 2010, Genetics.

[38]  J. Hess,et al.  Analysis of variance , 2018, Transfusion.

[39]  R. Fernando,et al.  Genomic prediction of simulated multibreed and purebred performance using observed fifty thousand single nucleotide polymorphism genotypes. , 2010, Journal of animal science.

[40]  M. Goddard,et al.  Using the genomic relationship matrix to predict the accuracy of genomic selection. , 2011, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[41]  M. Goddard,et al.  Accurate Prediction of Genetic Values for Complex Traits by Whole-Genome Resequencing , 2010, Genetics.

[42]  Daniel Gianola,et al.  Accuracy of Genome-Enabled Prediction in a Dairy Cattle Population using Different Cross-Validation Layouts , 2012, Front. Gene..

[43]  M. Goddard Genomic selection: prediction of accuracy and maximisation of long term response , 2009, Genetica.