Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data

The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark–Finland–Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk.

[1]  Zhe Zhang,et al.  Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix , 2010, PloS one.

[2]  Guosheng Su,et al.  Genome position specific priors for genomic prediction , 2012, BMC Genomics.

[3]  Pauline C Ng,et al.  Whole genome sequencing. , 2010, Methods in molecular biology.

[4]  B. Guldbrandtsen,et al.  Genome-wide association studies of growth traits in three dairy cattle breeds using whole-genome sequence data. , 2016, Journal of animal science.

[5]  P. Ma,et al.  Comparison of genomic predictions using medium-density (∼54,000) and high-density (∼777,000) single nucleotide polymorphism marker panels in Nordic Holstein and Red Dairy Cattle populations. , 2012, Journal of dairy science.

[6]  F. Schenkel,et al.  A new approach for efficient genotype imputation using information from relatives , 2014, BMC Genomics.

[7]  I. M. MacLeod,et al.  Exploiting biological priors and sequence variants enhances QTL discovery and genomic prediction of complex traits , 2016, BMC Genomics.

[8]  J. Woolliams,et al.  Within- and across-breed genomic prediction using whole-genome sequence and single nucleotide polymorphism panels , 2016, Genetics Selection Evolution.

[9]  H. Redkey,et al.  A new approach. , 1967, Rehabilitation record.

[10]  W. G. Hill,et al.  Genome partitioning of genetic variation for height from 11,214 sibling pairs. , 2007, American journal of human genetics.

[11]  A. C. Sørensen,et al.  The value of cows in reference populations for genomic selection of new functional traits. , 2012, Animal : an international journal of animal bioscience.

[12]  Guosheng Su,et al.  Genomic evaluation of cattle in a multi-breed context ☆ , 2014 .

[13]  F. Samson,et al.  Genome-Wide Study of Structural Variants in Bovine Holstein, Montbéliarde and Normande Dairy Breeds , 2015, PloS one.

[14]  Tingting Wang,et al.  Multi-breed genomic prediction using Bayes R with sequence data and dropping variants with a small effect , 2017, Genetics Selection Evolution.

[15]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[16]  M. Pérez-Enciso,et al.  Sequence- vs. chip-assisted genomic selection: accurate biological information is advised , 2015, Genetics Selection Evolution.

[17]  M. Lund,et al.  Genomic prediction for Nordic Red Cattle using one-step and selection index blending. , 2012, Journal of dairy science.

[18]  K. Weigel,et al.  International genetic evaluations for feed intake in dairy cattle through the collation of data from multiple sources. , 2013, Journal of dairy science.

[19]  Guosheng Su,et al.  A common reference population from four European Holstein populations increases reliability of genomic predictions , 2011, Genetics Selection Evolution.

[20]  M. Goddard,et al.  Invited review: Genomic selection in dairy cattle: progress and challenges. , 2009, Journal of dairy science.

[21]  P M VanRaden,et al.  Genomic imputation and evaluation using high-density Holstein genotypes. , 2013, Journal of dairy science.

[22]  Dorian J. Garrick,et al.  An Upper Bound for Accuracy of Prediction Using GBLUP , 2016, PloS one.

[23]  Tad S. Sonstegard,et al.  Design of a Bovine Low-Density SNP Array Optimized for Imputation , 2012, PloS one.

[24]  M. Lund,et al.  Sequence variants selected from a multi-breed GWAS can improve the reliability of genomic predictions in dairy cattle , 2016, Genetics Selection Evolution.

[25]  Ignacy Misztal,et al.  Incorporation of causative quantitative trait nucleotides in single-step GBLUP , 2017, Genetics Selection Evolution.

[26]  M. Lund,et al.  Model comparison on genomic predictions using high-density markers for different groups of bulls in the Nordic Holstein population. , 2013, Journal of dairy science.

[27]  P. Ma,et al.  Sharing reference data and including cows in the reference population improve genomic predictions in Danish Jersey. , 2016, Animal : an international journal of animal bioscience.

[28]  Hans D. Daetwyler,et al.  Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach , 2008, PloS one.

[29]  Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle , 2015, Genetics Selection Evolution.

[30]  B. Guldbrandtsen,et al.  Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study , 2016, G3: Genes, Genomes, Genetics.

[31]  B. Guldbrandtsen,et al.  Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. , 2015, Journal of dairy science.

[32]  R. Veerkamp,et al.  Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle , 2014, Nature Genetics.

[33]  A reverse genetic approach identifies an ancestral frameshift mutation in RP1 causing recessive progressive retinal degeneration in European cattle breeds , 2016, Genetics Selection Evolution.

[34]  M. Lund,et al.  Comparison of genomic predictions using genomic relationship matrices built with different weighting factors to account for locus-specific variances. , 2014, Journal of dairy science.

[35]  P. VanRaden,et al.  Short communication: Improving accuracy of Jersey genomic evaluations in the United States and Denmark by sharing reference population bulls. , 2015, Journal of dairy science.

[36]  M Erbe,et al.  Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. , 2012, Journal of dairy science.

[37]  M. Goddard,et al.  Linkage Disequilibrium and Persistence of Phase in Holstein–Friesian, Jersey and Angus Cattle , 2008, Genetics.

[38]  P. Ma,et al.  Review: How to improve genomic predictions in small dairy cattle populations. , 2016, Animal : an international journal of animal bioscience.

[39]  B. Guldbrandtsen,et al.  Experience from large scale use of the EuroGenomics custom SNP chip in cattle , 2018 .

[40]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[41]  Guosheng Su,et al.  Genome-wide Association Studies for Female Fertility Traits in Chinese and Nordic Holsteins , 2017, Scientific Reports.

[42]  K. Weigel,et al.  Machine learning classification procedure for selecting SNPs in genomic selection: application to early mortality in broilers. , 2007, Developments in biologicals.