Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.

[1]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[2]  Ryan E. Mills,et al.  Small insertions and deletions (INDELs) in human genomes. , 2010, Human molecular genetics.

[3]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[4]  T. Sonstegard,et al.  Linkage disequilibrium levels in Bos indicus and Bos taurus cattle using medium and high density SNP chip data and different minor allele frequency distributions , 2014 .

[5]  J. Arora,et al.  Y-chromosomal genes affecting male fertility: A review , 2016, Veterinary world.

[6]  Timothy P. L. Smith,et al.  Development and Characterization of a High Density SNP Genotyping Assay for Cattle , 2009, PloS one.

[7]  T. Meitinger,et al.  Whole genome sequencing of a single Bos taurus animal for single nucleotide polymorphism discovery , 2009, Genome Biology.

[8]  Valdis B. Guðmundsdóttir,et al.  The Y-chromosome point mutation rate in humans , 2015, Nature Genetics.

[9]  Kyung-Tai Lee,et al.  Whole-Genome Resequencing Analysis of Hanwoo and Yanbian Cattle to Identify Genome-Wide SNPs and Signatures of Selection , 2015, Molecules and cells.

[10]  Xiangdong Ding,et al.  Targeted resequencing of GWAS loci reveals novel genetic variants for milk production traits , 2014, BMC Genomics.

[11]  S. Oda,et al.  Whole-genome resequencing shows numerous genes with nonsynonymous SNPs in the Japanese native cattle Kuchinoshima-Ushi , 2011, BMC Genomics.

[12]  P. Stothard,et al.  Whole genome sequencing of Gir cattle for identifying polymorphisms and loci under selection. , 2013, Genome.

[13]  Namshin Kim,et al.  Massively parallel sequencing of Chikso (Korean brindle cattle) to discover genome-wide SNPs and InDels , 2013, Molecules and cells.

[14]  M. Nachman,et al.  Estimate of the mutation rate per nucleotide in humans. , 2000, Genetics.

[15]  Paul Stothard,et al.  Whole genome resequencing of black Angus and Holstein cattle for SNP and CNV discovery , 2011, BMC Genomics.

[16]  K. Shi,et al.  Bovine Mammary Gene Expression Profiling during the Onset of Lactation , 2013, PloS one.

[17]  David J Brayden,et al.  Drug delivery systems in domestic animal species. , 2010, Handbook of experimental pharmacology.

[18]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[19]  Seoae Cho,et al.  Comparative Transcriptome Analysis of Adipose Tissues Reveals that ECM-Receptor Interaction Is Involved in the Depot-Specific Adipogenesis in Cattle , 2013, PloS one.

[20]  S. Pant,et al.  Genome-wide association and pathway analysis of feed efficiency in pigs reveal candidate genes and pathways for residual feed intake , 2014, Front. Genet..

[21]  T. Wieland,et al.  Assessment of the genomic variation in a cattle population by re-sequencing of key animals at low to medium coverage , 2013, BMC Genomics.

[22]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[23]  M. Nei,et al.  Extensive Gains and Losses of Olfactory Receptor Genes in Mammalian Evolution , 2007, PloS one.

[24]  Chankyu Park,et al.  Analysis of cattle olfactory subgenome: the first detail study on the characteristics of the complete olfactory receptor repertoire of a ruminant , 2013, BMC Genomics.

[25]  J. Hemmer-Hansen,et al.  Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges , 2011, Molecular ecology resources.

[26]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[27]  Ryan E. Mills,et al.  Natural genetic variation caused by small insertions and deletions in the human genome. , 2011, Genome research.

[28]  J. Fink-Gremmels,et al.  Implications of ABC transporters on the disposition of typical veterinary medicinal products. , 2008, European journal of pharmacology.

[29]  Sequencing and annotated analysis of full genome of Holstein breed bull , 2014, Mammalian Genome.

[30]  R. Veerkamp,et al.  Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle , 2014, Nature Genetics.

[31]  C. Bendixen,et al.  Deep sequencing of Danish Holstein dairy cattle for variant detection and insight into potential loss-of-function variants in protein coding genes , 2015, BMC Genomics.

[32]  K. Worley,et al.  The Genome Sequence of Taurine Cattle: A Window to Ruminant Biology and Evolution , 2009, Science.

[33]  Namshin Kim,et al.  Whole-Genome Analyses of Korean Native and Holstein Cattle Breeds by Massively Parallel Sequencing , 2014, PloS one.

[34]  S. Sano,et al.  Abundant sequence divergence in the native Japanese cattle Mishima-Ushi (Bos taurus) detected using whole-genome sequencing. , 2013, Genomics.

[35]  Maria Raquel Santos Carvalho,et al.  Programa Nacional de Melhoramento do Guzerá para Leite: resultados do Teste de Progênie, do Programa de Melhoramento Genético de Zebuínos da ABCZ e do Núcleo MOET. , 2017 .

[36]  J. Lenstra,et al.  Revisiting AFLP fingerprinting for an unbiased assessment of genetic structure and differentiation of taurine and zebu cattle , 2014, BMC Genetics.

[37]  Minghong Ma,et al.  Encoding Olfactory Signals via Multiple Chemosensory Systems , 2007, Critical reviews in biochemistry and molecular biology.

[38]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[39]  J. Ferraz,et al.  Production systems--an example from Brazil. , 2010, Meat science.

[40]  J. Hedegaard,et al.  Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping , 2011, BMC Genomics.