Detection of genome-wide structural variations in the Shanghai Holstein cattle population using next-generation sequencing

Objective The Shanghai Holstein cattle breed is susceptible to severe mastitis and other diseases due to the hot weather and long-term humidity in Shanghai, which is the main distribution centre for providing Holstein semen to various farms throughout China. Our objective was to determine the genetic mechanisms influencing economically important traits, especially diseases that have huge impact on the yield and quality of milk as well as reproduction. Methods In our study, we detected the structural variations of 1,092 Shanghai Holstein cows by using next-generation sequencing. We used the DELLY software to identify deletions and insertions, cn.MOPS to identify copy-number variants (CNVs). Furthermore, we annotated these structural variations using different bioinformatics tools, such as gene ontology, cattle quantitative trait locus (QTL) database and ingenuity pathway analysis (IPA). Results The average number of high-quality reads was 3,046,279. After filtering, a total of 16,831 deletions, 12,735 insertions and 490 CNVs were identified. The annotation results showed that these mapped genes were significantly enriched for specific biological functions, such as disease and reproduction. In addition, the enrichment results based on the cattle QTL database showed that the number of variants related to milk and reproduction was higher than the number of variants related to other traits. IPA core analysis found that the structural variations were related to reproduction, lipid metabolism, and inflammation. According to the functional analysis, structural variations were important factors affecting the variation of different traits in Shanghai Holstein cattle. Our results provide meaningful information about structural variations, which may be useful in future assessments of the associations between variations and important phenotypes in Shanghai Holstein cattle. Conclusion Structural variations identified in this study were extremely different from those of previous studies. Many structural variations were found to be associated with mastitis and reproductive system diseases; these results are in accordance with the characteristics of the environment that Shanghai Holstein cattle experience.

[1]  N. Zöllner Purine and pyrimidine metabolism. , 1982, The Proceedings of the Nutrition Society.

[2]  J. Drackley,et al.  Production, digestion, and hepatic lipid metabolism of dairy cows fed increased energy from fat or concentrate. , 1996, Journal of dairy science.

[3]  Y. Schukken,et al.  Preinfection in vitro chemotaxis, phagocytosis, oxidative burst, and expression of CD11/CD18 receptors and their predictive capacity on the outcome of mastitis induced in dairy cows with Escherichia coli. , 1997, Journal of dairy science.

[4]  R. Meidan,et al.  Impaired reproduction in heat-stressed cattle: basic and applied aspects. , 2000, Animal reproduction science.

[5]  G. K. Gaur,et al.  The Gir cattle breed of India - characteristics and present status , 2003 .

[6]  L. Feuk,et al.  Structural variation in the human genome , 2006, Nature Reviews Genetics.

[7]  S. Mccarroll,et al.  Copy-number variation and association studies of human disease , 2007, Nature Genetics.

[8]  A. Shelling,et al.  Genetic variation in human disease and a new role for copy number variants. , 2007, Mutation research.

[9]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[10]  H. Inui,et al.  RanBP10 acts as a novel coactivator for the androgen receptor. , 2008, Biochemical and biophysical research communications.

[11]  Paul Medvedev,et al.  Computational methods for discovering structural variation with next-generation sequencing , 2009, Nature Methods.

[12]  C. Arce,et al.  Biological pathway analysis by ArrayUnlock and Ingenuity Pathway Analysis , 2009, BMC proceedings.

[13]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[14]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[15]  R. Wilson,et al.  BreakDancer: An algorithm for high resolution mapping of genomic structural variation , 2009, Nature Methods.

[16]  Kai Ye,et al.  Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads , 2009, Bioinform..

[17]  M. Goddard,et al.  A validated genome-wide association study in 2 dairy cattle breeds for milk production and fertility traits using variable length haplotypes. , 2010, Journal of dairy science.

[18]  P. Stankiewicz,et al.  Structural variation in the human genome and its role in disease. , 2010, Annual review of medicine.

[19]  R. Schnabel,et al.  A Novel Mutation in the Maternally Imprinted PEG3 Domain Results in a Loss of MIMT1 Expression and Causes Abortions and Stillbirths in Cattle (Bos taurus) , 2010, PloS one.

[20]  J. Lupski,et al.  Structural variation of the human genome: mechanisms, assays, and role in male infertility , 2011, Systems biology in reproductive medicine.

[21]  P. VanRaden,et al.  Genomic characteristics of cattle copy number variations , 2011, BMC Genomics.

[22]  Jun Yang,et al.  Transcriptome profiling of low temperature-treated cassava apical shoots showed dynamic responses of tropical plant to cold stress , 2012, BMC Genomics.

[23]  I. Misztal,et al.  Genetic effects of heat stress on milk yield of Thai Holstein crossbreds. , 2011, Journal of dairy science.

[24]  Bradley P. Coe,et al.  Genome structural variation discovery and genotyping , 2011, Nature Reviews Genetics.

[25]  M. Gerstein,et al.  CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. , 2011, Genome research.

[26]  A. Berman Invited review: Are adaptations present to support dairy cattle productivity in warm climates? , 2011, Journal of dairy science.

[27]  Thomas Zichner,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[28]  Xiangdong Ding,et al.  Genome-wide detection of copy number variations using high-density SNP genotyping platforms in Holsteins , 2013, BMC Genomics.

[29]  S. Hochreiter,et al.  cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate , 2012, Nucleic acids research.

[30]  Jin Zhang,et al.  An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data , 2012, BMC Bioinformatics.

[31]  B. Crespi,et al.  Association testing of copy number variants in schizophrenia and autism spectrum disorders , 2012, Journal of Neurodevelopmental Disorders.

[32]  Bronwen L. Aken,et al.  Analyses of pig genomes provide insight into porcine demography and evolution , 2012, Nature.

[33]  Xiangdong Ding,et al.  A genome-wide detection of copy number variations using SNP genotyping arrays in swine , 2012, BMC Genomics.

[34]  Heebal Kim,et al.  Deleted copy number variation of Hanwoo and Holstein using next generation sequencing at the population level , 2014, BMC Genomics.

[35]  Qingguo Wang,et al.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives , 2013, BMC Bioinformatics.

[36]  Yuan Zhang,et al.  Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars , 2013, Nature Genetics.

[37]  Zhiwu Zhang,et al.  Genotyping by Genome Reducing and Sequencing for Outbred Animals , 2013, PloS one.

[38]  D. Kenny,et al.  Global gene expression in endometrium of high and low fertility heifers during the mid-luteal phase of the estrous cycle , 2014, BMC Genomics.

[39]  C. Morton,et al.  Mechanisms for Structural Variation in the Human Genome , 2013, Current Genetic Medicine Reports.

[40]  Yu-ping Wang,et al.  Comparative Studies of Copy Number Variation Detection Methods for Next-Generation Sequencing Technologies , 2013, PloS one.

[41]  Q. Xiong,et al.  Association analysis of HSP70A1A haplotypes with heat tolerance in Chinese Holstein cattle , 2013, Cell Stress and Chaperones.

[42]  Christina A. Castellani,et al.  Biological relevance of CNV calling methods using familial relatedness including monozygotic twins , 2014, BMC Bioinformatics.

[43]  Jianfeng Liu,et al.  Global copy number analyses by next generation sequencing provide insight into pig genome variation , 2014, BMC Genomics.

[44]  M. Quintela-Baluja,et al.  The Immunology of Mammary Gland of Dairy Ruminants between Healthy and Inflammatory Conditions , 2014, Journal of veterinary medicine.

[45]  R. Veerkamp,et al.  Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle , 2014, Nature Genetics.

[46]  Mark Yandell,et al.  Wham: Identifying Structural Variants of Biological Consequence , 2015, PLoS Comput. Biol..

[47]  G. Cheng,et al.  The Differential Expression of Immune Genes between Water Buffalo and Yellow Cattle Determines Species-Specific Susceptibility to Schistosoma japonicum Infection , 2015, PloS one.

[48]  Mark Gerstein,et al.  MetaSV: an accurate and integrative structural-variant caller for next generation sequencing , 2015, Bioinform..

[49]  G. Block,et al.  Exposure to Bovine Leukemia Virus Is Associated with Breast Cancer: A Case-Control Study , 2015, PloS one.

[50]  F. Samson,et al.  Genome-Wide Study of Structural Variants in Bovine Holstein, Montbéliarde and Normande Dairy Breeds , 2015, PloS one.

[51]  R. Derijk,et al.  Genetic variants in glucocorticoid and mineralocorticoid receptors are associated with concentrations of plasma cortisol, muscle glycogen content, and meat quality traits in male Nellore cattle. , 2015, Domestic animal endocrinology.

[52]  N. Kristensen,et al.  Absorption and intermediary metabolism of purines and pyrimidines in lactating dairy cows , 2015, British Journal of Nutrition.

[53]  K. Ropka-Molik,et al.  Identification of genome-wide selection signatures in the Limousin beef cattle breed. , 2016, Journal of animal breeding and genetics = Zeitschrift fur Tierzuchtung und Zuchtungsbiologie.

[54]  Timothy P. L. Smith,et al.  Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension , 2016, F1000Research.

[55]  Xiaosong Gu,et al.  Ingenuity Pathway Analysis of Gene Expression Profiles in Distal Nerve Stump following Nerve Injury: Insights into Wallerian Degeneration , 2016, Front. Cell. Neurosci..

[56]  Huimin Kang,et al.  Structural Variant Detection by Large-scale Sequencing Reveals New Evolutionary Evidence on Breed Divergence between Chinese and European Pigs , 2016, Scientific Reports.

[57]  Michael A. Black,et al.  SRBreak: A Read-Depth and Split-Read Framework to Identify Breakpoints of Different Events Inside Simple Copy-Number Variable Regions , 2016, Front. Genet..

[58]  MicroRNA expression profiling of porcine mammary epithelial cells after challenge with Escherichia coli in vitro , 2017, BMC Genomics.

[59]  Jing Wang,et al.  CNNdel: Calling Structural Variations on Low Coverage Data Based on Convolutional Neural Networks , 2017, BioMed research international.

[60]  T. Mushiroda,et al.  Pathogen lineage-based genome-wide association study identified CD53 as susceptible locus in tuberculosis , 2017, Journal of Human Genetics.

[61]  X. Zhang,et al.  Genome‐wide genetic variation discovery in Chinese Taihu pig breeds using next generation sequencing , 2016, Animal genetics.

[62]  Jonas Korlach,et al.  Discovery and genotyping of structural variation from long-read haploid genome sequence data , 2017, Genome research.

[63]  M. Collins,et al.  An across-breed genome wide association analysis of susceptibility to paratuberculosis in dairy cattle , 2015, Journal of Dairy Research.

[64]  J. Keele,et al.  Genome‐wide copy number variation in the bovine genome detected using low coverage sequence of popular beef breeds†,‡ , 2017, Animal genetics.