Chromosome-level reference genome and alternative splicing atlas of moso bamboo (Phyllostachys edulis)

Abstract Background Bamboo is one of the most important nontimber forestry products worldwide. However, a chromosome-level reference genome is lacking, and an evolutionary view of alternative splicing (AS) in bamboo remains unclear despite emerging omics data and improved technologies. Results Here, we provide a chromosome-level de novo genome assembly of moso bamboo (Phyllostachys edulis) using additional abundance sequencing data and a Hi-C scaffolding strategy. The significantly improved genome is a scaffold N50 of 79.90 Mb, approximately 243 times longer than the previous version. A total of 51,074 high-quality protein-coding loci with intact structures were identified using single-molecule real-time sequencing and manual verification. Moreover, we provide a comprehensive AS profile based on the identification of 266,711 unique AS events in 25,225 AS genes by large-scale transcriptomic sequencing of 26 representative bamboo tissues using both the Illumina and Pacific Biosciences sequencing platforms. Through comparisons with orthologous genes in related plant species, we observed that the AS genes are concentrated among more conserved genes that tend to accumulate higher transcript levels and share less tissue specificity. Furthermore, gene family expansion, abundant AS, and positive selection were identified in crucial genes involved in the lignin biosynthetic pathway of moso bamboo. Conclusions These fundamental studies provide useful information for future in-depth analyses of comparative genome and AS features. Additionally, our results highlight a global perspective of AS during evolution and diversification in bamboo.

[1]  K. Han,et al.  The chromosome-level genome assemblies of two rattans (Calamus simplicifolius and Daemonorops jenkinsiana) , 2018, GigaScience.

[2]  Robert D. Finn,et al.  Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species , 2017, Nucleic Acids Res..

[3]  Huanming Yang,et al.  Announcing the Genome Atlas of Bamboo and Rattan (GABR) project: promoting research in evolution and in economically and ecologically beneficial plants , 2017, GigaScience.

[4]  Neva C. Durand,et al.  De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds , 2016, Science.

[5]  The Gene Ontology Consortium,et al.  Expansion of the Gene Ontology knowledgebase and resources , 2016, Nucleic Acids Res..

[6]  Tyson A. Clark,et al.  Unveiling the complexity of the maize transcriptome by single-molecule long-read sequencing , 2016, Nature Communications.

[7]  C. Peng,et al.  Dynamic allocation and transfer of non-structural carbohydrates, a possible mechanism for the explosive growth of Moso bamboo (Phyllostachys heterocycla) , 2016, Scientific Reports.

[8]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[9]  Bairong Shen,et al.  New genes drive the evolution of gene interaction networks in the human and mouse genomes , 2015, Genome Biology.

[10]  Steven L Salzberg,et al.  HISAT: a fast spliced aligner with low memory requirements , 2015, Nature Methods.

[11]  K. Scholthof,et al.  Genome-Wide Analysis of Alternative Splicing Landscapes Modulated during Plant-Virus Interactions in Brachypodium distachyon , 2015, Plant Cell.

[12]  M. Sammeth,et al.  Analysis of alternative splicing events in custom gene datasets by AStalavista. , 2015, Methods in molecular biology.

[13]  M. Beatty,et al.  Genome-Wide Analysis of Alternative Splicing in Zea mays: Landscape and Genetic Regulation[C][W] , 2014, Plant Cell.

[14]  Qin Li,et al.  Single-nucleotide resolution mapping of the Gossypium raimondii transcriptome reveals a new mechanism for alternative splicing of introns. , 2014, Molecular plant.

[15]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[16]  Zhixi Tian,et al.  Global Dissection of Alternative Splicing in Paleopolyploid Soybean[W] , 2014, Plant Cell.

[17]  Usman Roshan,et al.  Multiple sequence alignment using Probcons and Probalign. , 2014, Methods in molecular biology.

[18]  Peng Cui,et al.  Dynamic regulation of genome-wide pre-mRNA splicing and stress tolerance by the Sm-like protein LSm5 in Arabidopsis , 2014, Genome Biology.

[19]  R. Sun,et al.  Structural Variation of Bamboo Lignin before and after Ethanol Organosolv Pretreatment , 2013, International journal of molecular sciences.

[20]  John W. S. Brown,et al.  Alternative Splicing at the Intersection of Biological Timing, Development, and Stress Responses[OPEN] , 2013, Plant Cell.

[21]  Wenfeng Li,et al.  Genome-Wide Detection of Condition-Sensitive Alternative Splicing in Arabidopsis Roots1[C][W] , 2013, Plant Physiology.

[22]  Ying Lu,et al.  The draft genome of the fast-growing non-timber forest species moso bamboo (Phyllostachys heterocycla) , 2013, Nature Genetics.

[23]  M. Borodovsky,et al.  TrueSight: a new algorithm for splice junction detection using RNA-seq , 2012, Nucleic acids research.

[24]  Michael D. Wilson,et al.  The Evolutionary Landscape of Alternative Splicing in Vertebrate Species , 2012, Science.

[25]  Andreas Heger,et al.  Evidence for conserved post-transcriptional roles of unitary pseudogenes and for frequent bifunctionality of mRNAs , 2012, Genome Biology.

[26]  G. Rätsch,et al.  Polypyrimidine Tract Binding Protein Homologs from Arabidopsis Are Key Regulators of Alternative Splicing with Implications in Fundamental Developmental Processes[W] , 2012, Plant Cell.

[27]  Ramón Doallo,et al.  CircadiOmics: integrating circadian genomics, transcriptomics, proteomics and metabolomics , 2012, Nature Methods.

[28]  Yamile Marquez,et al.  Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis , 2012, Genome research.

[29]  Chris Williams,et al.  RNA-SeQC: RNA-seq metrics for quality control and process optimization , 2012, Bioinform..

[30]  M. Long,et al.  Chromosomal Redistribution of Male-Biased Genes in Mammalian Evolution with Two Bursts of Gene Gain on the X Chromosome , 2010, PLoS biology.

[31]  Peter G Zhang,et al.  Extensive divergence in alternative splicing patterns after gene and genome duplication during the evolutionary history of Arabidopsis. , 2010, Molecular biology and evolution.

[32]  Huanming Yang,et al.  Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. , 2010, Genome research.

[33]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[34]  Xu Li,et al.  The Growth Reduction Associated with Repressed Lignin Biosynthesis in Arabidopsis thaliana Is Independent of Flavonoids[C] , 2010, Plant Cell.

[35]  G. Ast,et al.  Alternative splicing and evolution: diversification, exon definition and function , 2010, Nature Reviews Genetics.

[36]  O. Gascuel,et al.  New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. , 2010, Systematic biology.

[37]  T. Nilsen,et al.  Expansion of the eukaryotic proteome by alternative splicing , 2010, Nature.

[38]  Henry D. Priest,et al.  Genome-wide mapping of alternative splicing in Arabidopsis thaliana. , 2010, Genome research.

[39]  Qi Feng,et al.  Genome-wide characterization of the biggest grass, bamboo, based on 10,608 putative full-length cDNA sequences , 2010, BMC Plant Biology.

[40]  M. Irimia,et al.  Splicing in the eukaryotic ancestor: form, function and dysfunction. , 2009, Trends in ecology & evolution.

[41]  Lex E. Flagel,et al.  Gene duplication and evolutionary novelty in plants. , 2009, The New phytologist.

[42]  Mark W. Denny,et al.  Discovery of Lignin in Seaweed Reveals Convergent Evolution of Cell-Wall Architecture , 2009, Current Biology.

[43]  B. Frey,et al.  Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing , 2008, Nature Genetics.

[44]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[45]  Martin Vingron,et al.  Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration , 2008, Bioinform..

[46]  Gerard Talavera,et al.  Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. , 2007, Systematic biology.

[47]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[48]  Sylvain Foissac,et al.  ASTALAVISTA: dynamic and flexible analysis of alternative splicing events in custom gene datasets , 2007, Nucleic Acids Res..

[49]  A. Reddy Alternative splicing of pre-messenger RNAs in plants in the genomic era. , 2007, Annual review of plant biology.

[50]  R. Martienssen,et al.  Transposable elements and the epigenetic regulation of the genome , 2007, Nature Reviews Genetics.

[51]  G. Ast,et al.  Different levels of alternative splicing among eukaryotes , 2006, Nucleic acids research.

[52]  V. Brendel,et al.  Genomewide comparative analysis of alternative splicing in plants. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Feng Chen,et al.  OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups , 2005, Nucleic Acids Res..

[54]  Jeroen Raes,et al.  Duplication and divergence: the evolution of new genes and old ideas. , 2004, Annual review of genetics.

[55]  C. Ponting,et al.  Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. , 2003, Genome research.

[56]  Walter Liese,et al.  Bamboo and Rattan in the World , 2003 .

[57]  B. Graveley,et al.  Alternative splicing of the Drosophila Dscam pre-mRNA is both temporally and spatially regulated. , 2001, Genetics.

[58]  K. Nakai,et al.  Construction of a novel database containing aberrant splicing mutations of mammalian genes. , 1994, Gene.