The Origins, Evolution, and Functional Potential of Alternative Splicing in Vertebrates

Alternative splicing (AS) has the potential to greatly expand the functional repertoire of mammalian transcriptomes. However, few variant transcripts have been characterized functionally, making it difficult to assess the contribution of AS to the generation of phenotypic complexity and to study the evolution of splicing patterns. We have compared the AS of 309 protein-coding genes in the human ENCODE pilot regions against their mouse orthologs in unprecedented detail, using traditional transcriptomic and RNA-seq data. The conservation status of every transcript has been investigated, and each transcript has been functionally categorized as coding (separated into coding sequence [CDS] or nonsense-mediated decay [NMD] linked) or noncoding. In total, 36.7% of human and 19.3% of mouse coding transcripts are species specific, and we observe a 3.6-fold excess of human NMD transcripts compared with mouse; in contrast to previous studies, the majority of species-specific AS is unlinked to transposable elements. We observe one conserved CDS variant per 2.3 genes and one conserved NMD variant per 11.4 genes. We then identify and characterize equivalent AS patterns for 22.9% of these CDS or NMD-linked events in nonmammalian vertebrate genomes, and our data indicate that functional NMD-linked AS is more widespread and ancient than previously thought. Furthermore, although we observe an association between conserved AS and elevated sequence conservation, as previously reported, we emphasize that 30% of conserved AS exons display sequence conservation below the average score for constitutive exons. In conclusion, we demonstrate the value of detailed comparative annotation in generating a comprehensive set of AS transcripts, increasing our understanding of AS evolution in vertebrates. Our data support a model in which the acquisition of functional AS has occurred throughout vertebrate evolution and should be considered alongside amino acid change as a key mechanism in gene evolution.
