Noisy Splicing Drives mRNA Isoform Diversity in Human Cells

While the majority of multiexonic human genes show some evidence of alternative splicing, it is unclear what fraction of the observed splice forms is functionally relevant. In this study, we examine the extent of alternative splicing in human cells using deep RNA sequencing and de novo identification of splice junctions. We demonstrate the existence of a large class of low-abundance isoforms, encompassing approximately 150,000 previously unannotated splice junctions in our data. Newly identified splice sites show little evidence of evolutionary conservation, suggesting that the majority result from erroneous splice site choice. We show that sequence motifs involved in the recognition of exons are enriched in the vicinity of unconserved splice sites. We estimate that the average intron has a splicing error rate of approximately 0.7% and show that introns in highly expressed genes are spliced more accurately, likely due to their shorter length. These results implicate noisy splicing as an important property of genome evolution.
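To make the notion of a per-intron error rate concrete, the following is a minimal sketch of one way such a quantity could be computed from junction read counts. It is an illustration under stated assumptions, not the estimator used in this study: the function name `intron_error_rates`, the rule that an unannotated junction counts as an error for an annotated junction when the two share a donor or an acceptor site, and all read counts are hypothetical.

```python
def intron_error_rates(junction_counts, annotated):
    """Per-intron error rate from junction read counts (illustrative only).

    junction_counts: dict mapping (donor, acceptor) -> number of reads.
    annotated: set of (donor, acceptor) pairs treated as correct junctions.
    Assumption: an unannotated junction sharing a donor or an acceptor
    with an annotated junction is counted as an error for that intron.
    """
    rates = {}
    for donor, acceptor in annotated:
        canonical = junction_counts.get((donor, acceptor), 0)
        erroneous = sum(
            n for (d, a), n in junction_counts.items()
            if (d, a) not in annotated and (d == donor or a == acceptor)
        )
        total = canonical + erroneous
        if total > 0:  # skip introns with no supporting reads
            rates[(donor, acceptor)] = erroneous / total
    return rates


# Hypothetical counts: two annotated introns, two low-abundance junctions.
counts = {(100, 200): 990, (100, 210): 6, (90, 200): 4, (300, 400): 50}
annotated = {(100, 200), (300, 400)}

rates = intron_error_rates(counts, annotated)
mean_rate = sum(rates.values()) / len(rates)
print(rates, mean_rate)
```

In this toy input, 10 of the 1,000 reads at the first intron use an unannotated partner site, while the second intron has none. Averaging such per-intron rates across all expressed introns gives a summary of the same form as the figure quoted above, although a real analysis would also need to account for mapping errors and sequencing depth.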
