Deciphering the splicing code

Alternative splicing has a crucial role in the generation of biological complexity, and its misregulation is often involved in human disease. Here we describe the assembly of a ‘splicing code’, which uses combinations of hundreds of RNA features to predict tissue-dependent changes in alternative splicing for thousands of exons. The code determines new classes of splicing patterns, identifies distinct regulatory programs in different tissues, and identifies mutation-verified regulatory sequences. Widespread regulatory strategies are revealed, including the use of unexpectedly large combinations of features, the establishment of low exon inclusion levels that are overcome by features in specific tissues, the appearance of features deeper into introns than previously appreciated, and the modulation of splice variant levels by transcript structure characteristics. The code detected a class of exons whose inclusion silences expression in adult tissues by activating nonsense-mediated messenger RNA decay, but whose exclusion promotes expression during embryogenesis. The code facilitates the discovery and detailed characterization of regulated alternative splicing events on a genome-wide scale.
