Ab initio prediction of mutation-induced cryptic splice-site activation and exon skipping

Mutations that affect splicing of precursor messenger RNAs play a major role in the development of hereditary diseases. Most splicing mutations have been found to eliminate GT or AG dinucleotides that define the 5′ and 3′ ends of introns, leading to exon skipping or cryptic splice-site activation. Although accurate description of the mis-spliced transcripts is critical for predicting phenotypic consequences of these alterations, their exact nature in affected individuals cannot often be determined experimentally. Using a comprehensive collection of exons that sustained cryptic splice-site activation or were skipped as a result of splice-site mutations, we have developed a multivariate logistic discrimination procedure that distinguishes the two aberrant splicing outcomes from DNA sequences. The new algorithm was validated using an independent sample of exons and implemented as a free online utility termed CRYP-SKIP (http://www.dbass.org.uk/cryp-skip/). The web application takes up one or more mutated alleles, each consisting of one exon and flanking intronic sequences, and provides a list of important predictor variables and their values, the overall probability of activating cryptic splice vs exon skipping, and the location and intrinsic strength of predicted cryptic splice sites in the input sequence. These results will facilitate phenotypic prediction of splicing mutations and provide further insights into splicing enhancer and silencer elements and their relative importance for splice-site selection in vivo.

[1]  E. Buratti,et al.  Influence of RNA Secondary Structure on the Pre-mRNA Splicing Process , 2004, Molecular and Cellular Biology.

[2]  Jinhua Wang,et al.  ESEfinder: a web resource to identify exonic splicing enhancers , 2003, Nucleic Acids Res..

[3]  Tom Maniatis,et al.  Selection and Characterization of Pre-mRNA Splicing Enhancers: Identification of Novel SR Protein-Specific Enhancer Sequences , 1999, Molecular and Cellular Biology.

[4]  D. Cooper,et al.  Human Gene Mutation , 1993 .

[5]  A. Zahler,et al.  Determination of the RNA Binding Specificity of the Heterogeneous Nuclear Ribonucleoprotein (hnRNP) H/H′/F/2H9 Family* , 2001, The Journal of Biological Chemistry.

[6]  L. Chasin,et al.  Computational definition of sequence motifs governing constitutive exon splicing. , 2004, Genes & development.

[7]  Rolf Backofen,et al.  Pre-mRNA Secondary Structures Influence Exon Recognition , 2007, PLoS genetics.

[8]  E. Androphy,et al.  In vivo selection reveals combinatorial controls that define a critical exon in the spinal muscular atrophy genes. , 2004, RNA.

[9]  Michael Q. Zhang,et al.  An increased specificity score matrix for the prediction of SF2/ASF-specific exonic splicing enhancers. , 2006, Human molecular genetics.

[10]  Thaned Kangsamaksin,et al.  Exon Inclusion Is Dependent on Predictable Exonic Splicing Enhancers , 2005, Molecular and Cellular Biology.

[11]  Gene W. Yeo,et al.  Systematic Identification and Analysis of Exonic Splicing Silencers , 2004, Cell.

[12]  Zefeng Wang,et al.  General and specific functions of exonic splicing silencers in splicing control. , 2006, Molecular cell.

[13]  G. Dreyfuss,et al.  The hnRNP F protein: unique primary structure, nucleic acid-binding properties, and subcellular localization. , 1994, Nucleic acids research.

[14]  Adrian R. Krainer,et al.  Aberrant 5′ splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization , 2007, Nucleic acids research.

[15]  I. Vořechovský,et al.  Aberrant 3′ splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization , 2006, Nucleic acids research.

[16]  R. Amann,et al.  Predictive Identification of Exonic Splicing Enhancers in Human Genes , 2022 .

[17]  X. Estivill,et al.  Mutations affecting mRNA splicing are the most common molecular defects in patients with neurofibromatosis type 1. , 2000, Human molecular genetics.

[18]  Gil Ast,et al.  Comparative analysis detects dependencies among the 5' splice-site positions. , 2004, RNA.

[19]  A. Krainer,et al.  Extensive in silico analysis of NF1 splicing defects uncovers determinants for splicing outcome upon 5′ splice‐site disruption , 2007, Human mutation.

[20]  G. Dreyfuss,et al.  hnRNP I, the polypyrimidine tract-binding protein: distinct nuclear localization and association with hnRNAs. , 1992, Nucleic acids research.

[21]  Laura Fiori,et al.  Disease‐causing mutations improving the branch site and polypyrimidine tract: Pseudoexon activation of LINE‐2 and antisense Alu lacking the poly(T)‐tail , 2009, Human mutation.

[22]  G. Ast,et al.  Comparative analysis identifies exonic splicing regulatory sequences--The complex definition of enhancers and silencers. , 2006, Molecular cell.

[23]  B. Frey,et al.  Alternative splicing of conserved exons is frequently species-specific in human and mouse. , 2005, Trends in genetics : TIG.

[24]  R. Reed,et al.  The organization of 3' splice-site sequences in mammalian introns. , 1989, Genes & development.

[25]  David Haussler,et al.  Improved splice site detection in Genie , 1997, RECOMB '97.

[26]  C. Burd,et al.  RNA binding specificity of hnRNP A1: significance of hnRNP A1 high‐affinity binding sites in pre‐mRNA splicing. , 1994, The EMBO journal.

[27]  J. Královičová,et al.  Global control of aberrant splice-site activation by auxiliary splicing sequences: evidence for a gradient in exon and intron definition , 2007, Nucleic acids research.

[28]  Sara G. Becker-Catania,et al.  Splicing defects in the ataxia-telangiectasia gene, ATM: underlying mutations and consequences. , 1999, American journal of human genetics.

[29]  Gene W. Yeo,et al.  A Combinatorial Code for Splicing Silencing: UAGG and GGGG Motifs , 2005, PLoS biology.

[30]  T. Maniatis,et al.  SR proteins are ‘locators’ of the RNA splicing machinery , 1999, Current Biology.

[31]  Igor Vorechovsky,et al.  Position-Dependent Repression and Promotion of DQB1 Intron 3 Splicing by GGGG Motifs1 , 2006, The Journal of Immunology.

[32]  S. Berget,et al.  G triplets located throughout a class of small vertebrate introns enforce intron borders and regulate splice site selection , 1997, Molecular and cellular biology.

[33]  A. Krainer,et al.  Identification of Functional Exonic Splicing Enhancer Motifs Recognized by Individual Sr Proteins Using an in Vitro Randomization and Functional Selection Procedure, We Have Identified Three Novel Classes of Exonic Splicing Enhancers (eses) Recognized by Human Sf2/asf, Srp40, and Srp55, Respectively , 2022 .

[34]  Dominique Stoppa-Lyonnet,et al.  Evaluation of in silico splice tools for decision‐making in molecular diagnosis , 2008, Human mutation.

[35]  Michael Q. Zhang,et al.  RNA landscape of evolution for optimal exon and intron discrimination , 2008, Proceedings of the National Academy of Sciences.

[36]  Gene W. Yeo,et al.  Inference of Splicing Regulatory Activities by Sequence Neighborhood Analysis , 2006, PLoS genetics.

[37]  Marvin B. Shapiro,et al.  RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression. , 1987, Nucleic acids research.