Human GC-AG alternative intron isoforms with weak donor sites show enhanced consensus at acceptor exon positions.

It has been previously observed that the intrinsically weak variant GC donor sites, in order to be recognized by the U2-type spliceosome, possess strong consensus sequences maximized for base pair formation with U1 and U5/U6 snRNAs. However, variability in signal strength is a fundamental mechanism for splice site selection in alternative splicing. Here we report human alternative GC-AG introns (for the first time from any species), and show that while constitutive GC-AG introns do possess strong signals at their donor sites, a large subset of alternative GC-AG introns possess weak consensus sequences at their donor sites. Surprisingly, this subset of alternative isoforms shows strong consensus at acceptor exon positions 1 and 2. The improved consensus at the acceptor exon can facilitate a strong interaction with U5 snRNA, which tethers the two exons for ligation during the second step of splicing. Further, these isoforms nearly always possess alternative acceptor sites and exhibit particularly weak polypyrimidine tracts characteristic of AG-dependent introns. The acceptor exon nucleotides are part of the consensus required for the U2AF(35)-mediated recognition of AG in such introns. Such improved consensus at acceptor exons is not found in either normal or alternative GT-AG introns having weak donor sites or weak polypyrimidine tracts. The changes probably reflect mechanisms that allow GC-AG alternative intron isoforms to cope with two conflicting requirements, namely an apparent need for differential splice strength to direct the choice of alternative sites and a need for improved donor signals to compensate for the central mismatch base pair (C-A) in the RNA duplex of U1 snRNA and the pre-mRNA. The other important findings include (i) one in every twenty alternative introns is a GC-AG intron, and (ii) three of every five observed GC-AG introns are alternative isoforms.

[1]  Christopher B. Burge,et al.  Classification of Introns: U2-Type or U12-Type , 1997, Cell.

[2]  M. Aebi,et al.  Sequence requirements for splicing of higher eukaryotic nuclear pre-mRNA , 1986, Cell.

[3]  A. Newman,et al.  The role of U5 snRNP in pre‐mRNA splicing , 1997, EMBO Journal.

[4]  H. Gallinaro,et al.  The 5' splice site: phylogenetic evolution and variable geometry of association with U1RNA. , 1989, Nucleic acids research.

[5]  Rodrigo Lopez,et al.  The EMBL Nucleotide Sequence Database , 1999, Nucleic Acids Res..

[6]  V. Solovyev,et al.  Analysis of canonical and non-canonical splice sites in mammalian genomes. , 2000, Nucleic acids research.

[7]  J. Beggs,et al.  Protein-RNA interactions in the U5 snRNP of Saccharomyces cerevisiae. , 1998, RNA.

[8]  Stephen M. Mount,et al.  Genomic sequence, splicing, and gene annotation. , 2000, American journal of human genetics.

[9]  Thangavel Alphonse Thanaraj,et al.  Prediction of Exact Boundaries of Exons , 2000, Briefings Bioinform..

[10]  M. Rosenfeld,et al.  Control of alternative pre-mRNA splicing by distributed pentameric repeats. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[11]  A. Krämer,et al.  The structure and function of proteins involved in mammalian pre-mRNA splicing. , 1996, Annual review of biochemistry.

[12]  R. O’Keefe,et al.  Functional analysis of the U5 snRNA loop 1 in the second catalytic step of yeast pre‐mRNA splicing , 1998, The EMBO journal.

[13]  Kevin Burrage,et al.  ISIS, the intron information system, reveals the high frequency of alternative splicing in the human genome , 2000, Nature Genetics.

[14]  Michael Ruogu Zhang,et al.  Statistical features of human exons and their flanking regions. , 1998, Human molecular genetics.

[15]  Raymond F. Gesteland,et al.  Life Before DNA. (Book Reviews: The RNA World. The Nature of Modern RNA Suggests a Prebiotic RNA World.) , 1993 .

[16]  A. Krainer,et al.  Regulation of alternative splicing in vivo by overexpression of antagonistic splicing factors. , 1994, Science.

[17]  Victor V. Solovyev,et al.  SpliceDB: database of canonical and non-canonical mammalian splice sites , 2001, Nucleic Acids Res..

[18]  Michael R. Green,et al.  Functional recognition of the 3′ splice site AG by the splicing factor U2AF35 , 1999, Nature.

[19]  T A Thanaraj,et al.  Positional characterisation of false positives from computational prediction of human splice sites. , 2000, Nucleic acids research.

[20]  M. Gelfand,et al.  Frequent alternative splicing of human genes. , 1999, Genome research.

[21]  L. Chasin,et al.  Control of 3' splice site choice in vivo by ASF/SF2 and hnRNP A1. , 1999, Nucleic acids research.

[22]  J. G. Patton,et al.  Functional analysis of the polypyrimidine tract in pre-mRNA splicing. , 1997, Nucleic acids research.

[23]  R. Parker,et al.  Evidence for an essential non-Watson–Crick interaction between the first and last nucleotides of a nuclear pre-mRNA intron , 1993, Nature.

[24]  R. Reed,et al.  The organization of 3' splice-site sequences in mammalian introns. , 1989, Genes & development.

[25]  J. Valcárcel,et al.  Interaction of U2AF65 RS region with pre-mRNA branch point and promotion of base pairing with U2 snRNA [corrected]. , 1996, Science.

[26]  H. Madhani,et al.  Dynamic RNA-RNA interactions in the spliceosome. , 1994, Annual review of genetics.

[27]  C. Norman,et al.  U5 snRNA interacts with exon sequences at 5′ and 3′ splice sites , 1992, Cell.

[28]  K. Nakai,et al.  Construction of a novel database containing aberrant splicing mutations of mammalian genes. , 1994, Gene.

[29]  A. Krainer Eukaryotic mRNA processing , 1997 .

[30]  J. Beggs,et al.  The role of PRP8 protein in nuclear pre-mRNA splicing in yeast , 1995, Journal of Cell Science.

[31]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[32]  Jackson Ij,et al.  A reappraisal of non-consensus mRNA splice sites. , 1991 .

[33]  P. Sharp,et al.  Branch nucleophile selection in pre-mRNA splicing: evidence for the bulged duplex model. , 1994, Genes & development.

[34]  J. Valcárcel,et al.  Evidence for Substrate-Specific Requirement of the Splicing Factor U2AF35 and for Its Function after Polypyrimidine Tract Recognition by U2AF65 , 1999, Molecular and Cellular Biology.

[35]  P. Sharp,et al.  Splicing of precursors to mRNAs by the spliceosomes , 1993 .

[36]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[37]  Michael Ruogu Zhang,et al.  A sequence compilation and comparison of exons that are alternatively spliced in neurons. , 1994, Nucleic acids research.

[38]  Melissa J. Moore,et al.  Intron recognition comes of AGe , 2000, Nature Structural Biology.

[39]  Gary D. Stormo,et al.  Displaying the information contents of structural RNA alignments: the structure logos , 1997, Comput. Appl. Biosci..

[40]  J. Valcárcel,et al.  Inhibition of msl-2 splicing by Sex-lethal reveals interaction between U2AF35 and the 3′ splice site AG , 1999, Nature.

[41]  A. Krainer,et al.  Mammalian pre-mRNA splicing factors , 1997 .

[42]  Thangavel Alphonse Thanaraj A clean data set of EST-confirmed splice sites from Homo sapiens and standards for clean-up procedures , 1999, Nucleic Acids Res..

[43]  N L Harris,et al.  Splice junctions, branch point sites, and exons: sequence statistics, identification, and applications to genome project. , 1990, Methods in enzymology.

[44]  M. Aebi,et al.  5′ cleavage site in eukaryotic pre-mRNA splicing is determined by the overall 5′ splice region, not by the conserved 5′ GU , 1987, Cell.

[45]  Thomas Blumenthal,et al.  Both subunits of U2AF recognize the 3′ splice site in Caenorhabditis elegans , 1999, Nature.

[46]  I. Jackson,et al.  A reappraisal of non-consensus mRNA splice sites. , 1991, Nucleic acids research.