Assessing the fraction of short-distance tandem splice sites under purifying selection.

Many alternative splice events result in subtle mRNA changes, and most of them occur at short-distance tandem donor and acceptor sites. The splicing mechanism of such tandem sites likely involves the stochastic selection of either splice site. While tandem splice events are frequent, it is unknown how many are functionally important. Here, we use phylogenetic conservation to address this question, focusing on tandems with a distance of 3-9 nucleotides. We show that previous contradicting results on whether alternative or constitutive tandem motifs are more conserved between species can be explained by a statistical paradox (Simpson's paradox). Applying methods that take biases into account, we found higher conservation of alternative tandems in mouse, dog, and even chicken, zebrafish, and Fugu genomes. We estimated a lower bound for the number of alternative sites that are under purifying (negative) selection. While the absolute number of conserved tandem motifs decreases with the evolutionary distance, the fraction under selection increases. Interestingly, a number of frameshifting tandems are under selection, suggesting a role in regulating mRNA and protein levels via nonsense-mediated decay (NMD). An analysis of the intronic flanks shows that purifying selection also acts on the intronic sequence. We propose that stochastic splice site selection can be an advantageous mechanism that allows constant splice variant ratios in situations where a deviation in this ratio is deleterious.

[1]  Christopher J. Lee,et al.  Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss , 2003, Nature Genetics.

[2]  Marc Fellous,et al.  Donor splice-site mutations in WT1 are responsible for Frasier syndrome , 1997, Nature Genetics.

[3]  Rolf Backofen,et al.  Widespread occurrence of alternative splicing at NAGNAG acceptors contributes to proteome plasticity , 2004, Nature Genetics.

[4]  R. Guigó,et al.  Comparison of splice sites in mammals and chicken. , 2005, Genome research.

[5]  David Haussler,et al.  The UCSC genome browser database: update 2007 , 2006, Nucleic Acids Res..

[6]  Minhong Yan,et al.  The crystal structures of EDA-A1 and EDA-A2: splice variants with distinct receptor specificity. , 2003, Structure.

[7]  S. Brenner,et al.  Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[9]  G. Condorelli,et al.  Two alternatively spliced forms of the human insulin-like growth factor I receptor have distinct biological activities and internalization kinetics. , 1994, The Journal of biological chemistry.

[10]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[11]  Toshiyuki Miyashita,et al.  Frequent occurrence of protein isoforms with or without a single amino acid residue by subtle alternative splicing: the case of Gln in DRPLA affects subcellular localization of the products , 2005, Journal of Human Genetics.

[12]  Gene W. Yeo,et al.  Inference of Splicing Regulatory Activities by Sequence Neighborhood Analysis , 2006, PLoS genetics.

[13]  A. D. de Vos,et al.  Two-amino acid molecular switch in an epithelial morphogen that regulates binding to two distinct receptors. , 2000, Science.

[14]  A. Joyner,et al.  The Exon 8-Containing Prosaposin Gene Splice Variant Is Dispensable for Mouse Development, Lysosomal Function, and Secretion , 2005, Molecular and Cellular Biology.

[15]  S. Clarke,et al.  A Second Protein l-Isoaspartyl Methyltransferase Gene in Arabidopsis Produces Two Transcripts Whose Products Are Sequestered in the Nucleus1[w] , 2004, Plant Physiology.

[16]  B. Branstetter,et al.  Categorization and characterization of lesions of the orbital apex , 2011, Neuroradiology.

[17]  K. Vogan,et al.  An alternative splicing event in the Pax-3 paired domain identifies the linker region as a key determinant of paired domain DNA-binding activity , 1996, Molecular and cellular biology.

[18]  Yimeng Dou,et al.  Genomic splice-site analysis reveals frequent alternative splicing close to the dominant splice site. , 2006, RNA.

[19]  Jun Kawai,et al.  A Simple Physical Model Predicts Small Exon Length Variations , 2006, PLoS genetics.

[20]  C. Obie,et al.  Molecular enzymology of mammalian Delta1-pyrroline-5-carboxylate synthase. Alternative splice donor utilization generates isoforms with different sensitivity to ornithine inhibition. , 1999, The Journal of biological chemistry.

[21]  E. Birney,et al.  Comparative genomics: genome-wide analysis in metazoan eukaryotes , 2003, Nature Reviews Genetics.

[22]  R. Sorek,et al.  Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. , 2003, Genome research.

[23]  Hagen Blankenburg,et al.  The implications of alternative splicing in the ENCODE protein complement , 2007, Proceedings of the National Academy of Sciences.

[24]  R. Shamir,et al.  How prevalent is functional alternative splicing in the human genome? , 2004, Trends in genetics : TIG.

[25]  Bruce G. Bills,et al.  A Simple Physical Model for Deep Moonquakes , 2009 .

[26]  Donny D. Licatalosi,et al.  Splicing Regulation in Neurologic Disease , 2006, Neuron.

[27]  P. Bickel,et al.  Sex Bias in Graduate Admissions: Data from Berkeley , 1975, Science.

[28]  Stefan Stamm,et al.  Human tra2-beta1 autoregulates its protein concentration by influencing alternative splicing of its pre-mRNA. , 2004, Human molecular genetics.

[29]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[30]  P. Green,et al.  Sequence conservation, relative isoform frequencies, and nonsense-mediated decay in evolutionarily conserved alternative splicing. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  E. H. Simpson,et al.  The Interpretation of Interaction in Contingency Tables , 1951 .

[32]  S. Brenner,et al.  Unproductive splicing of SR genes associated with highly conserved and ultraconserved DNA elements , 2007, Nature.

[33]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[34]  Terry Gaasterland,et al.  Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome. , 2003, Genome research.

[35]  Michael R. Green,et al.  Functional recognition of the 3′ splice site AG by the splicing factor U2AF35 , 1999, Nature.

[36]  Christopher B. Burge,et al.  Identification and analysis of alternative splicing events conserved in human and mouse Gene , 2005 .

[37]  M. Schmidt,et al.  Cloning of an interferon regulatory factor 2 isoform with different regulatory ability. , 2000, Nucleic acids research.

[38]  T A Thanaraj,et al.  Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human. , 2002, Human molecular genetics.

[39]  Tyson A. Clark,et al.  Ultraconserved elements are associated with homeostatic control of splicing regulators by alternative splicing and nonsense-mediated decay. , 2007, Genes & development.

[40]  Francisco E. Baralle,et al.  Genomic variants in exons and introns: identifying the splicing spoilers , 2004, Nature Reviews Genetics.

[41]  Rolf Backofen,et al.  Phylogenetically widespread alternative splicing at unusual GYNGYN donors , 2006, Genome Biology.

[42]  K. Tsai,et al.  Wobble Splicing Reveals the Role of the Branch Point Sequence-to-NAGNAG Region in 3′ Tandem Splice Site Selection , 2007, Molecular and Cellular Biology.

[43]  B. Frey,et al.  Alternative splicing of conserved exons is frequently species-specific in human and mouse. , 2005, Trends in genetics : TIG.

[44]  H. Takeda,et al.  A novel POU domain gene, zebrafish pou2: expression and roles of two alternatively spliced twin products in early development. , 1994, Genes & development.

[45]  Yi Xing,et al.  Evidence for a subpopulation of conserved alternative splicing events under selection pressure for protein reading frame preservation. , 2004, Nucleic acids research.

[46]  N D Hastie,et al.  Did nucleotides or amino acids drive evolutionary conservation of the WT1 +/-KTS alternative splice? , 2000, Human molecular genetics.

[47]  S. Julious,et al.  Confounding and Simpson's paradox , 1994, BMJ.

[48]  Christopher J. Lee,et al.  Protein Modularity of Alternatively Spliced Exons Is Associated with Tissue-Specific Regulation of Alternative Splicing , 2005, PLoS genetics.

[49]  A. Krainer,et al.  The gene encoding the splicing factor SF2/ASF is a proto-oncogene , 2007, Nature Structural &Molecular Biology.

[50]  D. Black Mechanisms of alternative pre-messenger RNA splicing. , 2003, Annual review of biochemistry.

[51]  B. Frey,et al.  Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. , 2004, Molecular cell.

[52]  A. Schedl,et al.  Two Splice Variants of the Wilms' Tumor 1 Gene Have Distinct Functions during Sex Determination and Nephron Formation , 2001, Cell.

[53]  Yael Mandel-Gutfreund,et al.  Alternative splicing regulation at tandem 3′ splice sites , 2006, Nucleic acids research.

[54]  Brenton R Graveley,et al.  Mutually Exclusive Splicing of the Insect Dscam Pre-mRNA Directed by Competing Intronic RNA Secondary Structures , 2005, Cell.

[55]  Ramil N. Nurtdinov,et al.  Overlapping Alternative donor splice Sites in the Human genome , 2007, J. Bioinform. Comput. Biol..

[56]  Francisco E. Baralle,et al.  Reduced splicing efficiency induced by synonymous substitutions may generate a substrate for natural selection of new splicing isoforms: the case of CFTR exon 12 , 2006, Nucleic acids research.

[57]  David Haussler,et al.  Transcriptome and Genome Conservation of Alternative Splicing Events in Humans and Mice , 2003, Pacific Symposium on Biocomputing.

[58]  Tyson A. Clark,et al.  Nova regulates brain-specific splicing to shape the synapse , 2005, Nature Genetics.

[59]  Kristen W. Lynch,et al.  Consequences of regulated pre-mRNA splicing in the immune system , 2004, Nature Reviews Immunology.

[60]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[61]  Stephen M. Mount,et al.  Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis , 2006, BMC Genomics.

[62]  Rolf Backofen,et al.  TassDB: a database of alternative tandem splice sites , 2006, Nucleic Acids Res..

[63]  Rolf Backofen,et al.  Alternative Splicing at NAGNAG Acceptors: Simply Noise or Noise and More? , 2006, PLoS genetics.

[64]  Rolf Backofen,et al.  Single-nucleotide polymorphisms in NAGNAG acceptors are highly predictive for variations of alternative splicing. , 2006, American journal of human genetics.

[65]  J. Heath,et al.  Association of the Signaling Adaptor FRS2 with Fibroblast Growth Factor Receptor 1 (Fgfr1) Is Mediated by Alternative Splicing of the Juxtamembrane Domain* , 2002, The Journal of Biological Chemistry.

[66]  J. Jean,et al.  γ-Glutamyltransferase and Its Isoform Mediate an Endoplasmic Reticulum Stress Response* , 2001, The Journal of Biological Chemistry.