Prediction of alternatively skipped exons and splicing enhancers from exon junction arrays

BackgroundAlternative splicing of exons in a pre-mRNA transcript is an important mechanism which contributes to protein diversity in human. Arrays for detecting alternative splicing are available using several different probe designs, including those based on exon-junctions. In this work, we introduce a new method for predicting alternatively skipped exons from exon-junction arrays. Predictions based on our method are compared against controls and their sequences are analyzed to identify motifs important for regulating alternative splicing.ResultsOur comparison of several alternative methods shows that an exon-skipping score based on neighboring junctions best discriminates between positive and negative controls. Sequence analysis of our predicted exons confirms the presence of known splicing regulatory sequences. In addition, we also derive a set of development-related alternatively spliced genes based on fetal versus adult tissue comparisons and find that our predictions are consistent with their functional annotations. Ab initio motif finding algorithms are applied to identify several motifs that may be relevant for splicing during development.ConclusionThis work describes a new method for analyzing exon-junction arrays, identifies sequence motifs that are specific for alternative and constitutive splicing and suggests a role for several known splicing factors and their motifs in developmental regulation.

[1]  R. Shamir,et al.  How prevalent is functional alternative splicing in the human genome? , 2004, Trends in genetics : TIG.

[2]  A. Krainer,et al.  Pre-mRNA splicing in the new millennium. , 2001, Current opinion in cell biology.

[3]  John A Thompson,et al.  Alternatively spliced FGFR-1 isoform signaling differentially modulates endothelial cell responses to peroxynitrite. , 2003, Archives of biochemistry and biophysics.

[4]  D. Black Mechanisms of alternative pre-messenger RNA splicing. , 2003, Annual review of biochemistry.

[5]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[6]  G. Ast,et al.  Comparative analysis identifies exonic splicing regulatory sequences--The complex definition of enhancers and silencers. , 2006, Molecular cell.

[7]  A. Rosenthal,et al.  Exon 2 of the gene for neural cell adhesion molecule L1 is alternatively spliced in B cells. , 1995, Brain research. Molecular brain research.

[8]  Tyson A. Clark,et al.  Nova regulates brain-specific splicing to shape the synapse , 2005, Nature Genetics.

[9]  Gene W. Yeo,et al.  Inference of Splicing Regulatory Activities by Sequence Neighborhood Analysis , 2006, PLoS genetics.

[10]  Christopher B. Burge,et al.  Recognition of Unknown Conserved Alternatively Spliced Exons , 2005, PLoS Comput. Biol..

[11]  L. Chasin,et al.  Computational definition of sequence motifs governing constitutive exon splicing. , 2004, Genes & development.

[12]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[13]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[14]  M. Tomita,et al.  Computational comparative analyses of alternative splicing regulation using full-length cDNA of various eukaryotes. , 2004, RNA.

[15]  T. Jatkoe,et al.  Predicting splice variant from DNA chip expression data. , 2001, Genome research.

[16]  L. Chasin,et al.  Human Genomic Sequences That Inhibit Splicing , 2000, Molecular and Cellular Biology.

[17]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[18]  B. Frey,et al.  Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. , 2004, Molecular cell.

[19]  Brendan J. Frey,et al.  Inferring global levels of alternative splicing isoforms using a generative model of microarray data , 2006, Bioinform..

[20]  Christopher J. Lee,et al.  Detecting tissue-specific regulation of alternative splicing as a qualitative change in microarray data. , 2004, Nucleic acids research.

[21]  G. Vassal,et al.  Neurofibromatosis 1 (NF1) mRNAs expressed in the central nervous system are differentially spliced in the 5' part of the gene. , 1995, Human molecular genetics.

[22]  Christopher J. Lee,et al.  Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss , 2003, Nature Genetics.

[23]  S. Berget,et al.  G triplets located throughout a class of small vertebrate introns enforce intron borders and regulate splice site selection , 1997, Molecular and cellular biology.

[24]  R. Amann,et al.  Predictive Identification of Exonic Splicing Enhancers in Human Genes , 2022 .

[25]  Tyson A. Clark,et al.  Genomewide Analysis of mRNA Processing in Yeast Using Splicing-Specific Microarrays , 2002, Science.

[26]  Tom Maniatis,et al.  Selection and Characterization of Pre-mRNA Splicing Enhancers: Identification of Novel SR Protein-Specific Enhancer Sequences , 1999, Molecular and Cellular Biology.

[27]  Douglas L. Brutlag,et al.  BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes , 2000, Pacific Symposium on Biocomputing.

[28]  D. Geschwind,et al.  Expression patterns of epidermal growth factor receptor and fibroblast growth factor receptor 1 mRNA in fetal human brain , 2003, The Journal of comparative neurology.

[29]  Tyson A. Clark,et al.  Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array , 2006, BMC Genomics.

[30]  I-Min A. Dubchak,et al.  Computational analysis of candidate intron regulatory elements for tissue-specific alternative pre-mRNA splicing. , 2001, Nucleic acids research.

[31]  C. Burge,et al.  A computational analysis of sequence features involved in recognition of short introns , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Tomaso Poggio,et al.  Identification and analysis of alternative splicing events conserved in human and mouse. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Gene W. Yeo,et al.  Variation in alternative splicing across human tissues , 2004, Genome Biology.

[34]  H. Margalit,et al.  Conserved sequence elements associated with exon skipping. , 2003, Nucleic acids research.

[35]  Tyson A. Clark,et al.  A correlation with exon expression approach to identify cis-regulatory elements for tissue-specific alternative splicing , 2007, Nucleic acids research.

[36]  Christopher J. Lee,et al.  Genome-wide detection of alternative splicing in expressed sequences of human genes , 2001, Nucleic Acids Res..

[37]  H. Bussemaker,et al.  Regulatory element detection using correlation with expression , 2001, Nature Genetics.

[38]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[39]  David Haussler,et al.  Unusual Intron Conservation near Tissue-Regulated Exons Found by Splicing Microarrays , 2005, PLoS Comput. Biol..

[40]  J. Vandesompele,et al.  Quantification of NF1 transcripts reveals novel highly expressed splice variants , 2002, FEBS letters.

[41]  Toshiyuki Miyashita,et al.  Detecting tissue-specific alternative splicing and disease-associated aberrant splicing of the PTCH gene with exon junction microarrays. , 2005, Human molecular genetics.

[42]  Juha Muilu,et al.  Conservation of human alternative splice events in mouse. , 2003, Nucleic acids research.

[43]  T A Thanaraj,et al.  Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human. , 2002, Human molecular genetics.

[44]  P. Fehlbaum,et al.  A microarray configuration to quantify expression levels and relative abundance of splice variants , 2005, Nucleic acids research.

[45]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[46]  Harold R. Garner,et al.  Evidence for the regulation of alternative splicing via complementary DNA sequence repeats , 2005, Bioinform..

[47]  K. Heller,et al.  Sequence information for the splicing of human pre-mRNA identified by support vector machine classification. , 2003, Genome research.

[48]  Simon Cawley,et al.  ANOSVA: a statistical method for detecting splice variation from expression data , 2005, ISMB.

[49]  Charles Elkan,et al.  Unsupervised learning of multiple motifs in biopolymers using expectation maximization , 1995, Mach. Learn..

[50]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[51]  T. Vorherr,et al.  Peptide sequence analysis and molecular cloning reveal two calcium pump isoforms in the human erythrocyte membrane. , 1990, The Journal of biological chemistry.

[52]  Yael Mandel-Gutfreund,et al.  Detection and measurement of alternative splicing using splicing-sensitive microarrays. , 2005, Methods.

[53]  J. Castle,et al.  Genome-Wide Survey of Human Alternative Pre-mRNA Splicing with Exon Junction Microarrays , 2003, Science.

[54]  T. Cooper,et al.  Finding signals that regulate alternative splicing in the post-genomic era , 2002, Genome Biology.

[55]  B. Blencowe Exonic splicing enhancers: mechanism of action, diversity and role in human genetic diseases. , 2000, Trends in biochemical sciences.

[56]  Tyson A. Clark,et al.  Discovery of tissue-specific exons using comprehensive human exon microarrays , 2007, Genome Biology.

[57]  B. Frey,et al.  Quantitative microarray profiling provides evidence against widespread coupling of alternative splicing with nonsense-mediated mRNA decay to control gene expression. , 2006, Genes & development.

[58]  S. Stamm,et al.  Htra2-beta 1 stimulates an exonic splicing enhancer and can restore full-length SMN expression to survival motor neuron 2 (SMN2). , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[59]  T. Cooper,et al.  Identification of a new class of exonic splicing enhancers by in vivo selection , 1997, Molecular and cellular biology.

[60]  Gene W. Yeo,et al.  Discovery and Analysis of Evolutionarily Conserved Intronic Splicing Regulatory Elements , 2007, PLoS Genetics.

[61]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[62]  W. Gish,et al.  Gene structure prediction and alternative splicing analysis using genomically aligned ESTs. , 2001, Genome research.

[63]  A. J. Lopez,et al.  Alternative splicing of pre-mRNA: developmental consequences and mechanisms of regulation. , 1998, Annual review of genetics.

[64]  J. Conboy,et al.  The splicing regulatory element, UGCAUG, is phylogenetically and spatially conserved in introns that flank tissue-specific alternative exons , 2005, Nucleic acids research.

[65]  R. Sorek,et al.  Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. , 2003, Genome research.

[66]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[67]  Nabil Belacel,et al.  Microarray analysis of alternative splicing. , 2006, Omics : a journal of integrative biology.

[68]  Terry Gaasterland,et al.  Impact of alternative initiation, splicing, and termination on the diversity of the mRNA transcripts encoded by the mouse transcriptome. , 2003, Genome research.

[69]  M. Gelfand,et al.  Frequent alternative splicing of human genes. , 1999, Genome research.

[70]  Michael Q. Zhang,et al.  Profiling alternatively spliced mRNA isoforms for prostate cancer classification , 2006, BMC Bioinformatics.

[71]  D. Goldstein,et al.  Alternative ion channel splicing in mesial temporal lobe epilepsy and Alzheimer's disease , 2007, Genome Biology.

[72]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[73]  N. Bresolin,et al.  Silencer elements as possible inhibitors of pseudoexon splicing. , 2004, Nucleic acids research.