Genome-wide identification and classification of alternative splicing based on EST data

MOTIVATION: Alternative splicing is currently thought to explain the vast disparity between the number of predicted genes in the human genome and the highly diverse proteome. Mapping expressed sequence tag (EST) consensus sequences derived from the GeneNest database onto the genome provides an efficient way of predicting exon-intron boundaries, gene structure and alternative splicing events. However, the alternative splicing events are obscured by a large number of putatively artificial exon boundaries arising from genomic contamination or alignment errors. The current work describes a methodology for assigning quality values to the predicted exon-intron boundaries. High-quality exon-intron boundaries are used to predict constitutive and alternative splicing events ranked by confidence values, with the aim of facilitating large-scale analysis of alternative splicing and of splicing in general.

RESULTS: Applying the current methodology, constitutive splicing is observed in 33,270 EST clusters, of which 45% are alternatively spliced. The classification derived from the computed confidence values for 17 of these splice events frequently correlates (15/17) with RT-PCR experiments performed on 40 different tissue samples. As an application of the confidence measure, an evaluation of the distribution of alternative splicing revealed that the majority of variants correspond to the coding regions of the genes. However, a significant fraction still maps to non-coding regions, indicating a functional relevance of alternative splicing in untranslated regions.

AVAILABILITY: The predicted alternative splice variants are visualized in the SpliceNest database at http://splicenest.molgen.mpg.de
