Inferring global levels of alternative splicing isoforms using a generative model of microarray data

MOTIVATION Alternative splicing (AS) is a frequent step in metozoan gene expression whereby the exons of genes are spliced in different combinations to generate multiple isoforms of mature mRNA. AS functions to enrich an organism's proteomic complexity and regulates gene expression. Despite its importance, the mechanisms underlying AS and its regulation are not well understood, especially in the context of global gene expression patterns. We present here an algorithm referred to as the Generative model for the Alternative Splicing Array Platform (GenASAP) that can predict the levels of AS for thousands of exon skipping events using data generated from custom microarrays. GenASAP uses Bayesian learning in an unsupervised probability model to accurately predict AS levels from the microarray data. GenASAP is capable of learning the hybridization profiles of microarray data, while modeling noise processes and missing or aberrant data. GenASAP has been successfully applied to the global discovery and analysis of AS in mammalian cells and tissues. RESULTS GenASAP was applied to data obtained from a custom microarray designed for the monitoring of 3126 AS events in mouse cells and tissues. The microarray design included probes specific for exon body and junction sequences formed by the splicing of exons. Our results show that GenASAP provides accurate predictions for over one-third of the total events, as verified by independent RT-PCR assays. SUPPLEMENTARY INFORMATION http://www.psi.toronto.edu/GenASAP.

[1]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[2]  H. Lodish Molecular Cell Biology , 1986 .

[3]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[4]  Fabio Gagliardi Cozman,et al.  Truncated Gaussians as tolerance sets , 1994 .

[5]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[6]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[7]  D. Black Protein Diversity from Alternative Splicing A Challenge for Bioinformatics and Post-Genome Biology , 2000, Cell.

[8]  B. Blencowe Exonic splicing enhancers: mechanism of action, diversity and role in human genetic diseases. , 2000, Trends in biochemical sciences.

[9]  J. C. Clemens,et al.  Drosophila Dscam Is an Axon Guidance Receptor Exhibiting Extraordinary Molecular Diversity , 2000, Cell.

[10]  David M. Rocke,et al.  A Model for Measurement Error for Gene Expression Arrays , 2001, J. Comput. Biol..

[11]  Bosiljka Tasic,et al.  Alternative pre-mRNA splicing and proteome expansion in metazoans , 2002, Nature.

[12]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[13]  Christopher J. Lee,et al.  Genome-wide detection of tissue-specific alternative splicing in the human transcriptome. , 2002, Nucleic acids research.

[14]  Andrew J. Olson,et al.  Computational analysis of alternative splicing using EST tissue information. , 2002, Genomics.

[15]  Tyson A. Clark,et al.  Genomewide Analysis of mRNA Processing in Yeast Using Splicing-Specific Microarrays , 2002, Science.

[16]  A. Krainer,et al.  Listening to silence and understanding nonsense: exonic mutations that affect splicing , 2002, Nature Reviews Genetics.

[17]  Christopher J. Lee,et al.  Discovery of novel splice forms and functional analysis of cancer-specific alternative splicing in human expressed sequences. , 2003, Nucleic acids research.

[18]  K. Buetow,et al.  Computational analysis and experimental validation of tumor-associated alternative RNA splicing in human cancer. , 2003, Cancer research.

[19]  J. Castle,et al.  Genome-Wide Survey of Human Alternative Pre-mRNA Splicing with Exon Junction Microarrays , 2003, Science.

[20]  David Haussler,et al.  Gene structure-based splice variant deconvolution using a microarry platform , 2003, ISMB.

[21]  David M. Rocke,et al.  Variance-stabilizing transformations for two-color microarrays , 2004, Bioinform..

[22]  Christopher J. Lee,et al.  Detecting tissue-specific regulation of alternative splicing as a qualitative change in microarray data. , 2004, Nucleic acids research.

[23]  S. Haas,et al.  Strengths and weaknesses of EST-based prediction of tissue-specific alternative splicing , 2004, BMC Genomics.

[24]  Yixue Li,et al.  Identification of alternatively spliced mRNA variants related to cancers by genome-wide ESTs alignment , 2004, Oncogene.

[25]  R. Shamir,et al.  How prevalent is functional alternative splicing in the human genome? , 2004, Trends in genetics : TIG.

[26]  B. Frey,et al.  Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. , 2004, Molecular cell.

[27]  Kae Sato,et al.  Non-cross-linking gold nanoparticle aggregation as a detection method for single-base substitutions , 2005, Nucleic acids research.

[28]  P. Fehlbaum,et al.  A microarray configuration to quantify expression levels and relative abundance of splice variants , 2005, Nucleic acids research.

[29]  Simon Cawley,et al.  ANOSVA: a statistical method for detecting splice variation from expression data , 2005, ISMB.

[30]  B. Frey,et al.  Alternative splicing of conserved exons is frequently species-specific in human and mouse. , 2005, Trends in genetics : TIG.

[31]  F. Clark,et al.  Understanding alternative splicing: towards a cellular code , 2005, Nature Reviews Molecular Cell Biology.

[32]  S. Brenner,et al.  Global analysis of positive and negative pre-mRNA splicing regulators in Drosophila. , 2005, Genes & development.

[33]  V. Beneš,et al.  Alternative Splicing Microarrays Reveal Functional Expression of Neuron-specific Regulators in Hodgkin Lymphoma Cells* , 2005, Journal of Biological Chemistry.

[34]  William J. Byrne,et al.  Convergence Theorems for Generalized Alternating Minimization Procedures , 2005, J. Mach. Learn. Res..

[35]  Tyson A. Clark,et al.  Nova regulates brain-specific splicing to shape the synapse , 2005, Nature Genetics.

[36]  B. Frey,et al.  Genome-wide analysis of mouse transcripts using exon microarrays and factor graphs , 2005, Nature Genetics.