Prediction of Mutant mRNA Splice Isoforms by Information Theory‐Based Exon Definition

Mutations that affect mRNA splicing often produce multiple mRNA isoforms, resulting in complex molecular phenotypes. Definition of an exon and its inclusion in mature mRNA relies on joint recognition of both acceptor and donor splice sites. This study predicts cryptic and exon‐skipping isoforms in mRNA produced by splicing mutations from the combined information contents (Ri, which measures binding‐site strength, in bits) and distribution of the splice sites defining these exons. The total information content of an exon (Ri,total) is the sum of the Ri values of its acceptor and donor splice sites, adjusted for the self‐information of the distance separating these sites, that is, the gap surprisal. Differences between total information contents of an exon (ΔRi,total) are predictive of the relative abundance of these exons in distinct processed mRNAs. Constraints on splice site and exon selection are used to eliminate nonconforming and poorly expressed isoforms. Molecular phenotypes are computed by the Automated Splice Site and Exon Definition Analysis (http://splice.uwo.ca) server. Predictions of splicing mutations were highly concordant (85.2%; n = 61) with published expression data. In silico exon definition analysis will contribute to streamlining assessment of abnormal and normal splice isoforms resulting from mutations.

[1]  Myron Tribus,et al.  Thermostatics and thermodynamics : an introduction to energy, information and states of matter, with engineering applications , 1961 .

[2]  T. D. Schneider,et al.  Information analysis of human splice site mutations , 1998, Human mutation.

[3]  E. Fleck,et al.  Spectrum of clinical phenotypes and gene variants in cardiac myosin-binding protein C mutation carriers with hypertrophic cardiomyopathy. , 2001, Journal of the American College of Cardiology.

[4]  M. Bolisetty,et al.  Splicing of internal large exons is defined by novel cis-acting sequence elements , 2012, Nucleic acids research.

[5]  J. Goodship,et al.  Sequencing EVC and EVC2 identifies mutations in two-thirds of Ellis–van Creveld syndrome patients , 2006, Human Genetics.

[6]  B. Lämmle,et al.  The novel acceptor splice site mutation 11396(G-->A) in the factor XII gene causes a truncated transcript in cross-reacting material negative patients. , 1995, Human molecular genetics.

[7]  Peter K. Rogana,et al.  Information theory-based analysis of CYP 2 C 19 , CYP 2 D 6 and CYP 3 A 5 splicing mutations , 2003 .

[8]  C. Alonso,et al.  RNA analysis of eight BRCA1 and BRCA2 unclassified variants identified in breast/ovarian cancer families from Spain , 2003, Human mutation.

[9]  R Kole,et al.  Selection of splice sites in pre-mRNAs with short internal exons , 1991, Molecular and cellular biology.

[10]  B. Andresen,et al.  Splicing of phenylalanine hydroxylase (PAH) exon 11 is vulnerable: molecular pathology of mutations in PAH exon 11. , 2012, Molecular genetics and metabolism.

[11]  T A Thanaraj,et al.  Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human. , 2002, Human molecular genetics.

[12]  Elisa Goina,et al.  Binding of DAZAP1 and hnRNPA1/A2 to an Exonic Splicing Silencer in a Natural BRCA1 Exon 18 Mutant , 2008, Molecular and Cellular Biology.

[13]  T. Fukao,et al.  A novel mutation (c.951C>T) in an exonic splicing enhancer results in exon 10 skipping in the human mitochondrial acetoacetyl-CoA thiolase gene. , 2010, Molecular genetics and metabolism.

[14]  Tom Maniatis,et al.  Specific transcription and RNA splicing defects in five cloned β-thalassaemia genes , 1983, Nature.

[15]  P. Jordan,et al.  A missense mutation in the APC tumor suppressor gene disrupts an ASF/SF2 splicing enhancer motif and causes pathogenic skipping of exon 14. , 2009, Mutation research.

[16]  E. Mucaki,et al.  Comprehensive prediction of mRNA splicing effects of BRCA1 and BRCA2 variants , 2011, Human mutation.

[17]  T. Maniatis,et al.  Serine/arginine-rich protein-dependent suppression of exon skipping by exonic splicing enhancers. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Peter K Rogan,et al.  Automated splicing mutation analysis by information theory , 2005, Human mutation.

[19]  J. Murray,et al.  Analysis of RNA splicing defects in PITX2 mutants supports a gene dosage model of Axenfeld-Rieger syndrome , 2006, BMC Medical Genetics.

[20]  T. Maniatis,et al.  Arginine/serine-rich domains of SR proteins can function as activators of pre-mRNA splicing. , 1998, Molecular cell.

[21]  M. Vidaud,et al.  A 5' splice-region G----C mutation in exon 1 of the human beta-globin gene inhibits pre-mRNA splicing: a mechanism for beta+-thalassemia. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Benjamin J. Raphael,et al.  Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes , 2011, Proceedings of the National Academy of Sciences.

[23]  A. Krainer,et al.  Identification of Functional Exonic Splicing Enhancer Motifs Recognized by Individual Sr Proteins Using an in Vitro Randomization and Functional Selection Procedure, We Have Identified Three Novel Classes of Exonic Splicing Enhancers (eses) Recognized by Human Sf2/asf, Srp40, and Srp55, Respectively , 2022 .

[24]  S. Berget Exon Recognition in Vertebrate Splicing (*) , 1995, The Journal of Biological Chemistry.

[25]  M. Rodés,et al.  Analysis of the CTNS gene in 32 cystinosis patients from Spain , 2009, Clinical genetics.

[26]  R Kole,et al.  Cooperation of pre-mRNA sequence elements in splice site selection , 1992, Molecular and cellular biology.

[27]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[28]  Michael Q. Zhang,et al.  Exonic Splicing Enhancer Motif Recognized by Human SC35 under Splicing Conditions , 2000, Molecular and Cellular Biology.

[29]  Gary D. Stormo,et al.  Delila system tools , 1984, Nucleic Acids Res..

[30]  T. D. Schneider,et al.  Information content of individual genetic sequences. , 1997, Journal of theoretical biology.

[31]  J. Struewing,et al.  CDKN2A point mutations D153spl(c.457G>T) and IVS2+1G>T result in aberrant splice products affecting both p16INK4a and p14ARF , 2003, Oncogene.

[32]  E. Buratti,et al.  Exon and intron definition in pre‐mRNA splicing , 2013, Wiley interdisciplinary reviews. RNA.

[33]  K. Imaizumi,et al.  Identification of a Cis-acting Element for the Regulation ofSMN Exon 7 Splicing* , 2002, The Journal of Biological Chemistry.

[34]  Kate B. Cook,et al.  RBPDB: a database of RNA-binding specificities , 2010, Nucleic Acids Res..

[35]  M. Gabut,et al.  The SR Protein SC35 Is Responsible for Aberrant Splicing of the E1α Pyruvate Dehydrogenase mRNA in a Case of Mental Retardation with Lactic Acidosis , 2005, Molecular and Cellular Biology.

[36]  S. Gabriel,et al.  Whole exome sequencing identifies a splicing mutation in NSUN2 as a cause of a Dubowitz-like syndrome , 2012, Journal of Medical Genetics.

[37]  E. Lander,et al.  Exome sequencing identifies GATA1 mutations resulting in Diamond-Blackfan anemia. , 2012, The Journal of clinical investigation.

[38]  F. Pagani,et al.  A High Proportion of DNA Variants of BRCA1 and BRCA2 Is Associated with Aberrant Splicing in Breast/Ovarian Cancer Patients , 2010, Clinical Cancer Research.

[39]  N. Anagnou,et al.  Beta-thalassemia resulting from a single nucleotide substitution in an acceptor splice site. , 1985, Nucleic acids research.

[40]  A. Di Rienzo,et al.  Characterization of a novel splicing variant in the RAPTOR gene. , 2009, Mutation research.

[41]  Peter K. Rogan,et al.  Ab initio exon definition using an information theory-based approach , 2009, 2009 43rd Annual Conference on Information Sciences and Systems.

[42]  A. Krainer,et al.  Disruption of an SF2/ASF-dependent exonic splicing enhancer in SMN2 causes spinal muscular atrophy in the absence of SMN1 , 2002, Nature Genetics.

[43]  Xiaowei Chen,et al.  Intronic alterations in BRCA1 and BRCA2: effect on mRNA splicing fidelity and expression , 2006, Human mutation.

[44]  D. Hwang,et al.  U1 small nuclear RNA-promoted exon selection requires a minimal distance between the position of U1 binding and the 3' splice site across the exon , 1997, Molecular and cellular biology.

[45]  O. Díez,et al.  The variants BRCA1 IVS6-1G>A and BRCA2 IVS15+1G>A lead to aberrant splicing of the transcripts , 2009, Breast Cancer Research and Treatment.

[46]  Gil Ast,et al.  Overlapping splicing regulatory motifs—combinatorial effects on splicing , 2010, Nucleic acids research.

[47]  J. Vandesompele,et al.  Pathological splice mutations outside the invariant AG/GT splice sites of BRCA1 exon 5 increase alternative transcript levels in the 5′ end of the BRCA1 gene , 2002, Oncogene.

[48]  H. Butzkueven,et al.  Common variation in the MOG gene influences transcript splicing in humans , 2010, Journal of Neuroimmunology.

[49]  L. Kalaydjieva,et al.  Aberrant splicing of phenylalanine hydroxylase mRNA: the major cause for phenylketonuria in parts of southern Europe. , 1991, Genomics.

[50]  B. Grandchamp,et al.  A 5′ splice region G → C mutation in exon 3 of the human β‐spectrin gene leads to decreased levels of β‐spectrin mRNA and is responsible for dominant hereditary spherocytosis (spectrin Guemene‐Penfao) , 1998, British journal of haematology.

[51]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[52]  K. Weinberg,et al.  Novel splicing, missense, and deletion mutations in seven adenosine deaminase-deficient patients with late/delayed onset of combined immunodeficiency disease. Contribution of genotype to phenotype. , 1993, The Journal of clinical investigation.

[53]  B. Asselain,et al.  Impact of BRCA1 and BRCA2 variants on splicing: clues from an allelic imbalance study , 2009, European Journal of Human Genetics.

[54]  M. Ugarte,et al.  Qualitative and quantitative analysis of the effect of splicing mutations in propionic acidemia underlying non-severe phenotypes , 2004, Human Genetics.

[55]  R. E. Tully,et al.  Locus Reference Genomic sequences: an improved basis for describing human DNA variants , 2010, Genome Medicine.

[56]  Petr Divina,et al.  Ab initio prediction of mutation-induced cryptic splice-site activation and exon skipping , 2009, European Journal of Human Genetics.

[57]  T. D. Schneider,et al.  Anatomy of Escherichia coli ribosome binding sites. , 2001, Journal of molecular biology.

[58]  Michael Q. Zhang,et al.  An increased specificity score matrix for the prediction of SF2/ASF-specific exonic splicing enhancers. , 2006, Human molecular genetics.

[59]  Peter Johnson,et al.  Prediction of single‐nucleotide substitutions that result in exon skipping: identification of a splicing silencer in BRCA1 exon 6 , 2011, Human mutation.

[60]  C. P. Morris,et al.  Iduronate-2-sulfatase gene mutations in 16 patients with mucopolysaccharidosis type II (Hunter syndrome). , 1993, Human molecular genetics.

[61]  Thangavel Alphonse Thanaraj,et al.  ASD: a bioinformatics resource on alternative splicing , 2005, Nucleic Acids Res..

[62]  Ludwine Messiaen,et al.  Differentiating pathogenic mutations from polymorphic alterations in the splice sites of BRCA1 and BRCA2 , 2003, Genes, chromosomes & cancer.

[63]  G. Holder,et al.  ADVIRC is caused by distinct mutations in BEST1 that alter pre-mRNA splicing , 2008, Journal of Medical Genetics.

[64]  S. Berget,et al.  Exon definition may facilitate splice site selection in RNAs with multiple exons. , 1990, Molecular and cellular biology.

[65]  F. Natacci,et al.  NF1 exon 7 skipping and sequence alterations in exonic splice enhancers (ESEs) in a neurofibromatosis 1 patient , 2003, Human Genetics.

[66]  Zhujun Zhang,et al.  Splicing analysis disclosed a determinant single nucleotide for exon skipping caused by a novel intraexonic four-nucleotide deletion in the dystrophin gene , 2006, Journal of Medical Genetics.

[67]  Peter K Rogan,et al.  Information theory-based analysis of CYP2C19, CYP2D6 and CYP3A5 splicing mutations. , 2003, Pharmacogenetics.