Intrinsic Disorder in the Human Spliceosomal Proteome

The spliceosome is a molecular machine that performs the excision of introns from eukaryotic pre-mRNAs. This macromolecular complex comprises in human cells five RNAs and over one hundred proteins. In recent years, many spliceosomal proteins have been found to exhibit intrinsic disorder, that is to lack stable native three-dimensional structure in solution. Building on the previous body of proteomic, structural and functional data, we have carried out a systematic bioinformatics analysis of intrinsic disorder in the proteome of the human spliceosome. We discovered that almost a half of the combined sequence of proteins abundant in the spliceosome is predicted to be intrinsically disordered, at least when the individual proteins are considered in isolation. The distribution of intrinsic order and disorder throughout the spliceosome is uneven, and is related to the various functions performed by the intrinsic disorder of the spliceosomal proteins in the complex. In particular, proteins involved in the secondary functions of the spliceosome, such as mRNA recognition, intron/exon definition and spliceosomal assembly and dynamics, are more disordered than proteins directly involved in assisting splicing catalysis. Conserved disordered regions in spliceosomal proteins are evolutionarily younger and less widespread than ordered domains of essential spliceosomal proteins at the core of the spliceosome, suggesting that disordered regions were added to a preexistent ordered functional core. Finally, the spliceosomal proteome contains a much higher amount of intrinsic disorder predicted to lack secondary structure than the proteome of the ribosome, another large RNP machine. This result agrees with the currently recognized different functions of proteins in these two complexes.

[1]  C. Vonrhein,et al.  Structure of the 30S ribosomal subunit , 2000, Nature.

[2]  K. Nagai,et al.  Structure of the spliceosomal U4 snRNP core domain and its implication for snRNP biogenesis , 2011, Nature.

[3]  Christopher A Jackson,et al.  Protein Arginine Methylation Facilitates Cotranscriptional Recruitment of Pre-mRNA Splicing Factors , 2010, Molecular and Cellular Biology.

[4]  M. Blackledge,et al.  Structural characterization of flexible proteins using small-angle X-ray scattering. , 2007, Journal of the American Chemical Society.

[5]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[6]  Melissa S Jurica,et al.  Pre-mRNA splicing: awash in a sea of proteins. , 2003, Molecular cell.

[7]  Lilia M. Iakoucheva,et al.  Serine/arginine-rich splicing factors belong to a class of intrinsically disordered proteins , 2006, Nucleic acids research.

[8]  A Keith Dunker,et al.  Another window into disordered protein function. , 2007, Structure.

[9]  Alexander Varshavsky,et al.  N-Terminal Acetylation of Cellular Proteins Creates Specific Degradation Signals , 2010, Science.

[10]  E. Sontheimer,et al.  A role for ubiquitin in the spliceosome assembly pathway , 2008, Nature Structural &Molecular Biology.

[11]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[12]  H. Urlaub,et al.  Isolation of an active step I spliceosome and composition of its RNP core , 2008, Nature.

[13]  Andrew M. MacMillan,et al.  Crystal structure of a core spliceosomal protein interface , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Mark T Bedford,et al.  Arginine methylation an emerging regulator of protein function. , 2005, Molecular cell.

[15]  Michael Sattler,et al.  U2AF-homology motif interactions are required for alternative splicing regulation by SPF45 , 2007, Nature Structural &Molecular Biology.

[16]  B. Séraphin,et al.  Proteomic analysis identifies a new complex required for nuclear pre‐mRNA retention and splicing , 2004, The EMBO journal.

[17]  Henning Urlaub,et al.  Characterization of purified human Bact spliceosomal complexes reveals compositional and morphological changes during spliceosome activation and first step catalysis. , 2010, RNA.

[18]  Z. Gryczynski,et al.  Multiple U2AF65 binding sites within SF3b155: thermodynamic and spectroscopic characterization of protein-protein interactions among pre-mRNA splicing factors. , 2006, Journal of molecular biology.

[19]  Steven P. Gygi,et al.  Comprehensive proteomic analysis of the human spliceosome , 2002, Nature.

[20]  R. Lührmann,et al.  Proline-rich Sequence Recognition , 2009, Molecular & Cellular Proteomics.

[21]  Henning Urlaub,et al.  The evolutionarily conserved core design of the catalytic activation step of the yeast spliceosome. , 2009, Molecular cell.

[22]  A. Steven,et al.  Glycine loops in proteins: their occurrence in certain intermediate filament chains, loricrins and single-stranded RNA binding proteins. , 1991, International journal of biological macromolecules.

[23]  T. Härd,et al.  Solution structure of the ribosomal protein S19 from Thermus thermophilus. , 1999, Journal of molecular biology.

[24]  M. Yusupov,et al.  Crystal Structure of the Eukaryotic Ribosome , 2010, Science.

[25]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[26]  C. Brown,et al.  Intrinsic protein disorder in complete genomes. , 2000, Genome informatics. Workshop on Genome Informatics.

[27]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[28]  Henning Urlaub,et al.  Semiquantitative Proteomic Analysis of the Human Spliceosome via a Novel Two-Dimensional Gel Electrophoresis Method , 2011, Molecular and Cellular Biology.

[29]  Peter Tompa,et al.  Structure and Function of Intrinsically Disordered Proteins , 2009 .

[30]  Janusz M. Bujnicki,et al.  GeneSilico protein structure prediction meta-server , 2003, Nucleic Acids Res..

[31]  Poul Nissen,et al.  The social life of ribosomal proteins , 2005, The FEBS journal.

[32]  J. Manley,et al.  Phosphorylation of the ASF/SF2 RS domain affects both protein-protein and protein-RNA interactions and is necessary for splicing. , 1997, Genes & development.

[33]  Leszek Rychlewski,et al.  ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins , 2003, Nucleic Acids Res..

[34]  R. Lührmann,et al.  Crystal structure of a complex between human spliceosomal cyclophilin H and a U4/U6 snRNP-60K peptide. , 2003, Journal of molecular biology.

[35]  K. Nagai,et al.  Crystal Structures of Two Sm Protein Complexes and Their Implications for the Assembly of the Spliceosomal snRNPs , 1999, Cell.

[36]  C. Will,et al.  Splicing of a rare class of introns by the U12-dependent spliceosome , 2005, Biological chemistry.

[37]  Peter Tompa,et al.  Structural disorder promotes assembly of protein complexes , 2007, BMC Structural Biology.

[38]  Akihiro Nakao,et al.  RPG: the Ribosomal Protein Gene database , 2004, Nucleic Acids Res..

[39]  S. Sugano,et al.  Solution structures of the SURP domains and the subunit-assembly mechanism within the splicing factor SF3a complex in 17S U2 snRNP. , 2006, Structure.

[40]  Monika Fuxreiter,et al.  Close encounters of the third kind: disordered domains and the interactions of proteins , 2009, BioEssays : news and reviews in molecular, cellular and developmental biology.

[41]  R. Lührmann,et al.  Symmetrical dimethylation of arginine residues in spliceosomal Sm protein B/B' and the Sm-like protein LSm4, and their interaction with the SMN protein. , 2001, RNA.

[42]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[43]  J. Woolford,et al.  Assembly of ribosomes and spliceosomes: complex ribonucleoprotein machines. , 2009, Current opinion in cell biology.

[44]  P. Sharp,et al.  The SRm160/300 splicing coactivator subunits. , 2000, RNA.

[45]  Henning Urlaub,et al.  Protein Composition and Electron Microscopy Structure of Affinity-Purified Human Spliceosomal B Complexes Isolated under Physiological Conditions , 2006, Molecular and Cellular Biology.

[46]  Michael Sattler,et al.  Structural basis for the molecular recognition between human splicing factors U2AF65 and SF1/mBBP. , 2003, Molecular cell.

[47]  T L Blundell,et al.  Properties of polyproline II, a secondary structure element implicated in protein–protein interactions , 2005, Proteins.

[48]  Tracy L. Johnson,et al.  A bird's-eye view of post-translational modifications in the spliceosome and their roles in spliceosome dynamics. , 2010, Molecular bioSystems.

[49]  A. Krainer,et al.  Arginine Methylation Controls the Subcellular Localization and Functions of the Oncoprotein Splicing Factor SF2/ASF , 2010, Molecular and Cellular Biology.

[50]  G. Dreyfuss,et al.  In vivo and in vitro arginine methylation of RNA-binding proteins , 1995, Molecular and cellular biology.

[51]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[52]  A. Mushegian,et al.  Prp8, the pivotal protein of the spliceosomal catalytic center, evolved from a retroelement-encoded reverse transcriptase. , 2011, RNA.

[53]  S. Riva,et al.  Interaction of hnRNP A1 with snRNPs and pre-mRNAs: evidence for a possible role of A1 RNA annealing activity in the first steps of spliceosome assembly. , 1992, Nucleic acids research.

[54]  Lesley Collins,et al.  Complex spliceosomal organization ancestral to extant eukaryotes. , 2005, Molecular biology and evolution.

[55]  Zsuzsanna Dosztányi,et al.  ANCHOR: web server for predicting protein binding regions in disordered proteins , 2009, Bioinform..

[56]  S. Riva,et al.  hnRNP A1 selectively interacts through its Gly-rich domain with different RNA-binding proteins. , 1996, Journal of molecular biology.

[57]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[58]  S. Valadkhan,et al.  The spliceosomal proteome: At the heart of the largest cellular ribonucleoprotein machine , 2010, Proteomics.

[59]  J. Ebert,et al.  The Crystal Structure of the Exon Junction Complex Reveals How It Maintains a Stable Grip on mRNA , 2006, Cell.

[60]  Henning Urlaub,et al.  The human 18S U11/U12 snRNP contains a set of novel proteins not found in the U2-dependent spliceosome. , 2004, RNA.

[61]  M. Olive,et al.  hnRNP A1 Recruited to an Exon In Vivo Can Function as an Exon Splicing Silencer , 1999, Molecular and Cellular Biology.

[62]  K. F. Dyer,et al.  The Quiet Revolution: A New Synthesis of Biological Knowledge. , 1971 .

[63]  Henning Urlaub,et al.  Composition and three‐dimensional EM structure of double affinity‐purified, human prespliceosomal A complexes , 2007, The EMBO journal.

[64]  H. Stark,et al.  Cryo-electron microscopy of spliceosomal components. , 2006, Annual review of biophysics and biomolecular structure.

[65]  Philip E. Bourne,et al.  Sm/Lsm Genes Provide a Glimpse into the Early Evolution of the Spliceosome , 2009, PLoS Comput. Biol..

[66]  Janusz M. Bujnicki,et al.  Structural bioinformatics of the human spliceosomal proteome , 2012, Nucleic acids research.

[67]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[68]  D. Svergun,et al.  Structural analysis of intrinsically disordered proteins by small-angle X-ray scattering. , 2012, Molecular bioSystems.

[69]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[70]  Janusz M. Bujnicki,et al.  MetaDisorder: a meta-server for the prediction of intrinsic disorder in proteins , 2012, BMC Bioinformatics.

[71]  S. Valadkhan The spliceosome: caught in a web of shifting interactions. , 2007, Current opinion in structural biology.

[72]  J. Cáceres,et al.  The SR protein family of splicing factors: master regulators of gene expression. , 2009, The Biochemical journal.

[73]  C. Oubridge,et al.  Crystal structure of human spliceosomal U1 snRNP at 5.5 Å resolution , 2009, Nature.

[74]  John A. Calarco,et al.  Regulation of Vertebrate Nervous System Alternative Splicing and Development by an SR-Related Protein , 2009, Cell.

[75]  J. McPherson,et al.  In Search of a Function for BCLAF1 , 2010, TheScientificWorldJournal.

[76]  H. Urlaub,et al.  Phosphorylation of human PRP28 by SRPK2 is required for integration of the U4/U6-U5 tri-snRNP into the spliceosome , 2008, Nature Structural &Molecular Biology.

[77]  Woan-Yuh Tarn,et al.  A Novel Spliceosome Containing U11, U12, and U5 snRNPs Excises a Minor Class (AT–AC) Intron In Vitro , 1996, Cell.

[78]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[79]  Henning Urlaub,et al.  Small Nuclear Ribonucleoprotein Remodeling During Catalytic Activation of the Spliceosome , 2002, Science.

[80]  Michael R. Green,et al.  A Novel Peptide Recognition Mode Revealed by the X-Ray Structure of a Core U2AF35/U2AF65 Heterodimer , 2001, Cell.

[81]  M. Akke,et al.  Conformation and dynamics of ribosomal stalk protein L12 in solution and on the ribosome. , 2004, Biochemistry.

[82]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[83]  M. Garcia-Blanco,et al.  SR proteins escort the U4/U6.U5 tri-snRNP to the spliceosome. , 1995, RNA.

[84]  G. Dreyfuss,et al.  SMN, the product of the spinal muscular atrophy gene, binds preferentially to dimethylarginine-containing protein targets. , 2001, Molecular cell.

[85]  Adam Godzik,et al.  Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. , 2007, Structure.

[86]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[87]  C. Will,et al.  The Spliceosome: Design Principles of a Dynamic RNP Machine , 2009, Cell.

[88]  Adam Godzik,et al.  Strong functional patterns in the evolution of eukaryotic genomes revealed by the reconstruction of ancestral protein domain repertoires , 2011, Genome Biology.

[89]  E. Westhof,et al.  The ribozyme core of group II introns: a structure in want of partners. , 2009, Trends in biochemical sciences.

[90]  Michael R. Green,et al.  U2AF homology motifs: protein recognition in the RRM world. , 2004, Genes & development.

[91]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[92]  Ross Smith,et al.  Functional diversity of the hnRNPs: past, present and perspectives. , 2010, The Biochemical journal.