Insights into the regulation of intrinsically disordered proteins in the human proteome by analyzing sequence and gene expression data

BackgroundDisordered proteins need to be expressed to carry out specified functions; however, their accumulation in the cell can potentially cause major problems through protein misfolding and aggregation. Gene expression levels, mRNA decay rates, microRNA (miRNA) targeting and ubiquitination have critical roles in the degradation and disposal of human proteins and transcripts. Here, we describe a study examining these features to gain insights into the regulation of disordered proteins.ResultsIn comparison with ordered proteins, disordered proteins have a greater proportion of predicted ubiquitination sites. The transcripts encoding disordered proteins also have higher proportions of predicted miRNA target sites and higher mRNA decay rates, both of which are indicative of the observed lower gene expression levels. The results suggest that the disordered proteins and their transcripts are present in the cell at low levels and/or for a short time before being targeted for disposal. Surprisingly, we find that for a significant proportion of highly disordered proteins, all four of these trends are reversed. Predicted estimates for miRNA targets, ubiquitination and mRNA decay rate are low in the highly disordered proteins that are constitutively and/or highly expressed.ConclusionsMechanisms are in place to protect the cell from these potentially dangerous proteins. The evidence suggests that the enrichment of signals for miRNA targeting and ubiquitination may help prevent the accumulation of disordered proteins in the cell. Our data also provide evidence for a mechanism by which a significant proportion of highly disordered proteins (with high expression levels) can escape rapid degradation to allow them to successfully carry out their function.

[1]  H. Lodish,et al.  Micromanagement of the immune system by microRNAs , 2008, Nature Reviews Immunology.

[2]  C. Croce,et al.  MicroRNA signatures in human cancers , 2006, Nature Reviews Cancer.

[3]  G. Church,et al.  Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae , 2001, Nature Genetics.

[4]  Q. Cui,et al.  Principles of microRNA regulation of a human cellular signaling network , 2006, Molecular systems biology.

[5]  D. Rubinsztein,et al.  The roles of intracellular protein-degradation pathways in neurodegeneration , 2006, Nature.

[6]  V. Uversky Alpha-synuclein misfolding and neurodegenerative diseases. , 2008, Current protein & peptide science.

[7]  Bernd Bukau,et al.  The N-end rule pathway for regulated proteolysis: prokaryotic and eukaryotic strategies. , 2007, Trends in cell biology.

[8]  G. Hannon,et al.  A complex system of small RNAs in the unicellular green alga Chlamydomonas reinhardtii. , 2007, Genes & development.

[9]  David T. Jones,et al.  Improving the accuracy of transmembrane protein topology prediction using evolutionary information , 2007, Bioinform..

[10]  A Keith Dunker,et al.  Protein intrinsic disorder and human papillomaviruses: increased amount of disorder in E6 and E7 oncoproteins from high risk HPVs. , 2006, Journal of proteome research.

[11]  Martin Rechsteiner,et al.  Recognition of the polyubiquitin proteolytic signal , 2000, The EMBO journal.

[12]  Zhijin Wu,et al.  Preprocessing of oligonucleotide array data , 2004, Nature Biotechnology.

[13]  I. Mellman,et al.  Coordinated protein sorting, targeting and distribution in polarized cells , 2008, Nature Reviews Molecular Cell Biology.

[14]  L. Iakoucheva,et al.  Intrinsic Disorder and Protein Function , 2002 .

[15]  Kristin C. Gunsalus,et al.  microRNA Target Predictions across Seven Drosophila Species and Comparison to Mammalian Targets , 2005, PLoS Comput. Biol..

[16]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[17]  A Keith Dunker,et al.  Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. , 2007, Journal of proteome research.

[18]  Anton J. Enright,et al.  Prediction of microRNA targets. , 2007, Drug discovery today.

[19]  M. Willis,et al.  The ubiquitin-proteasome system in cardiac dysfunction. , 2008, Biochimica et biophysica acta.

[20]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[21]  V. Ambros The functions of animal microRNAs , 2004, Nature.

[22]  Linda Hicke,et al.  Regulation of membrane protein transport by ubiquitin and ubiquitin-binding proteins. , 2003, Annual review of cell and developmental biology.

[23]  M. Gerstein,et al.  Relating whole-genome expression data with protein-protein interactions. , 2002, Genome research.

[24]  J. Dice Chaperone-Mediated Autophagy , 2007 .

[25]  S. Vucetic,et al.  Flavors of protein disorder , 2003, Proteins.

[26]  V. Uversky Intrinsically Disordered Proteins , 2000 .

[27]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[28]  Shinn-Ying Ho,et al.  Computational identification of ubiquitylation sites from protein sequences , 2008, BMC Bioinformatics.

[29]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[30]  Marc S. Cortese,et al.  Comparing and combining predictors of mostly disordered proteins. , 2005, Biochemistry.

[31]  C. Croce,et al.  MicroRNA-cancer connection: the beginning of a new tale. , 2006, Cancer research.

[32]  R. Deshaies,et al.  Diverse roles for ubiquitin-dependent proteolysis in transcriptional activation , 2003, Nature Cell Biology.

[33]  Dennis B. Troup,et al.  NCBI GEO: mining tens of millions of expression profiles—database and tools update , 2006, Nucleic Acids Res..

[34]  Peter Tompa,et al.  Intrinsically Disordered Proteins Display No Preference for Chaperone Binding In Vivo , 2008, PLoS Comput. Biol..

[35]  A. Dunker,et al.  Protein disorder is positively correlated with gene expression in Escherichia coli. , 2008, Journal of proteome research.

[36]  D. Baulcombe,et al.  miRNAs control gene expression in the single-cell alga Chlamydomonas reinhardtii , 2007, Nature.

[37]  M. Magnasco,et al.  Decay rates of human mRNAs: correlation with functional characteristics and sequence attributes. , 2003, Genome research.

[38]  A. Hatzigeorgiou,et al.  A guide through present computational approaches for the identification of mammalian microRNA targets , 2006, Nature Methods.

[39]  O. Hobert Gene Regulation by Transcription Factors and MicroRNAs , 2008, Science.

[40]  P. Tompa Intrinsically unstructured proteins. , 2002, Trends in biochemical sciences.

[41]  Lilia M. Iakoucheva,et al.  Intrinsic Disorder Is a Common Feature of Hub Proteins from Four Eukaryotic Interactomes , 2006, PLoS Comput. Biol..

[42]  Christopher J. Oldfield,et al.  Intrinsically disordered protein. , 2001, Journal of molecular graphics & modelling.

[43]  N. Rajewsky microRNA target predictions in animals , 2006, Nature Genetics.

[44]  C. Patterson,et al.  The Bitter End: The Ubiquitin-Proteasome System and Cardiac Dysfunction , 2007, Circulation.

[45]  S. Teichmann,et al.  Tight Regulation of Unstructured Proteins: From Transcript Synthesis to Protein Degradation , 2008, Science.

[46]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[47]  B. Ha,et al.  Structures of proteases for ubiqutin and ubiquitin-like modifiers. , 2008, BMB reports.

[48]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. , 2007, Journal of proteome research.

[49]  C. Brown,et al.  Intrinsic protein disorder in complete genomes. , 2000, Genome informatics. Workshop on Genome Informatics.

[50]  Zoran Obradovic,et al.  The protein trinity—linking function and disorder , 2001, Nature Biotechnology.

[51]  P. Lansbury,et al.  NACP, a protein implicated in Alzheimer's disease and learning, is natively unfolded. , 1996, Biochemistry.

[52]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[53]  A. Dunker,et al.  Abundance of intrinsic disorder in protein associated with cardiovascular disease. , 2006, Biochemistry.

[54]  V. Uversky,et al.  Why are “natively unfolded” proteins unstructured under physiologic conditions? , 2000, Proteins.

[55]  F. Marshall Previously unidentified changes in renal cell carcinoma gene expression identified by parametric analysis of microarray data. , 2005, The Journal of urology.

[56]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[57]  Vladimir N Uversky,et al.  Amyloidogenesis of natively unfolded proteins. , 2008, Current Alzheimer research.

[58]  A. Goldberg,et al.  Protein degradation and protection against misfolded or damaged proteins , 2003, Nature.

[59]  Qikai Xu,et al.  Global Protein Stability Profiling in Mammalian Cells , 2008, Science.

[60]  P. Tompa,et al.  Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions. , 2008, Trends in biochemical sciences.

[61]  Yi Wen Kong,et al.  How do microRNAs regulate gene expression? , 2008, Biochemical Society transactions.

[62]  P. Romero,et al.  Sequence complexity of disordered protein , 2001, Proteins.

[63]  P. Lansbury,et al.  Amyloid fibrillogenesis: themes and variations. , 2000, Current opinion in structural biology.

[64]  Christopher J. Oldfield,et al.  Intrinsically disordered proteins in human diseases: introducing the D2 concept. , 2008, Annual review of biophysics.

[65]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[66]  Christine A. Orengo,et al.  Inferring Function Using Patterns of Native Disorder in Proteins , 2007, PLoS Comput. Biol..

[67]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[68]  A. Caudy,et al.  Regulation of Transcriptional Activation Domain Function by Ubiquitin , 2001, Science.

[69]  Ariel Fernández,et al.  Protein structure protection commits gene expression patterns , 2008, Genome Biology.

[70]  D. Fushman,et al.  Polyubiquitin chains: polymeric protein signals. , 2004, Current opinion in chemical biology.

[71]  E. Scalbert,et al.  Implication of microRNAs in the cardiovascular system. , 2008, Current opinion in pharmacology.

[72]  Amos Bairoch,et al.  Annotation of post‐translational modifications in the Swiss‐Prot knowledge base , 2004, Proteomics.

[73]  P. Tompa The interplay between structure and function in intrinsically unstructured proteins , 2005, FEBS letters.

[74]  S. Cohen,et al.  microRNAs in neurodegeneration , 2008, Current Opinion in Neurobiology.

[75]  Daniela Hoeller,et al.  Ubiquitin and ubiquitin-like proteins in cancer pathogenesis , 2006, Nature Reviews Cancer.

[76]  Wen-Hsiung Li,et al.  MicroRNA regulation of human protein protein interaction network. , 2007, RNA.

[77]  V. Uversky,et al.  Protein folding revisited. A polypeptide chain at the folding – misfolding – nonfolding cross-roads: which way to go? , 2003, Cellular and Molecular Life Sciences CMLS.

[78]  J L Sussman,et al.  Structural disorder serves as a weak signal for intracellular protein degradation , 2008, Proteins.

[79]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[80]  A. Dunker,et al.  Controlled Chaos , 2008, Science.

[81]  H. Lodish,et al.  Micromanagement of the immune system by microRNAs , 2008, Nature Reviews Immunology.

[82]  Marc S. Cortese,et al.  Flexible nets , 2005, The FEBS journal.

[83]  S. Elledge,et al.  Identification of SCF Ubiquitin Ligase Substrates by Global Protein Stability Profiling , 2008, Science.

[84]  Barry Robson,et al.  Protein folding revisited. , 2008, Progress in molecular biology and translational science.

[85]  A Keith Dunker,et al.  Intrinsic disorder in scaffold proteins: getting more from less. , 2008, Progress in biophysics and molecular biology.

[86]  Peter Tompa,et al.  Structural disorder promotes assembly of protein complexes , 2007, BMC Structural Biology.

[87]  J. Dice Chaperone-Mediated Autophagy , 2007, Autophagy.

[88]  Ivan Dikic,et al.  Ubiquitylation and cell signaling , 2005, The EMBO journal.

[89]  A. Hatzigeorgiou,et al.  TarBase: A comprehensive database of experimentally supported animal microRNA targets. , 2005, RNA.

[90]  P. Tompa,et al.  Malleable machines take shape in eukaryotic transcriptional regulation. , 2008, Nature chemical biology.

[91]  T JonesDavid,et al.  The DISOPRED server for the prediction of protein disorder , 2004 .

[92]  L. Iakoucheva,et al.  Intrinsic disorder in cell-signaling and cancer-associated proteins. , 2002, Journal of molecular biology.

[93]  Edwin Wang,et al.  MicroRNAs preferentially target the genes with high transcriptional regulation complexity. , 2006, Biochemical and biophysical research communications.

[94]  Bernard F. Buxton,et al.  The DISOPRED server for the prediction of protein disorder , 2004, Bioinform..

[95]  Andreas Prlic,et al.  Ensembl 2008 , 2007, Nucleic Acids Res..

[96]  R. Mayer,et al.  Ubiquitin and ubiquitin-like proteins as multifunctional signals , 2005, Nature Reviews Molecular Cell Biology.

[97]  R. Myers,et al.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data , 2005, Nucleic acids research.

[98]  Sue Povey,et al.  The HGNC Database in 2008: a resource for the human genome , 2007, Nucleic Acids Res..

[99]  G. Orphanides,et al.  A Unified Theory of Gene Expression , 2002, Cell.

[100]  J. Astola,et al.  Systematic bioinformatic analysis of expression levels of 17,330 human genes across 9,783 samples from 175 types of healthy and pathological tissues , 2008, Genome Biology.

[101]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[102]  Michail Yu. Lobanov,et al.  Prediction of Amyloidogenic and Disordered Regions in Protein Chains , 2006, PLoS Comput. Biol..

[103]  E. Birney,et al.  EnsMart: a generic system for fast and flexible access to biological data. , 2003, Genome research.

[104]  Marco Botta,et al.  Microarray data analysis and mining approaches. , 2008, Briefings in functional genomics & proteomics.

[105]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[106]  M. Bolognesi,et al.  Function and Structure of Inherently Disordered Proteins This Review Comes from a Themed Issue on Proteins Edited Prediction of Non-folding Proteins and Regions Frequency of Disordered Regions Protein Evolution Partitioning Unstructured Proteins and Regions into Groups Involvement of Inherently Diso , 2022 .

[107]  W R Taylor,et al.  A model recognition approach to the prediction of all-helical membrane protein structure and topology. , 1994, Biochemistry.

[108]  P. Tompa,et al.  Janus chaperones: Assistance of both RNA‐ and protein‐folding by ribosomal proteins , 2009, FEBS letters.

[109]  H. Aburatani,et al.  Interpreting expression profiles of cancers by genome-wide survey of breadth of expression in normal tissues. , 2005, Genomics.

[110]  A. Raghavan,et al.  Microarray-based analyses of mRNA decay in the regulation of mammalian gene expression. , 2004, Briefings in functional genomics & proteomics.

[111]  Howard Riezman,et al.  Proteasome-Independent Functions of Ubiquitin in Endocytosis and Signaling , 2007, Science.

[112]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature reviews genetics.

[113]  M. Muratani,et al.  How the ubiquitin–proteasome system controls transcription , 2003, Nature Reviews Molecular Cell Biology.

[114]  René Bernards,et al.  A Genomic and Functional Inventory of Deubiquitinating Enzymes , 2005, Cell.

[115]  N. Graham,et al.  Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation , 2002 .