Molecular recognition features (MoRFs) in three domains of life.

Intrinsically disordered proteins and protein regions offer numerous advantages in the context of protein-protein interactions when compared to the structured proteins and domains. These advantages include ability to interact with multiple partners, to fold into different conformations when bound to different partners, and to undergo disorder-to-order transitions concomitant with their functional activity. Molecular recognition features (MoRFs) are widespread elements located in disordered regions that undergo disorder-to-order transition upon binding to their protein partners. We characterize abundance, composition, and functions of MoRFs and their association with the disordered regions across 868 species spread across Eukaryota, Bacteria and Archaea. We found that although disorder is substantially elevated in Eukaryota, MoRFs have similar abundance and amino acid composition across the three domains of life. The abundance of MoRFs is highly correlated with the amount of intrinsic disorder in Bacteria and Archaea but only modestly correlated in Eukaryota. Proteins with MoRFs have significantly more disorder and MoRFs are present in many disordered regions, with Eukaryota having more MoRF-free disordered regions. MoRF-containing proteins are enriched in the ribosome, nucleus, nucleolus and microtubule and are involved in translation, protein transport, protein folding, and interactions with DNAs. Our insights into the nature and function of MoRFs enhance our understanding of the mechanisms underlying the disorder-to-order transition and protein-protein recognition and interactions. The fMoRFpred method that we used to annotate MoRFs is available at http://biomine.ece.ualberta.ca/fMoRFpred/.

[1]  Vladimir N. Uversky,et al.  The Mysterious Unfoldome: Structureless, Underappreciated, Yet Vital Part of Any Given Proteome , 2009, Journal of biomedicine & biotechnology.

[2]  B. Pontius Close encounters: why unstructured, polymeric domains can increase rates of specific macromolecular association. , 1993, Trends in biochemical sciences.

[3]  Christopher J. Oldfield,et al.  Exploring the binding diversity of intrinsically disordered proteins involved in one‐to‐many binding , 2013, Protein science : a publication of the Protein Society.

[4]  Carol V Robinson,et al.  Studies of the RNA degradosome-organizing domain of the Escherichia coli ribonuclease RNase E. , 2004, Journal of molecular biology.

[5]  Robert H. Oakley,et al.  Cellular Processing of the Glucocorticoid Receptor Gene and Protein: New Mechanisms for Generating Tissue-specific Actions of Glucocorticoids* , 2010, The Journal of Biological Chemistry.

[6]  Lukasz Kurgan,et al.  Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life , 2014, Cellular and Molecular Life Sciences.

[7]  Marc S. Cortese,et al.  Rational drug design via intrinsically disordered protein. , 2006, Trends in biotechnology.

[8]  Richard J. Edwards,et al.  SLiMFinder: A Probabilistic Method for Identifying Over-Represented, Convergently Evolved, Short Linear Motifs in Proteins , 2007, PloS one.

[9]  A. Keith Dunker,et al.  Mining α-Helix-Forming Molecular Recognition Features with Cross Species Sequence Alignments† , 2007 .

[10]  L. Iakoucheva,et al.  Intrinsic Disorder and Protein Function , 2002 .

[11]  W. Marsden I and J , 2012 .

[12]  A Keith Dunker,et al.  The alphabet of intrinsic disorder , 2013, Intrinsically disordered proteins.

[13]  A G Cochran,et al.  Antagonists of protein-protein interactions. , 2000, Chemistry & biology.

[14]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[15]  Lukasz Kurgan,et al.  Untapped Potential of Disordered Proteins in Current Druggable Human Proteome. , 2016, Current drug targets.

[16]  Kengo Kinoshita,et al.  Domain distribution and intrinsic disorder in hubs in the human protein–protein interaction network , 2010, Protein science : a publication of the Protein Society.

[17]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[18]  Adam Godzik,et al.  Between order and disorder in protein structures: analysis of "dual personality" fragments in proteins. , 2007, Structure.

[19]  Avner Schlessinger,et al.  Large-scale analysis of thermostable, mammalian proteins provides insights into the intrinsically disordered proteome. , 2009, Journal of proteome research.

[20]  Lukasz A. Kurgan,et al.  MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins , 2012, Bioinform..

[21]  Zoran Obradovic,et al.  The protein trinity—linking function and disorder , 2001, Nature Biotechnology.

[22]  P. Tompa Intrinsically unstructured proteins. , 2002, Trends in biochemical sciences.

[23]  Vladimir N Uversky,et al.  Intrinsically disordered proteins and novel strategies for drug discovery , 2012, Expert opinion on drug discovery.

[24]  P. Tompa,et al.  Prevalent structural disorder in E. coli and S. cerevisiae proteomes. , 2006, Journal of proteome research.

[25]  P. Tompa The interplay between structure and function in intrinsically unstructured proteins , 2005, FEBS letters.

[26]  Predrag Radivojac,et al.  The structural and functional signatures of proteins that undergo multiple events of post‐translational modification , 2014, Protein science : a publication of the Protein Society.

[27]  V. Uversky Multitude of binding modes attainable by intrinsically disordered proteins: a portrait gallery of disorder-based complexes. , 2011, Chemical Society reviews.

[28]  A. Valencia,et al.  Prediction of protein--protein interaction sites in heterocomplexes with neural networks. , 2002, European journal of biochemistry.

[29]  A Keith Dunker,et al.  Drugs for 'protein clouds': targeting intrinsically disordered transcription factors. , 2010, Current opinion in pharmacology.

[30]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[31]  Lukasz Kurgan,et al.  NOT THAT RIGID MIDGETS AND NOT SO FLEXIBLE GIANTS: ON THE ABUNDANCE AND ROLES OF INTRINSIC DISORDER IN SHORT AND LONG PROTEINS , 2012 .

[32]  S. Sharma,et al.  Protein-protein interactions: lessons learned. , 2002, Current medicinal chemistry. Anti-cancer agents.

[33]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[34]  Lukasz Kurgan,et al.  RAPID: fast and accurate sequence-based prediction of intrinsic disorder content on proteomic scale. , 2013, Biochimica et biophysica acta.

[35]  Michael B Yaffe,et al.  Computational prediction of protein-protein interactions. , 2015, Methods in molecular biology.

[36]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[37]  J. Botto,et al.  The plant cell , 2007, Plant Molecular Biology Reporter.

[38]  P. Y. Chou,et al.  Empirical predictions of protein conformation. , 1978, Annual review of biochemistry.

[39]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[40]  Xinchen Wang,et al.  Tissue-specific alternative splicing remodels protein-protein interaction networks. , 2012, Molecular cell.

[41]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[42]  A Keith Dunker,et al.  Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[43]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[44]  A. Dunker,et al.  Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life , 2012, Journal of biomolecular structure & dynamics.

[45]  A Keith Dunker,et al.  Intrinsic disorder in scaffold proteins: getting more from less. , 2008, Progress in biophysics and molecular biology.

[46]  H. Dyson,et al.  Coupling of folding and binding for unstructured proteins. , 2002, Current opinion in structural biology.

[47]  Bin Xue,et al.  Archaic chaos: intrinsically disordered proteins in Archaea , 2010, BMC Systems Biology.

[48]  Silvio C. E. Tosatto,et al.  ESpritz: accurate and fast prediction of protein disorder , 2012, Bioinform..

[49]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[50]  Luca Gentilucci,et al.  Current Medicinal Chemistry , 2010 .

[51]  Marc S. Cortese,et al.  Coupled folding and binding with α-helix-forming molecular recognition elements , 2005 .

[52]  R. Lemieux,et al.  How Emil Fischer was led to the lock and key concept for enzyme specificity. , 1994, Advances in carbohydrate chemistry and biochemistry.

[53]  István Simon,et al.  Disordered Binding Regions and Linear Motifs—Bridging the Gap between Two Models of Molecular Recognition , 2012, PloS one.

[54]  Sonia Longhi,et al.  The C-terminal domain of measles virus nucleoprotein belongs to the class of intrinsically disordered proteins that fold upon binding to their physiological partner. , 2004, Virus research.

[55]  Aidan Budd,et al.  Short linear motifs: ubiquitous and functionally diverse protein interaction modules directing cell regulation. , 2014, Chemical reviews.

[56]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[57]  F. Cohen,et al.  An evolutionary trace method defines binding surfaces common to protein families. , 1996, Journal of molecular biology.

[58]  C. Deber,et al.  Alpha-helical, but not beta-sheet, propensity of proline is determined by peptide environment. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[59]  S. Metallo,et al.  Intrinsically disordered proteins are potential drug targets. , 2010, Current opinion in chemical biology.

[60]  M. Vihinen,et al.  Accuracy of protein flexibility predictions , 1994, Proteins.

[61]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[62]  Zsuzsanna Dosztányi,et al.  ANCHOR: web server for predicting protein binding regions in disordered proteins , 2009, Bioinform..

[63]  Lance Wells,et al.  Multiple Tissue-specific Roles for the O-GlcNAc Post-translational Modification in the Induction of and Complications Arising from Type II Diabetes* , 2014, The Journal of Biological Chemistry.

[64]  M. Bolognesi,et al.  Function and Structure of Inherently Disordered Proteins This Review Comes from a Themed Issue on Proteins Edited Prediction of Non-folding Proteins and Regions Frequency of Disordered Regions Protein Evolution Partitioning Unstructured Proteins and Regions into Groups Involvement of Inherently Diso , 2022 .

[65]  Z. Obradovic,et al.  Identification and functions of usefully disordered proteins. , 2002, Advances in protein chemistry.

[66]  E. Barbar,et al.  Polybivalency and disordered proteins in ordering macromolecular assemblies. , 2015, Seminars in cell & developmental biology.

[67]  V. Uversky,et al.  Why are “natively unfolded” proteins unstructured under physiologic conditions? , 2000, Proteins.

[68]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[69]  A Keith Dunker,et al.  Alternative splicing of intrinsically disordered regions and rewiring of protein interactions. , 2013, Current opinion in structural biology.

[70]  A. J. Clifford,et al.  BIOCHIMICA ET BIOPHYSICA ACTA , 2022 .

[71]  Jörg Gsponer,et al.  Computational identification of MoRFs in protein sequences , 2015, Bioinform..

[72]  Georgios N Tsaousis,et al.  Analysis of Molecular Recognition Features (MoRFs) in membrane proteins. , 2013, Biochimica et biophysica acta.

[73]  Monika Fuxreiter,et al.  Close encounters of the third kind: disordered domains and the interactions of proteins , 2009, BioEssays : news and reviews in molecular, cellular and developmental biology.

[74]  Jeff Hasty,et al.  Protein interactions: Unspinning the web , 2001, Nature.

[75]  Christopher J. Oldfield,et al.  The unfoldomics decade: an update on intrinsically disordered proteins , 2008, BMC Genomics.

[76]  P. Tompa,et al.  The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. , 2005, Journal of molecular biology.

[77]  P. Tompa,et al.  Reduction in Structural Disorder and Functional Complexity in the Thermal Adaptation of Prokaryotes , 2010, PloS one.

[78]  Anna Tramontano,et al.  Assessment of protein disorder region predictions in CASP10 , 2014, Proteins.

[79]  U. Samanta,et al.  CH/pi interaction in the packing of the adenine ring in protein structures. , 1995, Journal of molecular biology.

[80]  Patrick T. Dolan,et al.  Intrinsic disorder mediates hepatitis C virus core–host cell protein interactions , 2015, Protein science : a publication of the Protein Society.

[81]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[82]  M. Jiménez,et al.  Tryptophan residues: Scarce in proteins but strong stabilizers of β‐hairpin peptides , 2010, Biopolymers.

[83]  Yaoqi Zhou,et al.  Intrinsically Semi-disordered State and Its Role in Induced Folding and Protein Aggregation , 2013, Cell Biochemistry and Biophysics.

[84]  L. Iakoucheva,et al.  Intrinsic disorder in cell-signaling and cancer-associated proteins. , 2002, Journal of molecular biology.

[85]  Shu-Lin Wang,et al.  Computational methods for the prediction of protein-protein interactions. , 2010, Protein and peptide letters.

[86]  Lukasz Kurgan,et al.  Disordered Proteinaceous Machines , 2014, Chemical reviews.

[87]  Auinash Kalsotra,et al.  Functional consequences of developmentally regulated alternative splicing , 2011, Nature Reviews Genetics.

[88]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[89]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. , 2007, Journal of proteome research.

[90]  A. Keith Dunker,et al.  Intrinsic Disorder in the Protein Data Bank , 2007, Journal of biomolecular structure & dynamics.

[91]  A Keith Dunker,et al.  TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder. , 2008, Protein and peptide letters.

[92]  A. Valencia,et al.  Computational methods for the prediction of protein interactions. , 2002, Current opinion in structural biology.

[93]  A. Elofsson,et al.  What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? , 2006, Genome Biology.

[94]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[95]  Silvio C. E. Tosatto,et al.  Comprehensive large-scale assessment of intrinsic protein disorder , 2015, Bioinform..

[96]  David T. Jones,et al.  DISOPRED3: precise disordered region predictions with annotated protein-binding activity , 2014, Bioinform..

[97]  Ren Sun,et al.  Identification and comparative analysis of hepatitis C virus-host cell protein interactions. , 2013, Molecular bioSystems.

[98]  A. Dunker,et al.  Understanding protein non-folding. , 2010, Biochimica et biophysica acta.

[99]  Kevin W. Plaxco,et al.  The importance of being unfolded , 1997, Nature.

[100]  P. Romero,et al.  Sequence complexity of disordered protein , 2001, Proteins.

[101]  Christine A. Orengo,et al.  Inferring Function Using Patterns of Native Disorder in Proteins , 2007, PLoS Comput. Biol..

[102]  Weiru Wang,et al.  Targeting protein-protein interaction by small molecules. , 2014, Annual review of pharmacology and toxicology.

[103]  Marc S. Cortese,et al.  Flexible nets , 2005, The FEBS journal.

[104]  Xiuzhen Zhang,et al.  Abundance of intrinsically unstructured proteins in P. falciparum and other apicomplexan parasite proteomes. , 2006, Molecular and biochemical parasitology.

[105]  Christopher J. Oldfield,et al.  Intrinsically disordered proteins and multicellular organisms. , 2015, Seminars in cell & developmental biology.

[106]  Philip M. Kim,et al.  The role of disorder in interaction networks: a structural analysis , 2008, Molecular systems biology.

[107]  T. Allers Overexpression and purification of halophilic proteins in Haloferax volcanii , 2010, Bioengineered bugs.

[108]  Ignacio E. Sánchez,et al.  The eukaryotic linear motif resource ELM: 10 years and counting , 2013, Nucleic Acids Res..

[109]  Vladimir N Uversky,et al.  What does it mean to be natively unfolded? , 2002, European journal of biochemistry.

[110]  Christopher J. Oldfield,et al.  Flexible nets: disorder and induced fit in the associations of p53 and 14-3-3 with their partners , 2008, BMC Genomics.

[111]  E. Cino,et al.  Binding of disordered proteins to a protein hub , 2013, Scientific Reports.

[112]  Michael B. Yaffe,et al.  Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs , 2003, Nucleic Acids Res..

[113]  A Keith Dunker,et al.  Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. , 2007, Journal of proteome research.

[114]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[115]  Zsuzsanna Dosztányi,et al.  Prediction of Protein Binding Regions in Disordered Proteins , 2009, PLoS Comput. Biol..

[116]  Lukasz Kurgan,et al.  The intrinsic disorder status of the human hepatitis C virus proteome. , 2014, Molecular bioSystems.

[117]  E. Fischer Einfluss der Configuration auf die Wirkung der Enzyme , 1894 .

[118]  Kengo Kinoshita,et al.  Identification of transient hub proteins and the possible structural basis for their multiple interactions , 2008, Protein science : a publication of the Protein Society.

[119]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[120]  David C Fry,et al.  Small-molecule inhibitors of protein-protein interactions: how to mimic a protein partner. , 2012, Current pharmaceutical design.

[121]  Lukasz Kurgan,et al.  A creature with a hundred waggly tails: intrinsically disordered proteins in the ribosome , 2013, Cellular and Molecular Life Sciences.

[122]  István Simon,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm035 Structural bioinformatics Local structural disorder imparts plasticity on linear motifs , 2022 .

[123]  Jeffrey L. Wrana,et al.  An Alternative Splicing Switch Regulates Embryonic Stem Cell Pluripotency and Reprogramming , 2011, Cell.

[124]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[125]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[126]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[127]  Maurizio Recanatini,et al.  Structure-based design of small-molecule protein – protein interaction modulators : the story , 2014 .