Sequence and Structure Properties Uncover the Natural Classification of Protein Complexes Formed by Intrinsically Disordered Proteins via Mutual Synergistic Folding

Intrinsically disordered proteins mediate crucial biological functions through their interactions with other proteins. Mutual synergistic folding (MSF) occurs when all interacting proteins are disordered and fold into a stable structure during complex formation. In these cases, folding and binding occur in parallel, lending the resulting structures uniquely heterogeneous features. Currently, no dedicated classification approach takes into account the particular biological and biophysical properties of MSF complexes. Here, we present a scalable clustering-based classification scheme built on redundancy-filtered features that describe the sequence and structure properties of the complexes and the role of the interaction, which is directly responsible for structure formation. Using this approach, we define six major types of MSF complexes that correspond to biologically meaningful groups. The presented method thus also shows that differences in binding strength, subcellular localization, and regulation are encoded in the sequence and structural properties of proteins. While current protein structure classification methods can also handle complex structures, we show that the developed scheme is fundamentally different; because it takes into account the defining features of MSF complexes, it better represents structures arising through this specific interaction mode.
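The general idea of clustering on redundancy-filtered features can be illustrated with a minimal sketch. It is not the authors' pipeline: the correlation threshold, the average-linkage criterion, the function names, and the toy feature matrix are all assumptions chosen for illustration, and the abstract does not specify which filtering or clustering procedure was actually used.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy) if vx > 0 and vy > 0 else 0.0

def filter_redundant(rows, threshold=0.9):
    """Greedy filter: keep a feature column only if it is not strongly
    correlated with any column kept so far. Threshold is an assumed value."""
    cols = list(zip(*rows))
    kept = []
    for j, col in enumerate(cols):
        if all(abs(pearson(col, cols[k])) < threshold for k in kept):
            kept.append(j)
    return kept

def euclid(a, b):
    return sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

def agglomerate(points, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of point indices."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                total = sum(euclid(points[i], points[j])
                            for i in clusters[a] for j in clusters[b])
                d = total / (len(clusters[a]) * len(clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]  # merge the closest pair
        del clusters[b]
    return clusters

# Hypothetical toy matrix: 4 complexes x 3 features.
# Feature 1 is exactly twice feature 0, so it carries no extra information.
toy = [
    [0.1, 0.2, 0.5],
    [0.2, 0.4, 0.3],
    [0.9, 1.8, 0.3],
    [1.0, 2.0, 0.5],
]
kept = filter_redundant(toy)
reduced = [[row[j] for j in kept] for row in toy]
groups = agglomerate(reduced, n_clusters=2)
```

In this toy case the filter drops the duplicated feature before clustering, so it does not count twice in the distance calculation. In practice, features on different scales would likely need normalization first, which this sketch omits.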
