Toward a quantitative theory of intrinsically disordered proteins and their function

A large number of proteins are sufficiently unstable that their full 3D structure cannot be resolved. The origins of this intrinsic disorder are not well understood, but its ubiquitous presence undercuts the principle that a protein's structure determines its function. Here we present a quantitative theory that makes predictions regarding the role of intrinsic disorder in protein structure and function. In particular, we discuss the implications of analytical solutions of a series of fundamental thermodynamic models of protein interactions in which disordered proteins are characterized by positive folding free energies. We validate our predictions by assigning protein function by using the gene ontology classification—in which “protein binding”, “catalytic activity”, and “transcription regulator activity” are the three largest functional categories—and by performing genome-wide surveys of both the amount of disorder in these functional classes and binding affinities for both prokaryotic and eukaryotic genomes. Specifically, without assuming any a priori structure–function relationship, the theory predicts that both catalytic and low-affinity binding (Kd ≳10−7 M) proteins prefer ordered structures, whereas only high-affinity binding proteins (found mostly in eukaryotes) can tolerate disorder. Relevant to both transcription and signal transduction, the theory also explains how increasing disorder can tune the binding affinity to maximize the specificity of promiscuous interactions. Collectively, these studies provide insight into how natural selection acts on folding stability to optimize protein function.

[1]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.

[2]  L. Pauling,et al.  Nature of Forces between Large Molecules of Biological Interest , 1948, Nature.

[3]  V. Wood,et al.  Gene Ontology annotation status of the fission yeast genome: preliminary coverage approaches 100% , 2006, Yeast.

[4]  T. Gibson,et al.  Protein disorder prediction: implications for structural proteomics. , 2003, Structure.

[5]  Peter D. Karp,et al.  EcoCyc: A comprehensive view of Escherichia coli biology , 2008, Nucleic Acids Res..

[6]  R. Drysdale FlyBase : a database for the Drosophila research community. , 2008, Methods in molecular biology.

[7]  Benjamin A. Shoemaker,et al.  Speeding molecular recognition by using the folding funnel: the fly-casting mechanism. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[8]  J. Gawroński Amsterdam , 2008, Water in Times of Climate Change.

[9]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[10]  R. S. Spolar,et al.  Coupling of local folding to site-specific binding of proteins to DNA. , 1994, Science.

[11]  S. Vajda,et al.  Anchor residues in protein-protein interactions. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Jaime Prilusky,et al.  FoldIndex copyright: a simple tool to predict whether a given protein sequence is intrinsically unfolded , 2005, Bioinform..

[13]  Lincoln Stein,et al.  Gramene: a growing plant comparative genomics resource , 2007, Nucleic Acids Res..

[14]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2004, Nucleic Acids Res..

[15]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[16]  L. Iakoucheva,et al.  Intrinsic disorder in cell-signaling and cancer-associated proteins. , 2002, Journal of molecular biology.

[17]  I. Bahar,et al.  Coupling between catalytic site and collective dynamics: a requirement for mechanochemical activity of enzymes. , 2005, Structure.

[18]  Zoran Obradovic,et al.  Length-dependent prediction of protein intrinsic disorder , 2006, BMC Bioinformatics.

[19]  Rachael P. Huntley,et al.  The GOA database in 2009—an integrated Gene Ontology Annotation resource , 2008, Nucleic Acids Res..

[20]  Kimberly Van Auken,et al.  WormBase: new content and better access , 2006, Nucleic Acids Res..

[21]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[22]  J. Ladbury,et al.  Searching for specificity in SH domains. , 2000, Chemistry & biology.

[23]  C. Brown,et al.  Intrinsic protein disorder in complete genomes. , 2000, Genome informatics. Workshop on Genome Informatics.

[24]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[25]  Blaz Zupan,et al.  dictyBase—a Dictyostelium bioinformatics resource update , 2008, Nucleic Acids Res..

[26]  A K Dunker,et al.  Protein disorder and the evolution of molecular recognition: theory, predictions and observations. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[27]  Kara Dolinski,et al.  Gene Ontology annotations at SGD: new data sources and annotation methods , 2007, Nucleic Acids Res..

[28]  Carlos J. Camacho,et al.  Structural and Thermodynamic Approach to Peptide Immunogenicity , 2008, PLoS Comput. Biol..

[29]  R. Zwanzig,et al.  Two-state models of protein folding kinetics. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[30]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[31]  R. Nussinov,et al.  Structured disorder and conformational selection , 2001, Proteins.

[32]  Owen White,et al.  The TIGRFAMs database of protein families , 2003, Nucleic Acids Res..

[33]  Lilia M. Iakoucheva,et al.  Intrinsic Disorder Is a Common Feature of Hub Proteins from Four Eukaryotic Interactomes , 2006, PLoS Comput. Biol..

[34]  J. Beckmann,et al.  FoldIndex©: a simple tool to predict whether a given protein sequence is intrinsically unfolded , 2005 .

[35]  A Keith Dunker,et al.  Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[36]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[37]  B K Shoichet,et al.  A relationship between protein stability and protein function. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Degroot,et al.  Probability and Statistics , 2021, Examining an Operational Approach to Teaching Probability.