The physics and bioinformatics of binding and folding—an energy landscape perspective

It has been recognized in the last few years that unstructured proteins play an important role in biological organisms, often participating in signal transduction, transcriptional regulation, and a variety of other regulatory activities. Various hypotheses have been put forward for the ubiquity of the unfolded state; rapid turnover, faster or more specific binding kinetics, multifunctionality may all possibly explain apparent ubiquitousness of unfolded proteins in eukaryotic cells. In this paper we extend the energy landscape theory of protein folding to construct an analytical model of how binding and folding are coupled thermodynamically when the energy landscape is partially rugged. To deduce the parameters that enter the theory, which is based on Generalized Random Energy Model, we have analyzed in a bioinformatic sense a large structural database of more than 500 protein complexes. We find that Miyazawa–Jernigan contact potential shows similar energy gaps for folding for both hydrophobic and hydrophilic proteins, but that for binding contacts hydrophobic interfaces turn out to be funneled while hydrophilic ones are antifunneled. This suggests evolution has found a mechanism for avoiding frustration between folding and binding by making use of indirect water‐mediated interactions. By juxtaposing the monomeric protein folding free energy profile in the protein complex database with another database consisting of only well‐folded monomers, we estimate that at least 15% of monomers in the former database are unfolded in the absence of partner protein interface interactions. When employing the parameters characteristic of these unfolded monomers to construct binding/folding phase diagrams, we find that these monomers would indeed fold if sufficiently stabilizing binding contacts, consistent with that fold, are formed. © 2003 Wiley Periodicals, Inc. Biopolymers: 333–349, 2003

[1]  Emil Fischer,et al.  Ueber die optischen Isomeren des Traubenzuckers, der Gluconsäure und der Zuckersäure , 1890 .

[2]  L. Rosenhead Conduction of Heat in Solids , 1947, Nature.

[3]  R. Nussinov,et al.  Folding funnels, binding funnels, and protein function , 1999, Protein science : a publication of the Protein Society.

[4]  D. Koshland The Key–Lock Theory and the Induced Fit Theory , 1995 .

[5]  E. Fischer Einfluss der Configuration auf die Wirkung der Enzyme , 1894 .

[6]  J. Janin The kinetics of protein‐protein recognition , 1997, Proteins.

[7]  Charles DeLisi,et al.  Protein‐protein recognition: exploring the energy funnels near the binding sites , 1999, Proteins.

[8]  U Bastolla,et al.  How to guarantee optimal stability for most representative structures in the protein data bank , 2001, Proteins.

[9]  Y. Ohya,et al.  A fluorescent indicator for detecting protein-protein interactions in vivo based on protein splicing. , 2000, Analytical chemistry.

[10]  N. Ben-Tal,et al.  Residue frequencies and pairing preferences at protein–protein interfaces , 2001, Proteins.

[11]  J. Onuchic,et al.  Toward an outline of the topography of a realistic protein-folding funnel. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[12]  K A Dill,et al.  Ligand binding to proteins: The binding landscape model , 1997, Protein science : a publication of the Protein Society.

[13]  J. Onuchic,et al.  Theory of protein folding: the energy landscape perspective. , 1997, Annual review of physical chemistry.

[14]  Gennady Verkhivker,et al.  Deciphering common failures in molecular docking of ligand-protein complexes , 2000, J. Comput. Aided Mol. Des..

[15]  Shoji Takada,et al.  Microscopic theory of protein folding rates. I. Fine structure of the free energy profile and folding routes from a variational approach , 2000, cond-mat/0008454.

[16]  Luis Fernández Pacios,et al.  Distinct Molecular Surfaces and Hydrophobicity of Amino Acid Residues in Proteins , 2001, J. Chem. Inf. Comput. Sci..

[17]  H. Dyson,et al.  Coupling of folding and binding for unstructured proteins. , 2002, Current opinion in structural biology.

[18]  J R Banavar,et al.  Protein threading by learning , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[20]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[21]  R. S. Spolar,et al.  Coupling of local folding to site-specific binding of proteins to DNA. , 1994, Science.

[22]  P. Romero,et al.  Sequence complexity of disordered protein , 2001, Proteins.

[23]  J. A. D'aquino,et al.  Binding of small organic molecules to macromolecular targets: Evaluation of conformational entropy changes , 2000, Proteins.

[24]  C. Arrowsmith,et al.  Thermodynamic analysis of the structural stability of the tetrameric oligomerization domain of p53 tumor suppressor. , 1995, Biochemistry.

[25]  P. Wolynes,et al.  The energy landscapes and motions of proteins. , 1991, Science.

[26]  P E Wright,et al.  Structural studies of p21Waf1/Cip1/Sdi1 in the free and Cdk2-bound state: conformational disorder mediates binding diversity. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Peter G. Wolynes,et al.  Role of explicitly cooperative interactions in protein folding funnels: A simulation study , 2001 .

[28]  Wolynes,et al.  Correlated energy landscape model for finite, random heteropolymers. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[29]  Gennady M Verkhivker,et al.  Predicting structural effects in HIV‐1 protease mutant complexes with flexible ligand docking and protein side‐chain optimization , 1998, Proteins.

[30]  P. Wolynes,et al.  Statistical mechanics of a correlated energy landscape model for protein folding funnels , 1996, cond-mat/9606159.

[31]  R. Sauer Scissors and helical forks , 1990, Nature.

[32]  J Skolnick,et al.  How do potentials derived from structural databases relate to “true” potentials? , 1998, Protein science : a publication of the Protein Society.

[33]  L M Amzel,et al.  Estimation of changes in side chain configurational entropy in binding and folding: General methods and application to helix formation , 1994, Proteins.

[34]  P. V. von Hippel,et al.  Double-stranded DNA templates can induce alpha-helical conformation in peptides containing lysine and alanine: functional implications for leucine zipper and helix-loop-helix transcription factors. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[35]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[36]  Randy J. Read,et al.  Multiple-Start Monte Carlo Docking of Flexible Ligands , 1994 .

[37]  C. Matthews,et al.  Probing the folding mechanism of a leucine zipper peptide by stopped-flow circular dichroism spectroscopy. , 1995, Biochemistry.

[38]  G. Fredrickson The theory of polymer dynamics , 1996 .

[39]  H. Erickson,et al.  Kinetics of protein-protein association explained by Brownian dynamics computer simulation. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Benjamin A. Shoemaker,et al.  Speeding molecular recognition by using the folding funnel: the fly-casting mechanism. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[41]  M Vendruscolo,et al.  Recovery of protein structure from contact maps. , 1997, Folding & design.

[42]  Zaida Luthey-Schulten,et al.  Evaluating protein structure-prediction schemes using energy landscape theory , 2001, IBM J. Res. Dev..

[43]  R L Brady,et al.  One sequence, two folds: a metastable structure of CD2. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[44]  L A Mirny,et al.  Comparison of two optimization methods to derive energy parameters for protein folding: Perceptron and Z score , 2000, Proteins.

[45]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[46]  M Vendruscolo,et al.  Protein folding using contact maps. , 1999, Vitamins and hormones.