Composites of local structure propensities: evidence for local encoding of long-range structure.

To estimate how extensively the ensemble of denatured-state conformations is constrained by local side-chain-backbone interactions, propensities of each of the 20 amino acids to occur in mono- and dipeptides mapped to discrete regions of the Ramachandran map are computed from proteins of known structure. In addition, propensities are computed for the trans, gauche-, and gauche+ rotamers, with or without consideration of the values of phi and psi. These propensities are used in scoring functions for fragment threading, which estimates the energetic favorability of fragments of protein sequence to adopt the native conformation as opposed to hundreds of thousands of incorrect conformations. As finer subdivisions of the Ramachandran plot, neighboring residue phi/psi angles, and rotamers are incorporated, scoring functions become better at ranking the native conformation as the most favorable. With the best composite propensity function, the native structure can be distinguished from 300,000 incorrect structures for 71% of the 2130 arbitrary protein segments of length 40, 48% of 2247 segments of length 30, and 20% of 2368 segments of length 20. A majority of fragments of length 30-40 are estimated to be folded into the native conformation a substantial fraction of the time. These data suggest that the variations observed in amino acid frequencies in different phi/psi/chi1 environments in folded proteins reflect energetically important local side-chain-backbone interactions, interactions that may severely restrict the ensemble of conformations populated in the denatured state to a relatively small subset with nativelike structure.

[1]  G. N. Ramachandran,et al.  Conformation of polypeptides and proteins. , 1968, Advances in protein chemistry.

[2]  M. Volkenstein,et al.  Statistical mechanics of chain molecules , 1969 .

[3]  F. Pohl Empirical protein energy maps. , 1971, Nature: New biology.

[4]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[5]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[6]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[7]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[8]  A M Lesk,et al.  Interior and surface of monomeric proteins. , 1987, Journal of molecular biology.

[9]  J. Richardson,et al.  Principles and Patterns of Protein Conformation , 1989 .

[10]  J. Thornton,et al.  Influence of proline residues on protein conformation. , 1991, Journal of molecular biology.

[11]  G. Rose,et al.  Side-chain entropy opposes alpha-helix formation but rationalizes experimentally determined helix-forming propensities. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[12]  S. Bryant,et al.  An empirical energy function for threading protein sequence through the folding motif , 1993, Proteins.

[13]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[14]  B. Matthews,et al.  Structural basis of amino acid alpha helix propensity. , 1993, Science.

[15]  Manfred J. Sippl,et al.  Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures , 1993, J. Comput. Aided Mol. Des..

[16]  V. Muñoz,et al.  Intrinsic secondary structure propensities of the amino acids, using statistical ϕ–ψ matrices: Comparison with experimental scales , 1994 .

[17]  Y. Matsuo,et al.  Protein structural similarities predicted by a sequence‐structure compatibility method , 1994, Protein science : a publication of the Protein Society.

[18]  B. Honig,et al.  Free energy determinants of secondary structure formation: II. Antiparallel beta-sheets. , 1995, Journal of molecular biology.

[19]  B. Honig,et al.  Free energy determinants of secondary structure formation: I. alpha-Helices. , 1995, Journal of molecular biology.

[20]  S. Bryant,et al.  Threading a database of protein cores , 1995, Proteins.

[21]  A. Finkelstein,et al.  Why do protein architectures have boltzmann‐like statistics? , 1995, Proteins.

[22]  G D Rose,et al.  Modeling unfolded states of peptides and proteins. , 1995, Biochemistry.

[23]  M. Swindells,et al.  Intrinsic φ,ψ propensities of amino acids, derived from the coil regions of known structures , 1995, Nature Structural Biology.

[24]  James O. Wrabl,et al.  Perturbations of the denatured state ensemble: Modeling their effects on protein stability and folding kinetics , 1996, Protein science : a publication of the Protein Society.

[25]  M. Sippl,et al.  Helmholtz free energies of atom pair interactions in proteins. , 1996, Folding & design.

[26]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[27]  David C. Jones,et al.  Potential energy functions for threading. , 1996, Current opinion in structural biology.

[28]  J Moult,et al.  Comparison of database potentials and molecular mechanics force fields. , 1997, Current opinion in structural biology.

[29]  S Vajda,et al.  Empirical potentials and functions for protein folding and binding. , 1997, Current opinion in structural biology.

[30]  R Abagyan,et al.  Evaluating the energetics of empty cavities and internal mutations in proteins , 1997, Protein science : a publication of the Protein Society.

[31]  R L Jernigan,et al.  Short‐range conformational energies, secondary structure propensities, and recognition of correct sequence‐structure matches , 1997, Proteins.

[32]  S L Mayo,et al.  Intrinsic beta-sheet propensities result from van der Waals interactions between side chains and the local backbone. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[33]  R. Srinivasan,et al.  A physical basis for protein secondary structure. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[34]  R. Srinivasan,et al.  The Flory isolated-pair hypothesis is not valid for polypeptide chains: implications for protein folding. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[35]  N. Kannan,et al.  Aromatic clusters: a determinant of thermal stability of thermophilic proteins. , 2000, Protein engineering.

[36]  Richard Bonneau,et al.  Ab initio protein structure prediction: progress and prospects. , 2001, Annual review of biophysics and biomolecular structure.

[37]  D. Shortle,et al.  Persistence of Native-Like Topology in a Denatured Protein in 8 M Urea , 2001, Science.