As in any challenging science problem, a sensible approach is to consider the hints provided by nature. In spite of their diversity, globular proteins share some amazing commonalities: they fold rapidly and reproducibly (Anfinsen, 1973); their folding is driven by the affinity or lack of certain side chains and the backbone to the surrounding water; they are able to catalyze reactions and speed them up by many orders of magnitude; the folded forms of proteins are comprised of helices and sheets – the scaffolding which is provided by hydrogen bonds (Pauling & Corey, 1951; Pauling, Corey, & Branson, 1951); and proteins seem to have a generic propensity to aggregate and form amyloid (Dobson, 2003), which, in turn, is implicated in debilitating diseases. A remarkable experimental observation is that the total number of distinct folds seems to be of the order of just a few thousand (Chothia & Finkelstein, 1990). More than 70 years ago, Bernal wrote a remarkable paper entitled, “Structure of Proteins” (Bernal, 1939), whose abstract reads: The structure of proteins is the major unsolved problem on the boundary of chemistry and biology to-day. We have not yet found the key to the problem, but in recent years a mass of new evidence and new lines of attack have enabled us to see it in a far more concrete and precise form, and to have some hope that we are near to solving it. Bernal goes on to suggest: Any effective picture of protein structure must provide at the same time for the common character of all proteins as exemplified by their many chemical and physical similarities, and for the highly specific nature of each protein type. A striking feature of proteins is that a large number of sequences adopt one of a relatively small number of putative protein folds (Chothia & Finkelstein, 1990). This has led to thinking about how one may determine the structure of a given sequence from knowledge of the menu of the folds. A crucial point to consider is the nature of the unfolded state of a protein (Rose, 2002). Does a given sequence already have the address of its native-state structure encoded in its nascent conformation? Coarse-grained models of proteins have been studied over the years (Miyazawa & Jernigan, 1985) with varying degrees of approximation to determine generic and transferable effective interactions between pairs of amino acids that ensure that the energy of a sequence in its native state is lower than in accessible competing conformations. Such knowledge-based approaches in their simplest form (Jha, Vishveshwara, & Banavar, 2010) approximate a protein native-state structure as being made up of a gas of amino acids and study the propensity for specific pairs of amino acids to be spatially near each other compared to what one would expect from random considerations. More refined calculations involving the side-chain atoms show that a protein native-state structure is a delicate packing of all the constituent atoms respecting steric constraints (Ramachandran, Ramakrishnan, & Sasisekharan, 1963), yielding an emergent connected community arising through short-range interactions screened by the surrounding water molecules (Deb, Vishveshwara, & Vishveshwara, 2009). Ben-Naim (2012) points out that the Gibbs energy landscape may be characterized by many minima (see Figure 5 of his paper), some of which may be rarely visited and others which may be inaccessible because of high barriers. The experimental observation that a given protein or sequence of amino acids folds reproducibly and rapidly into a specific native fold, is encapsulated in the folding funnel picture (Dill & Chan, 1997; Wolynes, Onuchic, & Thirumalai, 1995). Were there multiple competing and accessible minima in the Gibbs energy landscape, one would not observe reproducible folding. Were there not a smooth and accessible funnel-like basin of attraction for the native fold, one would not observe
[1]
C. Dobson.
Protein Folding and Disease: a view from the first Horizon Symposium
,
2003,
Nature Reviews Drug Discovery.
[2]
K. Dill,et al.
A lattice statistical mechanics model of the conformational and sequence spaces of proteins
,
1989
.
[3]
A. Ben-Naim.
Levinthal’s question revisited, and answered
,
2012,
Journal of biomolecular structure & dynamics.
[4]
Flavio Seno,et al.
Protein Sequence and Structure: Is One More Fundamental than the Other?
,
2012
.
[5]
K. Dill,et al.
From Levinthal to pathways to funnels
,
1997,
Nature Structural Biology.
[6]
J. Onuchic,et al.
Navigating the folding routes
,
1995,
Science.
[7]
A V Finkelstein,et al.
The classification and origins of protein folding patterns.
,
1990,
Annual review of biochemistry.
[8]
L. Pauling,et al.
Configurations of Polypeptide Chains With Favored Orientations Around Single Bonds: Two New Pleated Sheets.
,
1951,
Proceedings of the National Academy of Sciences of the United States of America.
[9]
W. T. ASTBURY,et al.
Structure of Proteins
,
1939,
Nature.
[10]
Saraswathi Vishveshwara,et al.
Amino acid interaction preferences in proteins
,
2010,
Protein science : a publication of the Protein Society.
[11]
Jayanth R Banavar,et al.
Physics of proteins.
,
2007,
Annual review of biophysics and biomolecular structure.
[12]
G. Rose.
Getting to know u.
,
2002,
Advances in protein chemistry.
[13]
L. Pauling,et al.
The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain.
,
1951,
Proceedings of the National Academy of Sciences of the United States of America.
[14]
R. Jernigan,et al.
Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation
,
1985
.
[15]
P. Wolynes,et al.
Spin glasses and the statistical mechanics of protein folding.
,
1987,
Proceedings of the National Academy of Sciences of the United States of America.
[16]
G. N. Ramachandran,et al.
Stereochemistry of polypeptide chain configurations.
,
1963,
Journal of molecular biology.
[17]
C. Anfinsen.
Principles that govern the folding of protein chains.
,
1973,
Science.
[18]
Saraswathi Vishveshwara,et al.
Understanding protein structure from a percolation perspective.
,
2009,
Biophysical journal.