Is the Folding Topology of a Protein Related to its Amino Acid Occurrence?

Proteins are molecular machines built from a polypeptide chain with capability to self-assemble into a compact 3D structure. This amazing capability is only exhibited by a subset of all the possible polypeptide chains built using the 20 natural amino acids so that, proteins in the living organisms are the result of years of natural selection (1). The physical basis of protein self-assembling or folding as is commonly referred, has puzzled researchers for the last fifty years and it is still a matter of debate (2, 3). Protein folding has been analyzed using methodologies that range from biophysical methods including structural, spectroscopic or computational to biochemical, including those of molecular biology. Present knowledge is the result of important research efforts basically focused on either understanding the structural features of known folded proteins, on the dynamics of the folding process or on the design of algorithms to predict the 3D structure of a protein from its sequence. These three aspects are closely related and need to be considered simultaneously, since prediction of the 3D structure of a protein involves a deep understanding of the structural features of folded proteins together with a solid knowledge of the driving forces involved in the folding process, including the role played by the solvent (4).

[1]  B Jayaram,et al.  A Stoichiometry Driven Universal Spatial Organization of Backbones of Folded Proteins: Are there Chargaff's Rules for Protein Folding? , 2010, Journal of biomolecular structure & dynamics.

[2]  Tomonori Gotoh,et al.  Secondary Structure Characterization Based on Amino Acid Composition and Availability in Proteins , 2010, J. Chem. Inf. Model..

[3]  John Orban,et al.  NMR structures of two designed proteins with high sequence identity but different fold and function , 2008, Proceedings of the National Academy of Sciences.

[4]  A. Bornot,et al.  Protein contacts, inter-residue interactions and side-chain modelling. , 2008, Biochimie.

[5]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[6]  Y-h. Taguchi,et al.  Application of amino acid occurrence for discriminating different folding types of globular proteins , 2007, BMC Bioinformatics.

[7]  G. Rose,et al.  A backbone-based theory of protein folding , 2006, Proceedings of the National Academy of Sciences.

[8]  J. Skolnick,et al.  On the origin and highly likely completeness of single-domain protein structures. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[9]  J. Skolnick,et al.  Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm , 2004, Proteins.

[10]  D. Baker,et al.  A surprising simplicity to protein folding , 2000, Nature.

[11]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[12]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[13]  T. Creighton Proteins: Structures and molecular principles , 1983 .

[14]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[15]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[16]  C. Levinthal Are there pathways for protein folding , 1968 .

[17]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[18]  L. Pauling,et al.  Atomic coordinates and structure factors for two helical configurations of polypeptide chains. , 1951, Proceedings of the National Academy of Sciences of the United States of America.