All-atom ab initio folding of a diverse set of proteins.

Natural proteins fold to a unique, thermodynamically dominant state. Modeling of the folding process and prediction of the native fold of proteins are two major unsolved problems in biophysics. Here, we show successful all-atom ab initio folding of a representative diverse set of proteins by using a minimalist transferable-energy model that consists of two-body atom-atom interactions, hydrogen bonding, and a local sequence-energy term that models sequence-specific chain stiffness. Starting from a random coil, the native-like structure was observed during replica exchange Monte Carlo (REMC) simulation for most proteins regardless of their structural classes; the lowest energy structure was close to native-in the range of 2-6 A root-mean-square deviation (rmsd). Our results demonstrate that the successful folding of a protein chain to its native state is governed by only a few crucial energetic terms.

[1]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[2]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[3]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[4]  F M Richards,et al.  An analysis of packing in the protein folding problem , 1993, Quarterly Reviews of Biophysics.

[5]  E. Shakhnovich,et al.  Chain Length Scaling of Protein Folding Time. , 1996, Physical review letters.

[6]  J. Skolnick,et al.  Comparison of three Monte Carlo conformational search strategies for a proteinlike homopolymer model: Folding thermodynamics and identification of low-energy structures , 2000 .

[7]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..

[8]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[9]  Eugene I Shakhnovich,et al.  Lessons from the design of a novel atomic potential for protein folding , 2005, Protein science : a publication of the Protein Society.

[10]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[11]  Wang,et al.  Replica Monte Carlo simulation of spin glasses. , 1986, Physical review letters.

[12]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[13]  V. Pande,et al.  Absolute comparison of simulated and experimental protein-folding dynamics , 2002, Nature.

[14]  A. Liwo,et al.  Ab initio simulations of protein-folding pathways by molecular dynamics with the united-residue model of polypeptide chains. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M Karplus,et al.  "New view" of protein folding reconciled with the old through multiple unfolding simulations. , 1997, Science.

[16]  E. Shakhnovich,et al.  Proteins with selected sequences fold into unique native conformation. , 1994, Physical review letters.

[17]  Eugene Shakhnovich,et al.  Protein folding thermodynamics and dynamics: where physics, chemistry, and biology meet. , 2006, Chemical reviews.

[18]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[19]  J. Skolnick,et al.  Automated structure prediction of weakly homologous proteins on a genomic scale. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[20]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[21]  A. Fersht,et al.  Protein Folding and Unfolding at Atomic Resolution , 2002, Cell.

[22]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[23]  E I Shakhnovich,et al.  Protein design: a perspective from simple tractable models , 1998, Folding & design.

[24]  G. Rose,et al.  The protein-folding problem: the native fold determines packing, but does packing determine the native fold? , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Baker,et al.  Prospects for ab initio protein structural genomics. , 2001, Journal of molecular biology.

[26]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[27]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[28]  Eric J. Deeds,et al.  High-resolution protein folding with a transferable potential. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[29]  D. Shortle Propensities, probabilities, and the Boltzmann hypothesis , 2003, Protein science : a publication of the Protein Society.

[30]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Eric J. Deeds,et al.  Understanding ensemble protein folding at atomic detail , 2006, Proceedings of the National Academy of Sciences.

[32]  Arne Elofsson,et al.  MaxSub: an automated measure for the assessment of protein structure prediction quality , 2000, Bioinform..

[33]  Eugene I. Shakhnovich,et al.  A structure-based method for derivation of all-atom potentials for protein folding , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[34]  E. Shakhnovich,et al.  The folding thermodynamics and kinetics of crambin using an all-atom Monte Carlo simulation. , 2000, Journal of molecular biology.

[35]  D Thirumalai,et al.  Mechanisms and kinetics of beta-hairpin formation. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[36]  T. Sosnick,et al.  Fast folding of a helical protein initiated by the collision of unstructured chains. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[37]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[38]  Patrick J Fleming,et al.  Ab Initio Protein Folding Using LINUS , 2004, Numerical Computer Methods, Part D.

[39]  Yang Zhang,et al.  TASSER: An automated method for the prediction of protein tertiary structures in CASP6 , 2005, Proteins.

[40]  Eugene I Shakhnovich,et al.  A knowledge‐based move set for protein folding , 2007, Proteins.