Lessons from application of the UNRES force field to predictions of structures of CASP10 targets

Significance With the example of the coarse-grained United Residue model of polypeptide chains, this paper demonstrates that the physics-based approach for protein-structure prediction can lead to exceptionally good results when correct domain packing is an issue, even for a highly homologous target. The reason for this is probably that emphasis is placed on energetically favorable residue–residue interactions, including those with residues in relatively flexible linker regions; these regions are usually very different in the target compared with those of proteins in the databases used for template-based modeling. The results suggest that a combination of bioinformatics and a physics-based approach could result in a major increase in the prediction capacity of existing approaches. The performance of the physics-based protocol, whose main component is the United Residue (UNRES) physics-based coarse-grained force field, developed in our laboratory for the prediction of protein structure from amino acid sequence, is illustrated. Candidate models are selected, based on probabilities of the conformational families determined by multiplexed replica-exchange simulations, from the 10th Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP10). For target T0663, classified as a new fold, which consists of two domains homologous to those of known proteins, UNRES predicted the correct symmetry of packing, in which the domains are rotated with respect to each other by 180° in the experimental structure. By contrast, models obtained by knowledge-based methods, in which each domain is modeled very accurately but not rotated, resulted in incorrect packing. Two UNRES models of this target were featured by the assessors. Correct domain packing was also predicted by UNRES for the homologous target T0644, which has a similar structure to that of T0663, except that the two domains are not rotated. Predictions for two other targets, T0668 and T0684_D2, are among the best ones by global distance test score. These results suggest that our physics-based method has substantial predictive power. In particular, it has the ability to predict domain–domain orientations, which is a significant advance in the state of the art.

[1]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[2]  P K Warme,et al.  Computation of structures of homologous proteins. Alpha-lactalbumin from lysozyme. , 1974, Biochemistry.

[3]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[4]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[5]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[6]  Janet M. Thornton,et al.  Protein fold recognition , 1993, J. Comput. Aided Mol. Des..

[7]  D T Jones,et al.  Protein fold recognition by sequence threading: tools and assessment techniques , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[8]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[9]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[10]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[11]  L. Mirny,et al.  Protein structure prediction by threading. Why it works and why it does not. , 1998, Journal of molecular biology.

[12]  A. Liwo,et al.  Protein structure prediction by global optimization of a potential energy function. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Adam Liwo,et al.  Hierarchical energy-based approach to protein-structure prediction: Blind-test evaluation with CASP3 targets , 2000 .

[14]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[15]  A. Liwo,et al.  Cumulant-based expressions for the multibody terms for the correlation between local and electrostatic interactions in the united-residue force field , 2001 .

[16]  Richard Bonneau,et al.  Rosetta in CASP4: Progress in ab initio protein structure prediction , 2001, Proteins.

[17]  David C. Jones Predicting novel protein folds by using FRAGFOLD , 2001, Proteins.

[18]  Adam Liwo,et al.  Recent improvements in prediction of protein structure by global optimization of a potential energy function , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Adam Liwo,et al.  Energy‐based reconstruction of a protein backbone from its α‐carbon trace by a Monte‐Carlo method , 2002, J. Comput. Chem..

[20]  Dominik Gront,et al.  A simple lattice model that exhibits a protein-like cooperative all-or-none folding transition. , 2003, Biopolymers.

[21]  J. Skolnick,et al.  TOUCHSTONE II: a new approach to ab initio protein structure prediction. , 2003, Biophysical journal.

[22]  A. Liwo,et al.  Addition of side chains to a known backbone with defined side-chain centroids. , 2002, Biophysical chemistry.

[23]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[24]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[25]  Valentina Tozzini,et al.  Coarse-grained models for proteins. , 2005, Current opinion in structural biology.

[26]  A. Liwo,et al.  Molecular dynamics with the united-residue model of polypeptide chains. II. Langevin and Berendsen-bath dynamics and tests on model alpha-helical systems. , 2005, The journal of physical chemistry. B.

[27]  A. Liwo,et al.  Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: assessment in two blind tests. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[29]  A. Liwo,et al.  Molecular dynamics with the united-residue model of polypeptide chains. I. Lagrange equations of motion and tests of numerical stability in the microcanonical mode. , 2005, The journal of physical chemistry. B.

[30]  Lynda B. M. Ellis The ten most wanted solutions in protein bioinformatics , 2006 .

[31]  Cezary Czaplewski,et al.  Replica Exchange and Multicanonical Algorithms with the coarse-grained UNRES force field. , 2006, Journal of chemical theory and computation.

[32]  A. Liwo,et al.  Implementation of a symplectic multiple-time-step molecular dynamics algorithm, based on the united-residue mesoscopic potential energy function. , 2006, The Journal of chemical physics.

[33]  David E. Kim,et al.  Physically realistic homology models built with ROSETTA can be more accurate than their templates. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[34]  A. Liwo,et al.  Modification and optimization of the united-residue (UNRES) potential energy function for canonical simulations. I. Temperature dependence of the effective energy function and tests of the optimization method with single training proteins. , 2007, The journal of physical chemistry. B.

[35]  John L. Klepeis,et al.  Anton, a special-purpose machine for molecular dynamics simulation , 2007, ISCA '07.

[36]  G. Voth Coarse-Graining of Condensed Phase and Biomolecular Systems , 2008 .

[37]  Adam Liwo,et al.  Simple physics-based analytical formulas for the potentials of mean force for the interaction of amino acid side chains in water. IV. Pairs of different hydrophobic side chains. , 2008, The journal of physical chemistry. B.

[38]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[39]  A. Liwo,et al.  Application of Multiplexed Replica Exchange Molecular Dynamics to the UNRES Force Field: Tests with alpha and alpha+beta Proteins. , 2009, Journal of chemical theory and computation.

[40]  Vijay S. Pande,et al.  Accelerating molecular dynamic simulation on graphics processing units , 2009, J. Comput. Chem..

[41]  Adam Liwo,et al.  Implementation of molecular dynamics and its extensions with the coarse-grained UNRES force field on massively parallel systems; towards millisecond-scale simulations of protein structure, dynamics, and thermodynamics. , 2010, Journal of chemical theory and computation.

[42]  A. Liwo,et al.  Simple physics-based analytical formulas for the potentials of mean force of the interaction of amino-acid side chains in water. VI. Oppositely charged side chains. , 2011, The journal of physical chemistry. B.

[43]  Adam Liwo,et al.  Coarse-grained force field: general folding theory. , 2011, Physical chemistry chemical physics : PCCP.

[44]  John Orban,et al.  Mutational tipping points for switching protein folds and functions. , 2012, Structure.