Improving the orientation‐dependent statistical potential using a reference state

Statistical potentials are frequently engaged in the protein structural prediction and protein folding for conformational evaluation. Theoretically, to describe the many‐body effect, pairwise interaction between two atom groups should be corrected by their relative geometric orientation. The potential functions developed by this means are called orientation‐dependent statistical potentials and have exhibited substantially improved performance. However, none of the currently available orientation‐dependent statistical potentials use any reference state, which has been proven to greatly enhance the power of distance‐dependent statistical potentials in numerous previous studies. In this work, we designed a reasonable reference state for the orientation‐dependent statistical potentials: using the average geometric relationship between atom pairs in known structures by neglecting their residue identities. The statistical potential developed using this reference state (called ORDER_AVE) prevails most available rival potentials in a series of tests on the decoy sets, although the information of side chain atoms (except the β‐carbon) is absent in its construction. Proteins 2014; 82:2383–2393. © 2014 Wiley Periodicals, Inc.

[1]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[2]  Evandro Ferrada,et al.  A Knowledge-Based Potential with an Accurate Description of Local Interactions Improves Discrimination between Native and Near-Native Protein Conformations , 2007, Cell Biochemistry and Biophysics.

[3]  Ram Samudrala,et al.  A Combined Approach for Ab Initio Construction of Low Resolution Protein Tertiary Structures from Sequence , 1999, Pacific Symposium on Biocomputing.

[4]  C. Brooks,et al.  Exploring Flory's isolated-pair hypothesis: Statistical mechanics of helix–coil transitions in polyalanine and the C-peptide from RNase A , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Skolnick,et al.  Ab initio modeling of small proteins by iterative TASSER simulations , 2007, BMC Biology.

[6]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[7]  George D Rose,et al.  Reducing the dimensionality of the protein‐folding search problem , 2012, Protein science : a publication of the Protein Society.

[8]  James E. Fitzgerald,et al.  Mimicking the folding pathway to improve homology-free protein structure prediction , 2009, Proceedings of the National Academy of Sciences.

[9]  Changiz Eslahchi,et al.  A pairwise residue contact area-based mean force potential for discrimination of native protein structure , 2010, BMC Bioinformatics.

[10]  Yang Zhang,et al.  A Novel Side-Chain Orientation Dependent Potential Derived from Random-Walk Reference State for Protein Fold Selection and Structure Prediction , 2010, PloS one.

[11]  Alfonso Valencia,et al.  Assessment of intramolecular contact predictions for CASP7 , 2007, Proteins.

[12]  Hongyi Zhou,et al.  An accurate, residue‐level, pair potential of mean force for folding and binding based on the distance‐scaled, ideal‐gas reference state , 2004, Protein science : a publication of the Protein Society.

[13]  Yang Zhang,et al.  Ab initio protein structure assembly using continuous structure fragments and optimized knowledge‐based force field , 2012, Proteins.

[14]  J. Skolnick,et al.  A distance‐dependent atomic knowledge‐based potential for improved protein structure selection , 2001, Proteins.

[15]  Manfred J. Sippl,et al.  Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures , 1993, J. Comput. Aided Mol. Des..

[16]  Changiz Eslahchi,et al.  A distance‐dependent atomic knowledge‐based potential and force for discrimination of native structures from decoys , 2009, Proteins.

[17]  Andrzej Kloczkowski,et al.  Potentials 'R'Us web-server for protein energy estimations with coarse-grained knowledge-based potentials , 2010, BMC Bioinformatics.

[18]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[19]  J. Danielsson,et al.  15N relaxation study of the amyloid β‐peptide: structural propensities and persistence length , 2006, Magnetic resonance in chemistry : MRC.

[20]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[21]  Andrej Sali,et al.  Minimalist representations and the importance of nearest neighbor effects in protein folding simulations. , 2006, Journal of molecular biology.

[22]  Richard Bonneau,et al.  An improved protein decoy set for testing energy functions for protein structure prediction , 2003, Proteins.

[23]  Burkhard Rost,et al.  PROFcon: novel prediction of long-range contacts , 2005, Bioinform..

[24]  A. Sali,et al.  Statistical potentials for fold assessment , 2009 .

[25]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.

[26]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[27]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[28]  Haipeng Gong,et al.  Using the unfolded state as the reference state improves the performance of statistical potentials. , 2012, Biophysical journal.

[29]  S. Ozkan,et al.  Local and non-local native topologies reveal the underlying folding landscape of proteins , 2011, Physical biology.

[30]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[31]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[32]  Andrzej Kloczkowski,et al.  A global machine learning based scoring function for protein structure prediction , 2014, Proteins.

[33]  A. Sali,et al.  Comparative protein structure modeling by iterative alignment, model building and model assessment. , 2003, Nucleic Acids Research.

[34]  Yaoqi Zhou,et al.  Specific interactions for ab initio folding of protein terminal regions with secondary structures , 2008, Proteins.

[35]  Samuel L. DeLuca,et al.  Small-molecule ligand docking into comparative models with Rosetta , 2013, Nature Protocols.

[36]  Jing He,et al.  A distance- and orientation-dependent energy function of amino acid key blocks. , 2014, Biopolymers.

[37]  Yaoqi Zhou,et al.  Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all‐atom statistical energy functions , 2008, Protein science : a publication of the Protein Society.

[38]  P. Bradley,et al.  High-resolution structure prediction and the crystallographic phase problem , 2007, Nature.

[39]  Andrzej Kloczkowski,et al.  A global machine learning based scoring function for protein structure prediction , 2014 .

[40]  Andrzej Kloczkowski,et al.  Four‐body contact potentials derived from two protein datasets to discriminate native structures from decoys , 2007, Proteins.

[41]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[42]  G. Rose,et al.  Sterics and solvation winnow accessible conformational space for unfolded proteins. , 2005, Journal of molecular biology.

[43]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[44]  James E Fitzgerald,et al.  Reduced Cβ statistical potentials can outperform all‐atom potentials in decoy identification , 2007, Protein science : a publication of the Protein Society.

[45]  Robert L Jernigan,et al.  How effective for fold recognition is a potential of mean force that includes relative orientations between contacting residues in proteins? , 2005, The Journal of chemical physics.

[46]  András Fiser,et al.  New statistical potential for quality assessment of protein models and a survey of energy functions , 2010, BMC Bioinformatics.

[47]  B. McConkey,et al.  Discrimination of native protein structures using atom–atom contact scoring , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Sumudu P. Leelananda,et al.  Multibody coarse‐grained potentials for native structure recognition and quality assessment of protein models , 2011, Proteins.

[49]  Deok-Soo Kim,et al.  A beta‐complex statistical four body contact potential combined with a hydrogen bond statistical potential recognizes the correct native structure from protein decoy sets , 2013, Proteins.

[50]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[51]  R Samudrala,et al.  Ab initio construction of protein tertiary structures using a hierarchical approach. , 2000, Journal of molecular biology.

[52]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[53]  Sarel J Fleishman,et al.  Emerging themes in the computational design of novel enzymes and protein–protein interfaces , 2013, FEBS letters.

[54]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[55]  Bin Xue,et al.  Predicting residue–residue contact maps by a two‐layer, integrated neural‐network method , 2009, Proteins.

[56]  M. Levitt,et al.  A novel approach to decoy set generation: designing a physical energy function having local minima with native structure characteristics. , 2003, Journal of molecular biology.

[57]  R. Srinivasan,et al.  The Flory isolated-pair hypothesis is not valid for polypeptide chains: implications for protein folding. , 2000, Proceedings of the National Academy of Sciences of the United States of America.