Assessing protein structures with a non-local atomic interaction energy.

We describe a new approach, based on the energy of non-local interactions, to assess protein structures. The method uses a very sensitive and accurate atomic mean force potential (AMFP) to calculate the non-local energy profile (NL-profile) of a proteins structure. Several protein models, built using the comparative modeling technique and containing several errors, were evaluated. These models exhibit a good stereochemistry and have been previously checked with different, widely used, methods that failed to detect the errors. The AMFP-derived energy profiles are able to correlate high scores with point errors and misalignments in the models. The point errors are frequently found in loops or regions of structural differences between the template and the target protein. The misalignments are clearly detected with very high scores. The performance of the method was also tested for the assessment of X-ray solved protein structures. In a data set of 143 well solved and non-redundant protein structures, we find that the average energy Z-scores, obtained from AMFP, increase as the resolution decreases. In the case of structures that have already been described as having an unusual stereochemistry, very high Z-scores are obtained. Moreover, energy calculations for some pairs of obsolete and replacement proteins always show higher Z-scores for the obsolete proteins. Finally, two particular cases show the usefulness of the profiles in the assessment of X-ray solved protein structures. First, the NL-profile of a protein structure refined in the incorrect space group has very high scores in several regions. One region has already been described to be out-of-register with the density map of the structure. The NL-profile of the re-refined structure with the correct space group is vastly improved. In the second case, the method is able to accurately point out disordered residues, even if the atoms of these residues do not violate the sum of the van der Waals radii. ANOLEA, the program used to calculate the NL-profile of a protein structure containing one or more chains is accessible through the World Wide Web at: http://www.fundp.ac.be/pub/ANOLEA.html.

[1]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[2]  R J Read,et al.  Critical evaluation of comparative model building of Streptomyces griseus trypsin. , 1984, Biochemistry.

[3]  T. Jones,et al.  Between objectivity and subjectivity , 1990, Nature.

[4]  G. Casari,et al.  Identification of native protein folds amongst a large number of incorrect models. The calculation of low energy conformations from potentials of mean force. , 1990, Journal of molecular biology.

[5]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[6]  M. Sippl,et al.  Detection of native‐like models for amino acid sequences of unknown three‐dimensional structure in a data base of known protein conformations , 1992, Proteins.

[7]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[8]  C. Sander,et al.  Evaluation of protein models by atomic solvation preference. , 1992, Journal of molecular biology.

[9]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[10]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[11]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[12]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[13]  Manfred J. Sippl,et al.  Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures , 1993, J. Comput. Aided Mol. Des..

[14]  A. Korn,et al.  Torsion angle differences as a means of pinpointing local polypeptide chain trajectory changes for identical proteins in different conformational states. , 1994, Protein engineering.

[15]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[16]  M. James,et al.  A critical assessment of comparative molecular modeling of tertiary structures of proteins * , 1995, Proteins.

[17]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[18]  R. E. Marsh Some Thoughts on Choosing the Correct Space Group , 1995 .

[19]  G J Kleywegt,et al.  Where freedom is given, liberties are taken. , 1995, Structure.

[20]  C Sander,et al.  The use of position‐specific rotamers in model building by homology , 1995, Proteins.

[21]  K Fidelis,et al.  A large‐scale experiment to assess protein structure prediction methods , 1995, Proteins.

[22]  M. Karplus,et al.  Evaluation of comparative protein modeling by MODELLER , 1995, Proteins.

[23]  R Abagyan,et al.  Homology modeling by the ICM method , 1995, Proteins.

[24]  S. Wodak,et al.  Protein structure prediction by threading methods: Evaluation of current techniques , 1995, Proteins.

[25]  Ram Samudrala,et al.  Confronting the problem of interconnected structural changes in the comparative modeling of proteins , 1995, Proteins.

[26]  S. Wodak,et al.  Deviations from standard atomic volumes as a quality measure for protein crystal structures. , 1996, Journal of molecular biology.

[27]  G J Kleywegt,et al.  A re-evaluation of the crystal structure of chloromuconate cycloisomerase. , 1996, Acta crystallographica. Section D, Biological crystallography.

[28]  M. Totrov,et al.  Contact area difference (CAD): a robust measure to evaluate accuracy of protein models. , 1997, Journal of molecular biology.

[29]  E S Huang,et al.  Factors affecting the ability of energy functions to discriminate correct from incorrect folds. , 1997, Journal of molecular biology.

[30]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[31]  Roland L. Dunbrack,et al.  Meeting review: the Second meeting on the Critical Assessment of Techniques for Protein Structure Prediction (CASP2), Asilomar, California, December 13-16, 1996. , 1997, Folding & design.

[32]  F. Melo,et al.  Novel knowledge-based mean force potential at atomic level. , 1997, Journal of molecular biology.