Prediction of protein loop geometries in solution

The ability to determine the structure of a protein in solution is a critical tool for structural biology, as proteins in their native state are found in aqueous environments. Using a prediction protocol based on physical chemistry, we demonstrate the ability to reproduce protein loop geometries in experimentally derived solution structures. Predictions were run on loops drawn from (1) NMR entries in the Protein Data Bank (PDB), and (2) the RECOORD database, in which NMR entries from the PDB have been standardized and re-refined in explicit solvent. The predicted structures are validated by four measures: comparison with experimental distance restraints, a test of structural quality as defined by the WHAT IF structure validation program, root mean square deviation (RMSD) of the predicted loops from the original structural models, and comparison of the precision of the original and predicted ensembles. For the RECOORD ensembles, the predicted loops are consistent with an average of 95%, 91%, and 87% of experimental restraints for the short, medium, and long loops, respectively. Prediction accuracy is strongly affected by the quality of the original models: in the PDB-derived ensembles, the percentage of experimental restraints violated increases by 2% for the short loops and by 9% for both the medium and long loops. We anticipate the application of our protocol to theoretical modeling of protein structures, such as fold recognition methods, as well as to experimental determination of protein structures, or segments, for which only sparse NMR restraint data are available. Proteins 2007. © 2007 Wiley-Liss, Inc.
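Two of the validation measures named above, loop RMSD and the fraction of experimental distance restraints satisfied, can be illustrated with a minimal Python sketch. This is not the authors' protocol. The function names, the data layout (coordinate tuples in Å, restraints as atom pairs with an upper distance bound), and the `tolerance` parameter are assumptions introduced here for illustration, and the RMSD is computed without superposition on the assumption that the structures are already aligned on the rest of the protein.

```python
import math


def loop_rmsd(predicted, reference):
    """RMSD between two equal-length lists of (x, y, z) coordinates.

    No superposition is performed; the inputs are assumed to be in a
    common frame already (hypothetical convention for this sketch).
    """
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared_sum = sum(
        (p[i] - r[i]) ** 2
        for p, r in zip(predicted, reference)
        for i in range(3)
    )
    return math.sqrt(squared_sum / len(predicted))


def fraction_restraints_satisfied(coords, restraints, tolerance=0.0):
    """Fraction of upper-bound distance restraints met by a structure.

    coords: dict mapping an atom label to its (x, y, z) position.
    restraints: list of (atom_a, atom_b, upper_bound) tuples.
    tolerance: extra slack allowed before a restraint counts as violated
               (an assumed parameter, not taken from the paper).
    """
    if not restraints:
        raise ValueError("at least one restraint is required")
    satisfied = 0
    for atom_a, atom_b, upper_bound in restraints:
        distance = math.dist(coords[atom_a], coords[atom_b])
        if distance <= upper_bound + tolerance:
            satisfied += 1
    return satisfied / len(restraints)


# Toy data: two atoms displaced by 1 unit along z in the prediction.
predicted_loop = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
reference_loop = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
rmsd_value = loop_rmsd(predicted_loop, reference_loop)

# Toy restraints: A-B is exactly at its bound, A-C exceeds its bound.
toy_coords = {"A": (0.0, 0.0, 0.0), "B": (3.0, 4.0, 0.0), "C": (0.0, 0.0, 6.0)}
toy_restraints = [("A", "B", 5.0), ("A", "C", 5.0)]
satisfied_fraction = fraction_restraints_satisfied(toy_coords, toy_restraints)
```

Averaging `fraction_restraints_satisfied` over the members of an ensemble would give a percentage comparable in spirit to the figures quoted in the abstract, though how the authors actually treat restraint bounds and averaging is not specified here.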
