A correlation‐based method for the enhancement of scoring functions on funnel‐shaped energy landscapes

A correlation‐based approach is introduced for enhancing the ability of structure‐scoring methods to identify and distinguish native‐like conformations. The proposed method relies on a funnel‐shaped scoring function that decreases steadily toward the native state. It takes advantage of the idea that the structure from a given ensemble that is closest to the native basin leads to the highest correlation coefficient between a given score and distance to that structure as an approximation of the native state for the entire ensemble. The method is applied successfully to a number of different test cases that demonstrate substantial improvements in the correlation of the score with the distance from the true native state but also result in the selection of more native‐like structures compared to the original score. Proteins 2006. © 2006 Wiley‐Liss, Inc.

[1]  Michael Feig,et al.  MMTSB Tool Set: enhanced sampling and multiscale modeling methods for applications in structural biology. , 2004, Journal of molecular graphics & modelling.

[2]  I. Shimada,et al.  Three-dimensional solution structure of the B domain of staphylococcal protein A: comparisons of the solution and crystal structures. , 1992, Biochemistry.

[3]  N. Grishin,et al.  Practical lessons from protein structure prediction , 2005, Nucleic acids research.

[4]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[5]  J. Skolnick,et al.  An Efficient Monte Carlo Model of Protein Chains. Modeling the Short-Range Correlations between Side Group Centers of Mass , 1998 .

[6]  Michael Feig,et al.  Evaluating CASP4 predictions with physical energy functions , 2002, Proteins.

[7]  Richard Bonneau,et al.  An improved protein decoy set for testing energy functions for protein structure prediction , 2003, Proteins.

[8]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[9]  Jeffrey Skolnick,et al.  Finding the needle in a haystack: educing native folds from ambiguous ab initio protein structure predictions , 2001, J. Comput. Chem..

[10]  Miroslaw Cygler,et al.  The structural genomics experimental pipeline: Insights from global target lists , 2004, Proteins.

[11]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[12]  Lisa N Kinch,et al.  CASP5 assessment of fold recognition target predictions , 2003, Proteins.

[13]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[14]  Andrej Sali,et al.  Comparative Protein Structure Modeling in Genomics , 1999 .

[15]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[16]  R. Schulz,et al.  Protein Structure Prediction , 2020, Methods in Molecular Biology.

[17]  Harold A. Scheraga,et al.  Protein structure prediction using a combination of sequence homology and global energy minimization I. Global energy minimization of surface loops , 1990 .

[18]  J. Janin,et al.  Computer analysis of protein-protein interaction. , 1978, Journal of molecular biology.

[19]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[20]  Frank Alber,et al.  A structural perspective on protein-protein interactions. , 2004, Current opinion in structural biology.

[21]  Anna Tramontano,et al.  Assessment of homology‐based predictions in CASP5 , 2003, Proteins.

[22]  M Feig,et al.  Accurate reconstruction of all‐atom protein representations from side‐chain‐based low‐resolution models , 2000, Proteins.

[23]  Chris Sander,et al.  Completeness in structural genomics , 2001, Nature Structural Biology.

[24]  J. Skolnick,et al.  A distance‐dependent atomic knowledge‐based potential for improved protein structure selection , 2001, Proteins.

[25]  Themis Lazaridis,et al.  Binding Affinity and Specificity from Computational Studies , 2002 .

[26]  C. Brooks,et al.  First-principles calculation of the folding free energy of a three-helix bundle protein. , 1995, Science.

[27]  Adam Zemla,et al.  Critical assessment of methods of protein structure prediction (CASP)‐round V , 2005, Proteins.

[28]  I. Vakser,et al.  How common is the funnel‐like energy landscape in protein‐protein interactions? , 2001, Protein science : a publication of the Protein Society.

[29]  B. Rost,et al.  Critical assessment of methods of protein structure prediction (CASP)—Round 6 , 2005, Proteins.

[30]  Patrick Aloy,et al.  Predictions without templates: New folds, secondary structure, and contacts in CASP5 , 2003, Proteins.

[31]  C. Brooks,et al.  Novel generalized Born methods , 2002 .

[32]  Charles L. Brooks,et al.  New analytic approximation to the standard molecular volume definition and its application to generalized born calculations , 2003, J. Comput. Chem..

[33]  M. Gerstein,et al.  Structural Genomics: Current Progress , 2003, Science.

[34]  Song Liu,et al.  A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes. , 2005, Journal of medicinal chemistry.

[35]  William H. Press,et al.  Numerical recipes in C , 2002 .

[36]  Zukang Feng,et al.  The Protein Data Bank and structural genomics , 2003, Nucleic Acids Res..

[37]  Dominik Gront,et al.  HCPM - program for hierarchical clustering of protein models , 2005, Bioinform..

[38]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[39]  Richard Bonneau,et al.  Ab initio protein structure prediction: progress and prospects. , 2001, Annual review of biophysics and biomolecular structure.

[40]  Richard Bonneau,et al.  Rosetta in CASP4: Progress in ab initio protein structure prediction , 2001, Proteins.

[41]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[42]  Daisuke Kihara,et al.  Ab initio protein structure prediction on a genomic scale: Application to the Mycoplasma genitalium genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[43]  K. Dill Polymer principles and protein folding , 1999, Protein science : a publication of the Protein Society.

[44]  J. Watson,et al.  Crystal structure of Bacillus subtilis YckF: structural and functional evolution. , 2004, Journal of structural biology.

[45]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[46]  Maria I. Zavodszky,et al.  Distilling the essential features of a protein surface for improving protein-ligand docking, scoring, and virtual screening , 2002, J. Comput. Aided Mol. Des..

[47]  A. Sali,et al.  Large-scale protein structure modeling of the Saccharomyces cerevisiae genome. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[49]  E. Huang,et al.  Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions. , 1999, Journal of molecular biology.

[50]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[51]  A. Kolinski Protein modeling and structure prediction with a reduced representation. , 2004, Acta biochimica Polonica.

[52]  R. Nussinov,et al.  Folding funnels and binding mechanisms. , 1999, Protein engineering.

[53]  Yang Zhang,et al.  SPICKER: A clustering approach to identify near‐native protein folds , 2004, J. Comput. Chem..

[54]  S. Wodak,et al.  Assessment of CAPRI predictions in rounds 3–5 shows progress in docking procedures , 2005, Proteins.