Local energy landscape flattening: Parallel hyperbolic Monte Carlo sampling of protein folding

Among the major difficulties in protein structure prediction is the roughness of the energy landscape that must be searched for the global energy minimum. To address this issue, we have developed a novel Monte Carlo algorithm called parallel hyperbolic sampling (PHS) that logarithmically flattens local high‐energy barriers and, therefore, allows the simulation to tunnel more efficiently through energetically inaccessible regions to low‐energy valleys. Here, we show the utility of this approach by applying it to the SICHO (SIde‐CHain‐Only) protein model. For the same CPU time, the parallel hyperbolic sampling method can identify much lower energy states and explore a larger region phase space than the commonly used replica sampling (RS) Monte Carlo method. By clustering the simulated structures obtained in the PHS implementation of the SICHO model, we can successfully predict, among a representative benchmark 65 proteins set, 50 cases in which one of the top 5 clusters have a root‐mean‐square deviation (RMSD) from the native structure below 6.5 Å. Compared with our previous calculations that used RS as the conformational search procedure, the number of successful predictions increased by four and the CPU cost is reduced. By comparing the structure clusters produced by both PHS and RS, we find a strong correlation between the quality of predicted structures and the minimum relative RMSD (mrRMSD) of structures clusters identified by using different search engines. This mrRMSD correlation may be useful in blind prediction as an indicator of the likelihood of successful folds. Proteins 2002;48:192–201. © 2002 Wiley‐Liss, Inc.

[1]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[2]  Wang,et al.  Replica Monte Carlo simulation of spin glasses. , 1986, Physical review letters.

[3]  H. Scheraga,et al.  On the multiple-minima problem in the conformational analysis of molecules: deformation of the potential energy hypersurface by the diffusion equation method , 1989 .

[4]  J. Straub,et al.  Approximate solution of the classical Liouville equation using Gaussian phase packet dynamics: Application to enhanced equilibrium averaging and global optimization , 1993 .

[5]  H A Scheraga,et al.  Theoretical prediction of a crystal structure. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[6]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[7]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[8]  J. Skolnick,et al.  Assembly of protein structure from sparse experimental data: An efficient Monte Carlo model , 1998, Proteins.

[9]  I. Andricioaei,et al.  Global optimization using bad derivatives: Derivative‐free method for molecular energy minimization , 1998 .

[10]  Adam Liwo,et al.  An Efficient Deformation-Based Global Optimization Method for Off-Lattice Polymer Chains: Self-Consistent Basin-to-Deformed-Basin Mapping (SCBDBM). Application to United-Residue Polypeptide Chains , 1999 .

[11]  H. Scheraga,et al.  Global optimization of clusters, crystals, and biomolecules. , 1999, Science.

[12]  Gerard T. Barkema,et al.  Monte Carlo Methods in Statistical Physics , 1999 .

[13]  Yuko Okamoto,et al.  The Generalized-Ensemble Approach for Protein Folding Simulations , 1999 .

[14]  P Rotkiewicz,et al.  A method for the improvement of threading‐based protein models , 1999, Proteins.

[15]  David C. Jones,et al.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. , 1999, Journal of molecular biology.

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  J. Skolnick,et al.  Comparison of three Monte Carlo conformational search strategies for a proteinlike homopolymer model: Folding thermodynamics and identification of low-energy structures , 2000 .

[18]  J Skolnick,et al.  Defrosting the frozen approximation: PROSPECTOR— A new approach to threading , 2001, Proteins.

[19]  J. Skolnick,et al.  Parallel-hat tempering: A Monte Carlo search scheme for the identification of low-energy structures , 2001 .

[20]  Jeffrey Skolnick,et al.  Finding the needle in a haystack: educing native folds from ambiguous ab initio protein structure predictions , 2001, J. Comput. Chem..

[21]  P Rotkiewicz,et al.  Generalized comparative modeling (GENECOMP): A combination of sequence comparison, threading, and lattice modeling for protein structure prediction and refinement , 2001, Proteins.

[22]  J. Skolnick,et al.  TOUCHSTONE: An ab initio protein structure prediction method that uses threading-based tertiary restraints , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Daisuke Kihara,et al.  Ab initio protein structure prediction on a genomic scale: Application to the Mycoplasma genitalium genome , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Surface anchoring on layers of grafted liquid-crystalline chain molecules: A computer simulation , 2002, cond-mat/0205324.