Efficient Energy Computation for Monte Carlo Simulation of Proteins

Monte Carlo simulation (MCS) is a common methodology to compute pathways and thermodynamic properties of proteins. A simulation run is a series of random steps in conformation space, each perturbing some degrees of freedom of the molecule. A step is accepted with a probability that depends on the change in value of an energy function. Typical energy functions sum many terms. The most costly ones to compute are contributed by atom pairs closer than some cutoff distance. This paper introduces a new method that speeds up MCS by efficiently computing the energy at each step. The method exploits the facts that proteins are long kinematic chains and that few degrees of freedom are changed at each step. A novel data structure, called the ChainTree, captures both the kinematics and the shape of a protein at successive levels of detail. It is used to find all atom pairs contributing to the energy. It also makes it possible to identify partial energy sums left unchanged by a perturbation, thus allowing the energy value to be incrementally updated. Computational tests on four proteins of sizes ranging from 68 to 755 amino acids show that MCS with the ChainTree method is significantly faster (as much as 12 times faster for the largest protein) than with the widely used grid method. They also indicate that speed-up increases with larger proteins.

[1]  K. Dill,et al.  A simple protein folding algorithm using a binary code and secondary structure constraints. , 1995, Protein engineering.

[2]  H. Scheraga,et al.  Monte Carlo-minimization approach to the multiple-minima problem in protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. Karplus,et al.  Discrimination of the native from misfolded protein models with an energy function including implicit solvation. , 1999, Journal of molecular biology.

[4]  U H Hansmann,et al.  New Monte Carlo algorithms for protein folding. , 1999, Current opinion in structural biology.

[5]  Bruce J. Berne,et al.  On a novel Monte Carlo scheme for simulating water and aqueous solutions , 1978 .

[6]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[7]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[8]  D. Eisenberg Proteins. Structures and molecular properties, T.E. Creighton. W. H. Freeman and Company, New York (1984), 515, $36.95 , 1985 .

[9]  J. Valverde Molecular Modelling: Principles and Applications , 2001 .

[10]  J. Skolnick,et al.  Local energy landscape flattening: Parallel hyperbolic Monte Carlo sampling of protein folding , 2002, Proteins.

[11]  Jean-Claude Latombe,et al.  Algorithmic tools for real-time microsurgery simulation , 2002, Medical Image Anal..

[12]  N. Go,et al.  Noninteracting local‐structure model of folding and unfolding transition in globular proteins. I. Formulation , 1981, Biopolymers.

[13]  S. Sun,et al.  Reduced representation model of protein structure prediction: Statistical potential and genetic algorithms , 1993, Protein science : a publication of the Protein Society.

[14]  Sean Quinlan,et al.  Efficient distance computation between non-convex objects , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[15]  Philip M. Hubbard,et al.  Approximating polyhedra with spheres for time-critical collision detection , 1996, TOGS.

[16]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[17]  Takeshi Kikuchi,et al.  Inter‐Cα atomic potentials derived from the statistics of average interresidue distances in proteins: Application to bovine pancreatic trypsin inhibitor , 1996 .

[18]  J. Mccammon,et al.  Simulation methods for protein structure fluctuations , 1980, Biopolymers.

[19]  Gino van den Bergen Efficient Collision Detection of Complex Deformable Models using AABB Trees , 1997, J. Graphics, GPU, & Game Tools.

[20]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[21]  P Argos,et al.  Optimal protocol and trajectory visualization for conformational searches of peptides and proteins. , 1992, Journal of molecular biology.

[22]  Jean-Claude Latombe,et al.  Efficient maintenance and self-collision testing for Kinematic Chains , 2002, SCG '02.

[23]  J Moult,et al.  Protein folding simulations with genetic algorithms and a detailed molecular description. , 1997, Journal of molecular biology.

[24]  Dinesh Manocha,et al.  Fast distance queries with rectangular swept sphere volumes , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[25]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[26]  H. Stanley,et al.  Statistical physics of macromolecules , 1995 .

[27]  Lee,et al.  New Monte Carlo algorithm: Entropic sampling. , 1993, Physical review letters.

[28]  A. Elofsson,et al.  Local moves: An efficient algorithm for simulation of protein folding , 1995, Proteins.

[29]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[30]  Ruben Abagyan,et al.  Ab InitioFolding of Peptides by the Optimal-Bias Monte Carlo Minimization Procedure , 1999 .

[31]  Mark H. Overmars,et al.  Spheres, molecules, and hidden surface removal , 1994, SCG '94.

[32]  Leonidas J. Guibas,et al.  Collision detection for deforming necklaces , 2002, SCG '02.

[33]  Joseph S. B. Mitchell,et al.  Efficient Collision Detection Using Bounding Volume Hierarchies of k-DOPs , 1998, IEEE Trans. Vis. Comput. Graph..

[34]  E. Shakhnovich,et al.  The folding thermodynamics and kinetics of crambin using an all-atom Monte Carlo simulation. , 2000, Journal of molecular biology.

[35]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[36]  J. Skolnick,et al.  Parallel-hat tempering: A Monte Carlo search scheme for the identification of low-energy structures , 2001 .

[37]  E. Shakhnovich,et al.  The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[38]  K. Binder,et al.  Monte Carlo Simulation in Statistical Physics , 1992, Graduate Texts in Physics.

[39]  Akinori Kidera Smart Monte Carlo simulation of a globular protein , 1999 .

[40]  Eugene I. Shakhnovich,et al.  A structure-based method for derivation of all-atom potentials for protein folding , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  A NEAR-NEIGHBOUR ALGORITHM FOR METROPOLIS MONTE CARLO SIMULATIONS. , 1988 .

[42]  Dinesh Manocha,et al.  OBBTree: a hierarchical structure for rapid interference detection , 1996, SIGGRAPH.