Using motion planning to study protein folding pathways

We present a framework for studying protein folding pathways and potential landscapes which is based on techniques recently developed in the robotics motion planning community. In particular, our work uses Probabilistic Roadmap (PRM) motion planning techniques which have proven to be very successful for problems involving high-dimensional configuration spaces. Our results applying PRM techniques to several small proteins (60 residues) are very encouraging. The framework enables one to easily and efficiently compute folding pathways from any denatured starting state to the native fold. This aspect makes our approach ideal for studying global properties of the protein's potential landscape. For example, our results show that folding pathways from different starting denatured states sometimes share some common `gullies', mainly when they are close to the native fold. Such global issues are difficult to simulate and study with other methods. Our focus in this work is to study the protein folding mechanism assuming we know the native fold. Therefore, instead of performing fold prediction, we aim to study issues related to the folding process, such as the formation of secondary and tertiary structure, and the dependence on the initial conformation. Our results indicate that for some proteins, secondary structure clearly forms first while for others the tertiary structure is obtained more directly, and moreover, these situations seem to be differentiated in the distributions of the conformations sampled by our technique. We also find that the formation order is independent of the starting denatured conformation. We validate our results by comparing the secondary structure formation order on our paths to known pulse-labeling experimental results. This indicates the promise of our approach for studying proteins for which experimental results are not available.

[1]  Jason H. Cantarella,et al.  NONTRIVIAL EMBEDDINGS OF POLYGONAL INTERVALS AND UNKNOTS IN 3-SPACE , 1998 .

[2]  Lydia E. Kavraki,et al.  Probabilistic roadmaps for path planning in high-dimensional configuration spaces , 1996, IEEE Trans. Robotics Autom..

[3]  Jürgen Sühnel,et al.  Image library of biological macromolecules , 1996, Comput. Appl. Biosci..

[4]  A. Gronenborn,et al.  Fast folding of a prototypic polypeptide: The immunoglobulin binding domain of streptococcal protein G , 1994, Protein science : a publication of the Protein Society.

[5]  E. Demaine,et al.  Folding and unfolding , 2002 .

[6]  Jan Reichert,et al.  The IMB Jena Image Library of Biological Macromolecules - New Features , 2001, German Conference on Bioinformatics.

[7]  Nancy M. Amato,et al.  Using Motion Planning to Study Protein Folding Pathways , 2002, J. Comput. Biol..

[8]  Nancy M. Amato,et al.  A Kinematics-Based Probabilistic Roadmap Method for Closed Chain Systems , 2001 .

[9]  C. Matthews,et al.  Pathways of protein folding. , 1993, Annual review of biochemistry.

[10]  J. Jung,et al.  Protein structure prediction. , 2001, Current opinion in chemical biology.

[11]  E. Alm,et al.  Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Nancy M. Amato,et al.  How Does It Fold? Searching for Folding Pathways using A Motion Planning Approach , 2000 .

[13]  Lydia E. Kavraki,et al.  Random networks in configuration space for fast path planning , 1994 .

[14]  Nancy M. Amato,et al.  MAPRM: a probabilistic roadmap planner with sampling on the medial axis of the free space , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[15]  D. Covell Folding protein α‐carbon chains into compact forms by monte carlo methods , 1992 .

[16]  M. Levitt,et al.  Realistic simulations of native-protein dynamics in solution and beyond. , 1993, Annual review of biophysics and biomolecular structure.

[17]  Liang Lu,et al.  Folding cartons with fixtures: a motion planning approach , 1999, IEEE Trans. Robotics Autom..

[18]  V. Muñoz,et al.  Submillisecond kinetics of protein folding. , 1997, Current opinion in structural biology.

[19]  B. Honig Protein folding: from the levinthal paradox to structure prediction. , 1999, Journal of molecular biology.

[20]  Nancy M. Amato,et al.  Customizing PRM roadmaps at query time , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[21]  Lydia E. Kavraki,et al.  A probabilistic roadmap approach for systems with closed kinematic chains , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[22]  Joseph O'Rourke,et al.  Folding and Unfolding in Computational Geometry , 1998, JCDCG.

[23]  M. Levitt,et al.  Protein folding: the endgame. , 1997, Annual review of biochemistry.

[24]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[25]  K. Dill,et al.  Protein core assembly processes , 1993 .

[26]  Jean-Claude Latombe,et al.  A Motion Planning Approach to Flexible Ligand Binding , 1999, ISMB.

[27]  Mark H. Overmars,et al.  The Gaussian sampling strategy for probabilistic roadmap planners , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[28]  R. Li,et al.  The hydrogen exchange core and protein folding , 1999, Protein science : a publication of the Protein Society.

[29]  E. Shakhnovich Theoretical studies of protein-folding thermodynamics and kinetics. , 1997, Current opinion in structural biology.

[30]  M. Levitt Protein folding by restrained energy minimization and molecular dynamics. , 1983, Journal of molecular biology.

[31]  J. Skolnick Monte Carlo Simulations of Protein Folding , 2003 .

[32]  D. Eisenberg,et al.  An evolutionary approach to folding small alpha-helical proteins that uses sequence information and an empirical guiding fitness function. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Nancy M. Amato,et al.  A Motion Planning Approach to Folding: From Paper Craft to Protein Structure Prediction , 2000 .

[34]  Nancy M. Amato,et al.  A probabilistic method for rigid-body motion planning using sampling from the medial axis of the free space , 1999 .

[35]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[36]  Matthews Cr PATHWAYS OF PROTEIN FOLDING , 1993 .

[37]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[38]  Nancy M. Amato,et al.  Enhancing Randomized Motion Planners: Exploring with Haptic Hints , 2001, Auton. Robots.

[39]  D. Baker,et al.  A surprising simplicity to protein folding , 2000, Nature.

[40]  Jr. George N. Reeke Protein folding: computational approaches to an exponential-time problem , 1988 .

[41]  Lydia E. Kavraki,et al.  On finding narrow passages with probabilistic roadmap planners , 1998 .

[42]  Nancy M. Amato,et al.  Ligand Binding with OBPRM and Haptic User Input: Enhancing Automatic Motion Planning with Virtual Touch , 2000 .

[43]  B. Faverjon,et al.  Probabilistic Roadmaps for Path Planning in High-Dimensional Con(cid:12)guration Spaces , 1996 .

[44]  Daniel Vallejo,et al.  OBPRM: an obstacle-based PRM for 3D workspaces , 1998 .

[45]  S. Sun,et al.  Reduced representation model of protein structure prediction: Statistical potential and genetic algorithms , 1993, Protein science : a publication of the Protein Society.

[46]  G. N. Ramachandran,et al.  Conformation of polypeptides and proteins. , 1968, Advances in protein chemistry.

[47]  K. Dill,et al.  A simple protein folding algorithm using a binary code and secondary structure constraints. , 1995, Protein engineering.

[48]  Erik D. Demaine,et al.  Locked and unlocked polygonal chains in 3D , 1998, SODA '99.

[49]  M. Sternberg Protein Structure Prediction: A Practical Approach , 1997 .

[50]  V. Muñoz,et al.  A simple model for calculating the kinetics of protein folding from three-dimensional structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.