Structure-Guided Protein Transition Modeling with a Probabilistic Roadmap Algorithm

Proteins are macromolecules in perpetual motion, switching between structural states to modulate their function. A detailed characterization of the precise yet complex relationship between protein structure, dynamics, and function requires elucidating transitions between functionally-relevant states. Doing so challenges both wet and dry laboratories, as protein dynamics involves disparate temporal scales. In this paper, we present a novel, sampling-based algorithm to compute transition paths. The algorithm exploits two main ideas. First, it leverages known structures to initialize its search and define a reduced conformation space for rapid sampling. This is key to address the insufficient sampling issue suffered by sampling-based algorithms. Second, the algorithm embeds samples in a nearest-neighbor graph where transition paths can be efficiently computed via queries. The algorithm adapts the probabilistic roadmap framework that is popular in robot motion planning. In addition to efficiently computing lowest-cost paths between any given structures, the algorithm allows investigating hypotheses regarding the order of experimentally-known structures in a transition event. This novel contribution is likely to open up new venues of research. Detailed analysis is presented on multiple-basin proteins of relevance to human disease. Multiscaling and the AMBER ff14SB force field are used to obtain energetically-credible paths at atomistic detail.

[1]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[2]  Michael A Hough,et al.  Variable metallation of human superoxide dismutase: atomic resolution crystal structures of Cu-Zn, Zn-Zn and as-isolated wild-type enzymes. , 2006, Journal of molecular biology.

[3]  Priyanka Prakash,et al.  Computational allosteric ligand binding site identification on Ras proteins. , 2015, Acta biochimica et biophysica Sinica.

[4]  Jochen S. Hub,et al.  Detection of Functional Modes in Protein Dynamics , 2010 .

[5]  T. Siméon,et al.  Modeling protein conformational transitions by a combination of coarse-grained normal mode analysis and robotics-inspired methods , 2013, BMC Structural Biology.

[6]  Amarda Shehu,et al.  A Principled Comparative Analysis of Dimensionality Reduction Techniques on Protein Structure Decoy Data , 2016 .

[7]  Lydia Tapia,et al.  Kinetics analysis methods for approximate folding landscapes , 2007, ISMB/ECCB.

[8]  J Andrew McCammon,et al.  Mapping the nucleotide and isoform-dependent structural and dynamical features of Ras proteins. , 2008, Structure.

[9]  Nancy M. Amato,et al.  A motion planning approach to folding: from paper craft to protein folding , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[10]  W. Kabsch,et al.  The Ras-RasGAP complex: structural basis for GTPase activation and its loss in oncogenic Ras mutants. , 1997, Science.

[11]  Nikolay V Dokholyan,et al.  Modifications of Superoxide Dismutase (SOD1) in Human Erythrocytes , 2009, Journal of Biological Chemistry.

[12]  Juan Cort Sampling-Based Path Planning on Configuration-Space Costmaps , 2010 .

[13]  Lydia E. Kavraki,et al.  Finding Solutions of the Inverse Kinematics Problems in Computer-aided Drug Design , 2002 .

[14]  Amarda Shehu,et al.  A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes , 2015, J. Comput. Biol..

[15]  Amarda Shehu,et al.  A stochastic roadmap method to model protein structural transitions , 2015, Robotica.

[16]  Daniel Russel,et al.  The structural dynamics of macromolecular processes. , 2009, Current opinion in cell biology.

[17]  I. Bahar,et al.  Coarse-grained normal mode analysis in structural biology. , 2005, Current opinion in structural biology.

[18]  JAMES DEMMEL,et al.  LAPACK: A portable linear algebra library for high-performance computers , 1990, Proceedings SUPERCOMPUTING '90.

[19]  Suryani Lukman,et al.  The Distinct Conformational Dynamics of K-Ras and H-Ras A59G , 2010, PLoS Comput. Biol..

[20]  Amarda Shehu,et al.  Basin Hopping as a General and Versatile Optimization Framework for the Characterization of Biological Macromolecules , 2012, Adv. Artif. Intell..

[21]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[22]  Leonidas J. Guibas,et al.  Inverse Kinematics in Biology: The Protein Loop Closure Problem , 2005, Int. J. Robotics Res..

[23]  A. D. McLachlan,et al.  A mathematical procedure for superimposing atomic coordinates of proteins , 1972 .

[24]  G. Chirikjian,et al.  Elastic models of conformational transitions in macromolecules. , 2002, Journal of molecular graphics & modelling.

[25]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[26]  Gregory S. Chirikjian,et al.  General methods for computing hyper-redundant manipulator inverse kinematics , 1993, Proceedings of 1993 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '93).

[27]  E. Polak Introduction to linear and nonlinear programming , 1973 .

[28]  Dinesh Manocha,et al.  Efficient inverse kinematics for general 6R manipulators , 1994, IEEE Trans. Robotics Autom..

[29]  Howie Choset,et al.  Principles of Robot Motion: Theory, Algorithms, and Implementation ERRATA!!!! 1 , 2007 .

[30]  Guang Song,et al.  Protein folding by motion planning , 2005, Physical biology.

[31]  G. Chirikjian,et al.  Efficient generation of feasible pathways for protein conformational transitions. , 2002, Biophysical journal.

[32]  Lydia Tapia,et al.  A Motion Planning Approach to Studying Molecular Motions , 2010, Commun. Inf. Syst..

[33]  Shawna L. Thomas,et al.  Simulating RNA folding kinetics on approximated energy landscapes. , 2008, Journal of molecular biology.

[34]  Amarda Shehu,et al.  A General, Adaptive, Roadmap-Based Algorithm for Protein Motion Computation , 2016, IEEE Transactions on NanoBioscience.

[35]  Harrison J. Hocker,et al.  Novel Allosteric Sites on Ras for Lead Generation , 2011, PloS one.

[36]  Michele Vendruscolo,et al.  A Coupled Equilibrium Shift Mechanism in Calmodulin-Mediated Signal Transduction , 2008, Structure.

[37]  Misha V Golynskiy,et al.  Rational design of FRET sensor proteins based on mutually exclusive domain interactions. , 2013, Biochemical Society transactions.

[38]  Erion Plaku,et al.  Computing transition paths in multiple-basin proteins with a probabilistic roadmap algorithm guided by structure data , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[39]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[40]  Chung F Wong,et al.  Protein simulation and drug design. , 2003, Advances in protein chemistry.

[41]  Gregory S. Chirikjian,et al.  Iterative cluster-NMA ( icNMA ) : A tool for generating conformational transitions in proteins , 2017 .

[42]  Erion Plaku,et al.  Region-Guided and Sampling-Based Tree Search for Motion Planning With Dynamics , 2015, IEEE Transactions on Robotics.

[43]  Murat Cirit,et al.  Allosteric Modulation of Ras-GTP Is Linked to Signal Transduction through RAF Kinase* , 2010, The Journal of Biological Chemistry.

[44]  James Andrew McCammon,et al.  Ras Conformational Switching: Simulating Nucleotide-Dependent Conformational Transitions with Accelerated Molecular Dynamics , 2009, PLoS Comput. Biol..

[45]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[46]  Juan Cortés,et al.  Randomized tree construction algorithm to explore energy landscapes , 2011, J. Comput. Chem..

[47]  Yung Doug Suh,et al.  Single-molecule surface-enhanced Raman spectroscopy: a perspective on the current status. , 2013, Physical chemistry chemical physics : PCCP.

[48]  Nancy M. Amato,et al.  A Kinematics-Based Probabilistic Roadmap Method for Closed Chain Systems , 2001 .

[49]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[50]  Kenneth A. De Jong,et al.  Mapping Multiple Minima in Protein Energy Landscapes with Evolutionary Algorithms , 2015, GECCO.

[51]  Lydia E Kavraki,et al.  Computational models of protein kinematics and dynamics: beyond simulation. , 2012, Annual review of analytical chemistry.

[52]  Thierry Siméon,et al.  Transition-based RRT for path planning in continuous cost spaces , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[53]  Rommie E Amaro,et al.  Editorial overview: Theory and simulation: Tools for solving the insolvable. , 2014, Current opinion in structural biology.

[54]  Dominik Gront,et al.  Backbone building from quadrilaterals: A fast and accurate algorithm for protein backbone reconstruction from alpha carbon coordinates , 2007, J. Comput. Chem..

[55]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[56]  Lydia E. Kavraki,et al.  Tracing conformational changes in proteins , 2009, BIBM 2009.

[57]  Andre Hoelz,et al.  Structural Evidence for Feedback Activation by Ras·GTP of the Ras-Specific Nucleotide Exchange Factor SOS , 2003, Cell.

[58]  Nancy M. Amato,et al.  Using Motion Planning to Map Protein Folding Landscapes and Analyze Folding Kinetics of Known Native Structures , 2003, J. Comput. Biol..

[59]  Ivet Bahar,et al.  Exploring the Conformational Transitions of Biomolecular Systems Using a Simple Two-State Anisotropic Network Model , 2014, PLoS Comput. Biol..

[60]  J. Valentine,et al.  SOD1 aggregation and ALS: role of metallation states and disulfide status. , 2013, Current topics in medicinal chemistry.

[61]  G. Hummer,et al.  Protein conformational transitions explored by mixed elastic network models , 2007, Proteins.

[62]  Dinesh Manocha,et al.  Conformational analysis of molecular chains using nano-kinematics , 1995, Comput. Appl. Biosci..

[63]  Lydia E. Kavraki,et al.  A New Method for Fast and Accurate Derivation of Molecular Conformations. , 2010 .

[64]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[65]  L. Kavraki,et al.  SIMS: A Hybrid Method for Rapid Conformational Analysis , 2013, PloS one.

[66]  G. Chirikjian,et al.  Iterative cluster‐NMA: A tool for generating conformational transitions in proteins , 2009, Proteins.

[67]  Lydia Tapia,et al.  Simulating Protein Motions with Rigidity Analysis , 2007, J. Comput. Biol..

[68]  Dima Kozakov,et al.  Analysis of binding site hot spots on the surface of Ras GTPase. , 2011, Journal of molecular biology.

[69]  Ora Schueler-Furman,et al.  Rapid Sampling of Molecular Motions with Prior Information Constraints , 2009, PLoS Comput. Biol..

[70]  Carla Mattos,et al.  Transformation efficiency of RasQ61 mutants linked to structural features of the switch regions in the presence of Raf. , 2007, Structure.

[71]  Amarda Shehu,et al.  Elucidating the ensemble of functionally-relevant transitions in protein systems with a robotics-inspired method , 2013, BMC Structural Biology.

[72]  I. Bahar,et al.  Normal mode analysis : theory and applications to biological and chemical systems , 2005 .

[73]  T. Siméon,et al.  An NMA‐guided path planning approach for computing large‐amplitude conformational changes in proteins , 2007, Proteins.

[74]  Amarda Shehu,et al.  Probabilistic Search and Energy Guidance for Biased Decoy Sampling in Ab Initio Protein Structure Prediction , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.