Computing transition paths in multiple-basin proteins with a probabilistic roadmap algorithm guided by structure data

Proteins are macromolecules in perpetual motion, switching between structural states to modulate their function. A detailed characterization of the precise yet complex relationship between protein structure, dynamics, and function requires elucidating transitions between functionally-relevant states. Doing so challenges both wet and dry laboratories, as protein dynamics involves disparate temporal scales. In this paper we present a novel, sampling-based algorithm to compute transition paths. The algorithm exploits two main ideas. First, it leverages known structures to initialize its search and define a reduced conformation space for rapid sampling. This is key to address the insufficient sampling issue suffered by sampling-based algorithms. Second, the algorithm embeds samples in a nearest-neighbor graph where transition paths can be efficiently computed via queries. The algorithm adapts the probabilistic roadmap framework that is popular in robot motion planning. In addition to efficiently computing lowest-cost paths between any given structures, the algorithm allows investigating hypotheses regarding the order of experimentally-known structures in a transition event. This novel contribution is likely to open up new venues of research. Detailed analysis is presented on multiple-basin proteins of relevance to human disease. Multiscaling and the AMBER ff12SB force field are used to obtain energetically-credible paths at atomistic detail.

[1]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[2]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[3]  Dominik Gront,et al.  Backbone building from quadrilaterals: A fast and accurate algorithm for protein backbone reconstruction from alpha carbon coordinates , 2007, J. Comput. Chem..

[4]  Amarda Shehu,et al.  A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes , 2015, J. Comput. Biol..

[5]  Thierry Siméon,et al.  Sampling-Based Path Planning on Configuration-Space Costmaps , 2010, IEEE Transactions on Robotics.

[6]  TWO-WEEK Loan COpy,et al.  University of California , 1886, The American journal of dental science.

[7]  Daniel Russel,et al.  The structural dynamics of macromolecular processes. , 2009, Current opinion in cell biology.

[8]  Yung Doug Suh,et al.  Single-molecule surface-enhanced Raman spectroscopy: a perspective on the current status. , 2013, Physical chemistry chemical physics : PCCP.

[9]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[10]  A. D. McLachlan,et al.  A mathematical procedure for superimposing atomic coordinates of proteins , 1972 .

[11]  Howie Choset,et al.  Principles of Robot Motion: Theory, Algorithms, and Implementation ERRATA!!!! 1 , 2007 .

[12]  Rommie E Amaro,et al.  Editorial overview: Theory and simulation: Tools for solving the insolvable. , 2014, Current opinion in structural biology.

[13]  Jack Dongarra,et al.  LAPACK: a portable linear algebra library for high-performance computers , 1990, SC.

[14]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[15]  Amarda Shehu,et al.  Interleaving Global and Local Search for Protein Motion Computation , 2015, ISBRA.

[16]  J. Straub,et al.  The MaxFlux algorithm for calculating variationally optimized reaction paths for conformational transitions in many body systems at finite temperature , 1997 .

[17]  Michele Vendruscolo,et al.  A Coupled Equilibrium Shift Mechanism in Calmodulin-Mediated Signal Transduction , 2008, Structure.

[18]  Misha V Golynskiy,et al.  Rational design of FRET sensor proteins based on mutually exclusive domain interactions. , 2013, Biochemical Society transactions.

[19]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[20]  Chung F Wong,et al.  Protein simulation and drug design. , 2003, Advances in protein chemistry.

[21]  Jochen S. Hub,et al.  Detection of Functional Modes in Protein Dynamics , 2010 .

[22]  Kenneth A. De Jong,et al.  Mapping Multiple Minima in Protein Energy Landscapes with Evolutionary Algorithms , 2015, GECCO.

[23]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[24]  E. Polak Introduction to linear and nonlinear programming , 1973 .