Sample-Based Models of Protein Energy Landscapes and Slow Structural Rearrangements

Proteins often undergo slow structural rearrangements that involve several angstroms and surpass the nanosecond timescale. These spatiotemporal scales challenge physics-based simulations and open the way to sample-based models of structural dynamics. This article improves an understanding of current capabilities and limitations of sample-based models of dynamics. Borrowing from widely used concepts in evolutionary computation, this article introduces two conflicting aspects of sampling capability and quantifies them via statistical (and graphical) analysis tools. This allows not only conducting a principled comparison of different sample-based algorithms but also understanding which algorithmic ingredients to use as knobs via which to control sampling and, in turn, the accuracy and detail of modeled structural rearrangements. We demonstrate the latter by proposing two powerful variants of a recently published sample-based algorithm. We believe that this work will advance the adoption of sample-based models as reliable tools for modeling slow protein structural rearrangements.

[1]  Amarda Shehu,et al.  Interleaving Global and Local Search for Protein Motion Computation , 2015, ISBRA.

[2]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[3]  Erion Plaku,et al.  Computing transition paths in multiple-basin proteins with a probabilistic roadmap algorithm guided by structure data , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[4]  Amarda Shehu,et al.  A General, Adaptive, Roadmap-Based Algorithm for Protein Motion Computation , 2016, IEEE Transactions on NanoBioscience.

[5]  C. Simmerling,et al.  ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. , 2015, Journal of chemical theory and computation.

[6]  Lydia Tapia,et al.  Kinetics analysis methods for approximate folding landscapes , 2007, ISMB/ECCB.

[7]  Juan Cortés,et al.  Randomized tree construction algorithm to explore energy landscapes , 2011, J. Comput. Chem..

[8]  Rommie E Amaro,et al.  Editorial overview: Theory and simulation: Tools for solving the insolvable. , 2014, Current opinion in structural biology.

[9]  Didier Devaurs,et al.  Characterizing Energy Landscapes of Peptides Using a Combination of Stochastic Algorithms , 2015, IEEE Transactions on NanoBioscience.

[10]  Ruth Nussinov,et al.  A second molecular biology revolution? The energy landscapes of biomolecular function. , 2014, Physical chemistry chemical physics : PCCP.

[11]  Lydia E Kavraki,et al.  Computational models of protein kinematics and dynamics: beyond simulation. , 2012, Annual review of analytical chemistry.

[12]  Thierry Siméon,et al.  Transition-based RRT for path planning in continuous cost spaces , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Amarda Shehu,et al.  Probabilistic Search and Energy Guidance for Biased Decoy Sampling in Ab Initio Protein Structure Prediction , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Peter L. Brooks,et al.  Visualizing data , 1997 .

[15]  Erion Plaku,et al.  Structure-Guided Protein Transition Modeling with a Probabilistic Roadmap Algorithm , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  T. Siméon,et al.  Modeling protein conformational transitions by a combination of coarse-grained normal mode analysis and robotics-inspired methods , 2013, BMC Structural Biology.

[17]  Howie Choset,et al.  Principles of Robot Motion: Theory, Algorithms, and Implementation ERRATA!!!! 1 , 2007 .

[18]  Amarda Shehu,et al.  Conformational Search for the Protein Native State , 2010 .

[19]  Jean-Claude Latombe,et al.  Using Stochastic Roadmap Simulation to Predict Experimental Quantities in Protein Folding Kinetics: Folding Rates and Phi-Values , 2007, J. Comput. Biol..

[20]  M. Karplus,et al.  Conjugate peak refinement: an algorithm for finding reaction paths and accurate transition states in systems with many degrees of freedom , 1992 .

[21]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[22]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[23]  De-Shuang Huang,et al.  Guest Editorial for Special Section on the 10th International Conference on Intelligent Computing (ICIC) , 2016, TCBB.

[24]  Amarda Shehu,et al.  A stochastic roadmap method to model protein structural transitions , 2015, Robotica.

[25]  R. Nussinov,et al.  The structural basis for cancer treatment decisions , 2014, Oncotarget.

[26]  Roland L. Dunbrack,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4 , 2022 .

[27]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[28]  Lydia E. Kavraki,et al.  Tracing conformational changes in proteins , 2009, BIBM 2009.

[29]  C. Mattos,et al.  Allosteric modulation of Ras positions Q61 for a direct role in catalysis , 2010, Proceedings of the National Academy of Sciences.

[30]  Daniel B. Carr,et al.  Looking at large data sets using binned data plots , 1992 .

[31]  Ruth Nussinov,et al.  Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm , 2015, PLoS Comput. Biol..

[32]  Amarda Shehu,et al.  A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes , 2015, J. Comput. Biol..

[33]  Amarda Shehu,et al.  Elucidating the ensemble of functionally-relevant transitions in protein systems with a robotics-inspired method , 2013, BMC Structural Biology.

[34]  Guang Song,et al.  Protein folding by motion planning , 2005, Physical biology.

[35]  Jean-Claude Latombe,et al.  A Motion Planning Approach to Flexible Ligand Binding , 1999, ISMB.

[36]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[37]  Carla Mattos,et al.  The allosteric switch and conformational states in Ras GTPase affected by small molecules. , 2013, The Enzymes.

[38]  Nancy M. Amato,et al.  Using Motion Planning to Map Protein Folding Landscapes and Analyze Folding Kinetics of Known Native Structures , 2003, J. Comput. Biol..

[39]  Lydia Tapia,et al.  A Motion Planning Approach to Studying Molecular Motions , 2010, Commun. Inf. Syst..

[40]  Amarda Shehu,et al.  Basin Hopping as a General and Versatile Optimization Framework for the Characterization of Biological Macromolecules , 2012, Adv. Artif. Intell..

[41]  Shawna L. Thomas,et al.  Simulating RNA folding kinetics on approximated energy landscapes. , 2008, Journal of molecular biology.

[42]  Erion Plaku,et al.  Sample-based Models of Protein Structural Transitions , 2016, BCB.

[43]  Daniel Russel,et al.  The structural dynamics of macromolecular processes. , 2009, Current opinion in cell biology.

[44]  Lydia Tapia,et al.  Simulating Protein Motions with Rigidity Analysis , 2007, J. Comput. Biol..

[45]  Erion Plaku,et al.  A Survey of Computational Treatments of Biomolecules by Robotics-Inspired Methods Modeling Equilibrium Structure and Dynamic , 2016, J. Artif. Intell. Res..

[46]  M. Karplus,et al.  The topology of multidimensional potential energy surfaces: Theory and application to peptide structure and kinetics , 1997 .

[47]  Brian P. Dawkins Investigating the Geometry of a p-Dimensional Data Set , 1995 .