Maintaining and Enhancing Diversity of Sampled Protein Conformations in Robotics-Inspired Methods

The ability to efficiently sample structurally diverse protein conformations allows one to gain a high-level view of a protein's energy landscape. Algorithms from robot motion planning have been used for conformational sampling, and several of these algorithms promote diversity by keeping track of "coverage" in conformational space based on the local sampling density. However, large proteins present special challenges. In particular, larger systems require running many concurrent instances of these algorithms, but these algorithms can quickly become memory intensive because they typically keep previously sampled conformations in memory to maintain coverage estimates. In addition, robotics-inspired algorithms depend on defining useful perturbation strategies for exploring the conformational space, which is a difficult task for large proteins because such systems are typically more constrained and exhibit complex motions. In this article, we introduce two methodologies for maintaining and enhancing diversity in robotics-inspired conformational sampling. The first method addresses algorithms based on coverage estimates and leverages the use of a low-dimensional projection to define a global coverage grid that maintains coverage across concurrent runs of sampling. The second method is an automatic definition of a perturbation strategy through readily available flexibility information derived from B-factors, secondary structure, and rigidity analysis. Our results show a significant increase in the diversity of the conformations sampled for proteins consisting of up to 500 residues when applied to a specific robotics-inspired algorithm for conformational sampling. The methodologies presented in this article may be vital components for the scalability of robotics-inspired approaches.

[1]  Henry van den Bedem,et al.  Nullspace Sampling with Holonomic Constraints Reveals Molecular Mechanisms of Protein Gαs , 2015, PLoS Comput. Biol..

[2]  Lydia Tapia,et al.  Simulating Protein Motions with Rigidity Analysis , 2007, J. Comput. Biol..

[3]  Thierry Siméon,et al.  Motion planning algorithms for molecular simulations: A survey , 2012, Comput. Sci. Rev..

[4]  Didier Devaurs,et al.  Coarse-Grained Conformational Sampling of Protein Structure Improves the Fit to Experimental Hydrogen-Exchange Data , 2017, Front. Mol. Biosci..

[5]  Ora Schueler-Furman,et al.  Rapid Sampling of Molecular Motions with Prior Information Constraints , 2009, PLoS Comput. Biol..

[6]  Erion Plaku,et al.  A Survey of Computational Treatments of Biomolecules by Robotics-Inspired Methods Modeling Equilibrium Structure and Dynamic , 2016, J. Artif. Intell. Res..

[7]  Chunli Yan,et al.  Integrative Modeling of Macromolecular Assemblies from Low to Near-Atomic Resolution , 2015, Computational and structural biotechnology journal.

[8]  F A Quiocho,et al.  Extensive features of tight oligosaccharide binding revealed in high-resolution structures of the maltodextrin transport/chemosensory receptor. , 1997, Structure.

[9]  James Andrew McCammon,et al.  Conformational Sampling and Nucleotide-Dependent Transitions of the GroEL Subunit Probed by Unbiased Molecular Dynamics Simulations , 2011, PLoS Comput. Biol..

[10]  T. Siméon,et al.  Modeling protein conformational transitions by a combination of coarse-grained normal mode analysis and robotics-inspired methods , 2013, BMC Structural Biology.

[11]  Jean-Claude Latombe,et al.  Efficient Algorithms to Explore Conformation Spaces of Flexible Protein Loops , 2008, TCBB.

[12]  Didier Devaurs,et al.  Defining Low-Dimensional Projections to Guide Protein Conformational Sampling , 2017, J. Comput. Biol..

[13]  Douglas R. Powell Review of X-Ray Crystallography , 2016 .

[14]  Jean-Claude Latombe,et al.  Efficient Algorithms to Explore Conformation Spaces of Flexible Protein Loops , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[15]  Amarda Shehu,et al.  A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes , 2015, J. Comput. Biol..

[16]  Lydia E. Kavraki,et al.  Kinodynamic Motion Planning by Interior-Exterior Cell Exploration , 2008, WAFR.

[17]  Lydia E. Kavraki,et al.  Modeling Structures and Motions of Loops in Protein Molecules , 2012, Entropy.

[18]  Nathalie Reuter,et al.  A dynamic model of long‐range conformational adaptations triggered by nucleotide binding in GroEL‐GroES , 2012, Proteins.

[19]  Ron Alterovitz,et al.  Parallel sampling-based motion planning with superlinear speedup , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[20]  Lydia E Kavraki,et al.  Computational models of protein kinematics and dynamics: beyond simulation. , 2012, Annual review of analytical chemistry.

[21]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[22]  Zheng Yuan,et al.  Prediction of protein B‐factor profiles , 2005, Proteins.

[23]  Erik Andersson,et al.  Assessing how multiple mutations affect protein stability using rigid cluster size distributions , 2016, 2016 IEEE 6th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS).

[24]  L. Kavraki,et al.  SIMS: A Hybrid Method for Rapid Conformational Analysis , 2013, PloS one.

[25]  Amelie Stein,et al.  Improvements to Robotics-Inspired Conformational Sampling in Rosetta , 2013, PloS one.

[26]  Heather A Carlson,et al.  Protein flexibility is an important component of structure-based drug discovery. , 2002, Current pharmaceutical design.

[27]  Didier Devaurs,et al.  Parallelizing RRT on Large-Scale Distributed-Memory Architectures , 2013, IEEE Transactions on Robotics.

[28]  Kostas E. Bekris,et al.  Sampling-based roadmap of trees for parallel motion planning , 2005, IEEE Transactions on Robotics.

[29]  Jack D. Dunitz,et al.  Atomic Dispacement Parameter Nomenclature. Report of a Subcommittee on Atomic Displacement Parameter Nomenclature , 1996 .

[30]  Rajeev Motwani,et al.  Path Planning in Expansive Configuration Spaces , 1999, Int. J. Comput. Geom. Appl..

[31]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[32]  Sherry L. Mowbray,et al.  Probing protein-protein interactions. The ribose-binding protein in bacterial transport and chemotaxis. , 1995 .

[33]  Juan Cortés,et al.  Randomized tree construction algorithm to explore energy landscapes , 2011, J. Comput. Chem..

[34]  Amarda Shehu,et al.  Guiding the Search for Native-like Protein Conformations with an Ab-initio Tree-based Exploration , 2010, Int. J. Robotics Res..

[35]  Yang Li,et al.  KINARI-Web: a server for protein rigidity analysis , 2011, Nucleic Acids Res..

[36]  Dominique Marion,et al.  An Introduction to Biological NMR Spectroscopy* , 2013, Molecular & Cellular Proteomics.

[37]  G Marius Clore,et al.  Transient, sparsely populated compact states of apo and calcium-loaded calmodulin probed by paramagnetic relaxation enhancement: interplay of conformational selection and induced fit. , 2011, Journal of the American Chemical Society.

[38]  S L Mowbray,et al.  Multiple open forms of ribose-binding protein trace the path of its conformational change. , 1998, Journal of molecular biology.

[39]  Audrey Lee-St. John,et al.  Pebble game algorithms and sparse graphs , 2007, Discret. Math..

[40]  Lydia E. Kavraki,et al.  Tracing conformational changes in proteins , 2009, BIBM 2009.

[41]  Nancy M. Amato,et al.  Using Motion Planning to Map Protein Folding Landscapes and Analyze Folding Kinetics of Known Native Structures , 2003, J. Comput. Biol..

[42]  E. Paquet,et al.  Molecular Dynamics, Monte Carlo Simulations, and Langevin Dynamics: A Computational Review , 2015, BioMed research international.

[43]  Andreas Vitalis,et al.  Methods for Monte Carlo simulations of biomacromolecules. , 2009, Annual reports in computational chemistry.

[44]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[45]  Lydia Tapia,et al.  A Motion Planning Approach to Studying Molecular Motions , 2010, Commun. Inf. Syst..

[46]  R. Nussinov,et al.  Protein Ensembles: How Does Nature Harness Thermodynamic Fluctuations for Life? The Diverse Functional Roles of Conformational Ensembles in the Cell. , 2016, Chemical reviews.

[47]  Didier Devaurs,et al.  Native State of Complement Protein C3d Analysed via Hydrogen Exchange and Conformational Sampling. , 2018, International journal of computational biology and drug design.

[48]  Alexander Wlodawer,et al.  Structures of the Complexes of a Potent Anti-HIV Protein Cyanovirin-N and High Mannose Oligosaccharides* , 2002, The Journal of Biological Chemistry.

[49]  Nurit Haspel,et al.  Multi-Resolution Rigidity-Based Sampling of Protein Conformational Paths , 2013, BCB.

[50]  Lydia E. Kavraki,et al.  The Open Motion Planning Library , 2012, IEEE Robotics & Automation Magazine.

[51]  S L Mowbray,et al.  Probing protein-protein interactions. The ribose-binding protein in bacterial transport and chemotaxis. , 1994, The Journal of biological chemistry.