From molecular energy landscapes to equilibrium dynamics via landscape analysis and markov state models

Molecular dynamics (MD) simulation software allows probing the equilibrium structural dynamics of a molecule of interest, revealing how a molecule navigates its structure space one structure at a time. To obtain a broader view of dynamics, typically one needs to launch many such simulations, obtaining many trajectories. A summarization of the equilibrium dynamics requires integrating the information in the various trajectories, and Markov State Models (MSM) are increasingly being used for this task. At its core, the task involves organizing the structures accessed in simulation into structural states, and then constructing a transition probability matrix revealing the transitions between states. While now considered a mature technology and widely used to summarize equilibrium dynamics, the underlying computational process in the construction of an MSM ignores energetics even though the transition of a molecule between two nearby structures in an MD trajectory is governed by the corresponding energies. In this paper, we connect theory with simulation and analysis of equilibrium dynamics. A molecule navigates the energy landscape underlying the structure space. The structural states that are identified via off-the-shelf clustering algorithms need to be connected to thermodynamically-stable and semi-stable (macro)states among which transitions can then be quantified. Leveraging recent developments in the analysis of energy landscapes that identify basins in the landscape, we evaluate the hypothesis that basins, directly tied to stable and semi-stable states, lead to better models of dynamics. Our analysis indicates that basins lead to MSMs of better quality and thus can be useful to further advance this widely-used technology for summarization of molecular equilibrium dynamics.

[1]  Diego Prada-Gracia,et al.  Exploring the Free Energy Landscape: From Dynamics to Networks and Back , 2009, PLoS Comput. Biol..

[2]  F. Noé,et al.  Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models , 2015, Nature Communications.

[3]  S Sriadhi,et al.  Comparison analysis of K-Means and K-Medoid with Ecluidience Distance Algorithm, Chanberra Distance, and Chebyshev Distance for Big Data Clustering , 2018, IOP Conference Series: Materials Science and Engineering.

[4]  Ruth Nussinov,et al.  Computational Methods for Exploration and Analysis of Macromolecular Structure and Dynamics , 2015, PLoS Comput. Biol..

[5]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[6]  Rommie E. Amaro,et al.  Structural basis for ligand modulation of the CCR2 conformational landscape , 2018, Proceedings of the National Academy of Sciences.

[7]  Ruth Nussinov,et al.  A second molecular biology revolution? The energy landscapes of biomolecular function. , 2014, Physical chemistry chemical physics : PCCP.

[8]  Nasrin Akhter,et al.  From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction , 2018, Molecules.

[9]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[10]  Frank Noé,et al.  A Variational Approach to Modeling Slow Processes in Stochastic Dynamical Systems , 2012, Multiscale Model. Simul..

[11]  F. Noé,et al.  Complete protein–protein association kinetics in atomic detail revealed by molecular dynamics simulations and Markov modelling , 2017, Nature Chemistry.

[12]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[13]  V. Pande,et al.  Markov State Models: From an Art to a Science. , 2018, Journal of the American Chemical Society.

[14]  Rommie E. Amaro,et al.  Application of Molecular-Dynamics Based Markov State Models to Functional Proteins , 2014, Journal of chemical theory and computation.

[15]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[16]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[17]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[18]  Cesar H. Comin,et al.  Clustering algorithms: A comparative approach , 2016, PloS one.

[19]  J. Onuchic,et al.  Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations , 2006, Proceedings of the National Academy of Sciences.

[20]  Vijay S. Pande,et al.  Everything you wanted to know about Markov State Models but were afraid to ask. , 2010, Methods.

[21]  Frank Noé,et al.  PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov Models. , 2015, Journal of chemical theory and computation.

[22]  Frank Noé,et al.  Markov state models of biomolecular conformational dynamics. , 2014, Current opinion in structural biology.

[23]  M. Karplus,et al.  Dynamics of folded proteins , 1977, Nature.

[24]  Frank Noé,et al.  EMMA: A Software Package for Markov Model Building and Analysis. , 2012, Journal of chemical theory and computation.

[25]  Mohammad M. Sultan,et al.  MSMBuilder: Statistical Models for Biomolecular Dynamics , 2016, bioRxiv.

[26]  Amarda Shehu,et al.  Learning Organizations of Protein Energy Landscapes: An Application on Decoy Selection in Template-Free Protein Structure Prediction. , 2019, Methods in molecular biology.

[27]  A. D. McLachlan,et al.  A mathematical procedure for superimposing atomic coordinates of proteins , 1972 .

[28]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[29]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[30]  Nasrin Akhter,et al.  Community Detection for Decoy Selection in Template-free Protein Structure Prediction , 2018, BCB.

[31]  Nasrin Akhter,et al.  An Energy Landscape Treatment of Decoy Selection in Template-Free Protein Structure Prediction , 2018, Comput..

[32]  Pablo Moreno-Ger,et al.  Comparison of Clustering Algorithms for Learning Analytics with Educational Datasets , 2018, Int. J. Interact. Multim. Artif. Intell..