Adaptive Markov state model estimation using short reseeding trajectories.

In the last decade, advances in molecular dynamics (MD) and Markov State Model (MSM) methodologies have made possible accurate and efficient estimation of kinetic rates and reactive pathways for complex biomolecular dynamics occurring on slow time scales. A promising approach to enhanced sampling of MSMs is to use "adaptive" methods, in which new MD trajectories are "seeded" preferentially from previously identified states. Here, we investigate the performance of various MSM estimators applied to reseeding trajectory data, for both a simple 1D free energy landscape and mini-protein folding MSMs of WW domain and NTL9(1-39). Our results reveal the practical challenges of reseeding simulations and suggest a simple way to reweight seeding trajectory data to better estimate both thermodynamic and kinetic quantities.

[1]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[2]  M. Gruebele,et al.  Downhill dynamics and the molecular rate of protein folding , 2008 .

[3]  Martin Gruebele,et al.  Engineering a β-sheet protein toward the folding speed limit , 2005 .

[4]  Vincent A Voelz,et al.  A molecular interpretation of 2D IR protein folding experiments with Markov state models. , 2014, Biophysical journal.

[5]  D. Raleigh,et al.  Thermodynamics and kinetics of non-native interactions in protein folding: a single point mutant significantly stabilizes the N-terminal domain of L9 by modulating non-native interactions in the denatured state. , 2004, Journal of molecular biology.

[6]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[7]  Frank Noé,et al.  Statistically optimal analysis of state-discretized trajectory data from multiple thermodynamic states. , 2014, The Journal of chemical physics.

[8]  R. McGibbon,et al.  Variational cross-validation of slow dynamical modes in molecular kinetics. , 2014, The Journal of chemical physics.

[9]  Frank Noé,et al.  PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov Models. , 2015, Journal of chemical theory and computation.

[10]  Hongbin Wan,et al.  A Maximum-Caliber Approach to Predicting Perturbed Folding Kinetics Due to Mutations. , 2016, Journal of chemical theory and computation.

[11]  Lydia E Kavraki,et al.  Quantitative comparison of adaptive sampling methods for protein dynamics. , 2018, The Journal of chemical physics.

[12]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[13]  Yan Zhang,et al.  Structure-function-folding relationship in a WW domain. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Benjamin Trendelkamp-Schroer,et al.  Efficient estimation of rare-event kinetics , 2014, 1409.6439.

[15]  Daniel M Zuckerman,et al.  The "weighted ensemble" path sampling method is statistically exact for a broad class of stochastic processes and binning procedures. , 2008, The Journal of chemical physics.

[16]  Hao Wu,et al.  Projected metastable Markov processes and their estimation with observable operator models. , 2015, The Journal of chemical physics.

[17]  H. Nguyen,et al.  Tuning the free-energy landscape of a WW domain by temperature, mutation, and truncation , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Gerhard Hummer,et al.  Dynamic Histogram Analysis To Determine Free Energies and Rates from Biased Simulations. , 2017, Journal of chemical theory and computation.

[19]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[20]  Diwakar Shukla,et al.  Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes. , 2017, The journal of physical chemistry. B.

[21]  Mohammad M. Sultan,et al.  MSMBuilder: Statistical Models for Biomolecular Dynamics , 2016, bioRxiv.

[22]  Kyle A. Beauchamp,et al.  Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1-39). , 2010, Journal of the American Chemical Society.

[23]  Joshua L Adelman,et al.  WESTPA: an interoperable, highly scalable software package for weighted ensemble simulation and analysis. , 2015, Journal of chemical theory and computation.

[24]  Gerhard Hummer,et al.  Native contacts determine protein folding mechanisms in atomistic simulations , 2013, Proceedings of the National Academy of Sciences.

[25]  Cecilia Clementi,et al.  Markov state models from short non-equilibrium simulations—Analysis and correction of estimation bias , 2017, 1701.01665.

[26]  V. Pande,et al.  Rapid equilibrium sampling initiated from nonequilibrium data , 2009, Proceedings of the National Academy of Sciences.

[27]  D. Raleigh,et al.  Mutational analysis demonstrates that specific electrostatic interactions can play a key role in the denatured state ensemble of proteins. , 2005, Journal of molecular biology.

[28]  Diwakar Shukla,et al.  Enhanced unbiased sampling of protein dynamics using evolutionary coupling information , 2017, Scientific Reports.

[29]  Vincent A Voelz,et al.  Surprisal Metrics for Quantifying Perturbed Conformational Dynamics in Markov State Models. , 2014, Journal of chemical theory and computation.

[30]  Vijay S Pande,et al.  Simple few-state models reveal hidden complexity in protein folding , 2012, Proceedings of the National Academy of Sciences.

[31]  Jia-Cherng Horng,et al.  Rapid Cooperative Two-state Folding of a Miniature α–β Protein and Design of a Thermostable Variant , 2003 .

[32]  Gregory R Bowman,et al.  FAST Conformational Searches by Balancing Exploration/Exploitation Trade-Offs. , 2015, Journal of chemical theory and computation.

[33]  M. Gruebele,et al.  Computational design and experimental testing of the fastest-folding β-sheet protein. , 2011, Journal of molecular biology.

[34]  Herbert Jaeger,et al.  Observable Operator Models for Discrete Stochastic Time Series , 2000, Neural Computation.

[35]  T. Zhu,et al.  A test of AMBER force fields in predicting the secondary structure of α-helical and β-hairpin peptides , 2017 .

[36]  Samuel D. Lotz,et al.  Unbiased Molecular Dynamics of 11 min Timescale Drug Unbinding Reveals Transition State Stabilizing Interactions. , 2018, Journal of the American Chemical Society.

[37]  Amelia A. Fuller,et al.  An experimental survey of the transition between two-state and downhill protein folding scenarios , 2008, Proceedings of the National Academy of Sciences.

[38]  Hao Wu,et al.  Multiensemble Markov models of molecular thermodynamics and kinetics , 2016, Proceedings of the National Academy of Sciences.

[39]  Samuel D. Lotz,et al.  Predicting ligand binding affinity using on- and off-rates for the SAMPL6 SAMPLing challenge , 2018, Journal of Computer-Aided Molecular Design.

[40]  H. Nguyen,et al.  High-Resolution Mapping of the Folding Transition State of a WW Domain. , 2016, Journal of molecular biology.

[41]  D. Raleigh,et al.  Energetically significant networks of coupled interactions within an unfolded protein , 2014, Proceedings of the National Academy of Sciences.

[42]  Gregory R Bowman,et al.  Choice of Adaptive Sampling Strategy Impacts State Discovery, Transition Probabilities, and the Apparent Mechanism of Conformational Changes. , 2018, Journal of chemical theory and computation.

[43]  Samuel D. Lotz,et al.  Ligand Release Pathways Obtained with WExplore: Residence Times and Mechanisms. , 2016, The journal of physical chemistry. B.

[44]  C. Schütte,et al.  Supplementary Information for “ Constructing the Equilibrium Ensemble of Folding Pathways from Short Off-Equilibrium Simulations ” , 2009 .

[45]  Frank Noé,et al.  Markov state models of biomolecular conformational dynamics. , 2014, Current opinion in structural biology.

[46]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[47]  S. Doerr,et al.  On-the-Fly Learning and Sampling of Ligand Binding by High-Throughput Molecular Simulations. , 2014, Journal of chemical theory and computation.