Accurate Estimation of Protein Folding and Unfolding Times: Beyond Markov State Models

Because standard molecular dynamics (MD) simulations are unable to access time scales of interest in complex biomolecular systems, it is common to “stitch together” information from multiple shorter trajectories using approximate Markov state model (MSM) analysis. However, MSMs may require significant tuning and can yield biased results. Here, by analyzing some of the longest protein MD data sets available (>100 μs per protein), we show that estimators constructed based on exact non-Markovian (NM) principles can yield significantly improved mean first-passage times (MFPTs) for protein folding and unfolding. In some cases, MSM bias of more than an order of magnitude can be corrected when identical trajectory data are reanalyzed by non-Markovian approaches. The NM analysis includes “history” information, higher order time correlations compared to MSMs, that is available in every MD trajectory. The NM strategy is insensitive to fine details of the states used and works well when a fine time-discretization (i.e., small “lag time”) is used.

[1]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[2]  Peter G Bolhuis,et al.  Rate constants for diffusive processes by partial path sampling. , 2004, The Journal of chemical physics.

[3]  Exact rate calculations by trajectory parallelization and tilting. , 2009, The Journal of chemical physics.

[4]  Frank Noé,et al.  An Introduction to Markov State Models and Their Application to Long Timescale Molecular Simulation , 2014, Advances in Experimental Medicine and Biology.

[5]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[6]  K. Lindorff-Larsen,et al.  How robust are protein folding simulations with respect to force field parameterization? , 2011, Biophysical journal.

[7]  A. Laio,et al.  Equilibrium free energies from nonequilibrium metadynamics. , 2006, Physical Review Letters.

[8]  J. Adelman,et al.  Simulating Current-Voltage Relationships for a Narrow Ion Channel Using the Weighted Ensemble Method. , 2015, Journal of chemical theory and computation.

[9]  P. R. ten Wolde,et al.  Sampling rare switching events in biochemical networks. , 2004, Physical review letters.

[10]  Vijay S Pande,et al.  Enhanced modeling via network theory: Adaptive sampling of Markov state models. , 2010, Journal of chemical theory and computation.

[11]  Ronald M. Levy,et al.  Conformational populations of ligand‐sized molecules by replica exchange molecular dynamics and temperature reweighting , 2009, J. Comput. Chem..

[12]  D. Frenkel,et al.  Computing stationary distributions in equilibrium and nonequilibrium systems with forward flux sampling. , 2007, The Journal of chemical physics.

[13]  Peter G. Bolhuis,et al.  A novel path sampling method for the calculation of rate constants , 2003 .

[14]  Frank Noé,et al.  Markov state models based on milestoning. , 2011, The Journal of chemical physics.

[15]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[16]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.

[17]  C. Schütte,et al.  Supplementary Information for “ Constructing the Equilibrium Ensemble of Folding Pathways from Short Off-Equilibrium Simulations ” , 2009 .

[18]  Daniel M Zuckerman,et al.  Estimating first‐passage time distributions from weighted ensemble simulations and non‐Markovian analyses , 2016, Protein science : a publication of the Protein Society.

[19]  K. Hukushima,et al.  Exchange Monte Carlo Method and Application to Spin Glass Simulations , 1995, cond-mat/9512035.

[20]  R. Elber,et al.  Computing time scales from reaction coordinates by milestoning. , 2004, The Journal of chemical physics.

[21]  Ronald M Levy,et al.  How kinetics within the unfolded state affects protein folding: an analysis based on markov state models and an ultra-long MD trajectory. , 2013, The journal of physical chemistry. B.

[22]  Bin W. Zhang,et al.  Efficient and verified simulation of a path ensemble for conformational change in a united-residue model of calmodulin , 2007, Proceedings of the National Academy of Sciences.

[23]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  M. Parrinello,et al.  From metadynamics to dynamics. , 2013, Physical review letters.

[25]  Peter G Bolhuis,et al.  Simultaneous computation of free energies and kinetics of rare events. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  P. Hänggi,et al.  Universal equivalence of mean first-passage time and Kramers rate. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[27]  Thomas J Lane,et al.  MSMBuilder2: Modeling Conformational Dynamics at the Picosecond to Millisecond Scale. , 2011, Journal of chemical theory and computation.

[28]  Adrian E Roitberg,et al.  Constant pH replica exchange molecular dynamics in biomolecules using a discrete protonation model. , 2010, Journal of chemical theory and computation.

[29]  Alessandra Magistrato,et al.  Dissociation of minor groove binders from DNA: insights from metadynamics simulations , 2008, Nucleic acids research.

[30]  A. Dinner,et al.  Separating forward and backward pathways in nonequilibrium umbrella sampling. , 2009, The Journal of chemical physics.

[31]  Jeremy C. Smith,et al.  Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. , 2007, The Journal of chemical physics.

[32]  L. Chong,et al.  Simultaneous Computation of Dynamical and Equilibrium Information Using a Weighted Ensemble of Trajectories , 2012, Journal of chemical theory and computation.

[33]  Daniel M. Zuckerman,et al.  Transition events in butane simulations: Similarities across models , 2002 .

[34]  Rommie E. Amaro,et al.  Application of Molecular-Dynamics Based Markov State Models to Functional Proteins , 2014, Journal of chemical theory and computation.

[35]  Ulrich H E Hansmann,et al.  Dynamics and optimal number of replicas in parallel tempering simulations. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Thomas J Lane,et al.  MDTraj: a modern, open library for the analysis of molecular dynamics trajectories , 2014, bioRxiv.

[37]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[38]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.