Optimal use of data in parallel tempering simulations for the construction of discrete-state Markov models of biomolecular dynamics.

Parallel tempering (PT) molecular dynamics simulations have been extensively investigated as a means of efficient sampling of the configurations of biomolecular systems. Recent work has demonstrated how the short physical trajectories generated in PT simulations of biomolecules can be used to construct the Markov models describing biomolecular dynamics at each simulated temperature. While this approach describes the temperature-dependent kinetics, it does not make optimal use of all available PT data, instead estimating the rates at a given temperature using only data from that temperature. This can be problematic, as some relevant transitions or states may not be sufficiently sampled at the temperature of interest, but might be readily sampled at nearby temperatures. Further, the comparison of temperature-dependent properties can suffer from the false assumption that data collected from different temperatures are uncorrelated. We propose here a strategy in which, by a simple modification of the PT protocol, the harvested trajectories can be reweighted, permitting data from all temperatures to contribute to the estimated kinetic model. The method reduces the statistical uncertainty in the kinetic model relative to the single temperature approach and provides estimates of transition probabilities even for transitions not observed at the temperature of interest. Further, the method allows the kinetics to be estimated at temperatures other than those at which simulations were run. We illustrate this method by applying it to the generation of a Markov model of the conformational dynamics of the solvated terminally blocked alanine peptide.

[1]  John D. Chodera,et al.  Long-Time Protein Folding Dynamics from Short-Time Molecular Dynamics Simulations , 2006, Multiscale Model. Simul..

[2]  Jimmie D. Doll,et al.  Dynamical aspects of precursor state kinetics , 1981 .

[3]  Sören Doose,et al.  A microscopic view of miniprotein folding: enhanced folding efficiency through formation of an intermediate. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[4]  R. A. Goldbeck,et al.  Multiple pathways on a protein-folding energy landscape: kinetic evidence. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[5]  L. Kay,et al.  Intrinsic dynamics of an enzyme underlies catalysis , 2005, Nature.

[6]  L. Verlet Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules , 1967 .

[7]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 1. Theory , 2004 .

[8]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[9]  V. Pande,et al.  Calculation of the distribution of eigenvalues and eigenvectors in Markovian state models for molecular dynamics. , 2007, The Journal of chemical physics.

[10]  H. C. Andersen Molecular dynamics simulations at constant pressure and/or temperature , 1980 .

[11]  Michael R. Shirts,et al.  Dynamical reweighting: improved estimates of dynamical properties from simulations at multiple temperatures. , 2011, The Journal of chemical physics.

[12]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[13]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[14]  Jason C. Crane,et al.  The folding mechanism of a -sheet: the WW domain1 , 2001 .

[15]  Frank Noé,et al.  On the Approximation Quality of Markov State Models , 2010, Multiscale Model. Simul..

[16]  Anthony K. Felts,et al.  Temperature weighted histogram analysis method, replica exchange, and transition paths. , 2005, The journal of physical chemistry. B.

[17]  A Mitsutake,et al.  Generalized-ensemble algorithms for molecular simulations of biopolymers. , 2000, Biopolymers.

[18]  Michael R. Shirts,et al.  Statistically optimal analysis of samples from multiple equilibrium states. , 2008, The Journal of chemical physics.

[19]  G. Hummer,et al.  Coarse master equations for peptide folding dynamics. , 2008, The journal of physical chemistry. B.

[20]  K. Hukushima,et al.  Exchange Monte Carlo Method and Application to Spin Glass Simulations , 1995, cond-mat/9512035.

[21]  Christian Bartels,et al.  Multidimensional adaptive umbrella sampling: Applications to main chain and side chain peptide conformations , 1997 .

[22]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 2. Example Applications to Alanine Dipeptide and a β-Hairpin Peptide† , 2004 .

[23]  Yawen Bai,et al.  A protein folding pathway with multiple folding intermediates at atomic resolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[24]  X. Xie,et al.  Observation of a power-law memory kernel for fluctuations within a single protein molecule. , 2005, Physical review letters.

[25]  A Caflisch,et al.  ETNA: equilibrium transitions network and Arrhenius equation for extracting folding kinetics from REMD simulations. , 2009, The journal of physical chemistry. B.

[26]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.

[27]  F. Noé Probability distributions of molecular observables computed from Markov models. , 2008, The Journal of chemical physics.

[28]  L. Verlet Computer "Experiments" on Classical Fluids. II. Equilibrium Correlation Functions , 1968 .

[29]  V. Pande,et al.  Error analysis and efficient sampling in Markovian state models for molecular dynamics. , 2005, The Journal of chemical physics.

[30]  C. Bartels Analyzing biased Monte Carlo and molecular dynamics simulations , 2000 .

[31]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[32]  C M Dobson,et al.  Fast and slow tracks in lysozyme folding: insight into the role of domains in the folding process. , 1997, Journal of molecular biology.

[33]  Arthur F. Voter,et al.  Dynamical corrections to transition state theory for multistate systems: Surface self-diffusion in the rare-event regime , 1985 .

[34]  Michael W Deem,et al.  Parallel tempering: theory, applications, and new perspectives. , 2005, Physical chemistry chemical physics : PCCP.

[35]  Alan M. Ferrenberg,et al.  Optimized Monte Carlo data analysis. , 1989, Physical Review Letters.

[36]  Gerhard Hummer,et al.  Peptide folding kinetics from replica exchange molecular dynamics. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Jeremy C. Smith,et al.  Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. , 2007, The Journal of chemical physics.

[38]  T. Pan,et al.  A large collapsed-state RNA can exhibit simple exponential single-molecule dynamics. , 2008, Journal of molecular biology.

[39]  D. A. Bosco,et al.  Enzyme Dynamics During Catalysis , 2002, Science.

[40]  Vijay S Pande,et al.  Enhanced modeling via network theory: Adaptive sampling of Markov state models. , 2010, Journal of chemical theory and computation.

[41]  Matthias Rief,et al.  Single-molecule folding. , 2003, Current opinion in structural biology.

[42]  Ken A Dill,et al.  Use of the Weighted Histogram Analysis Method for the Analysis of Simulated and Parallel Tempering Simulations. , 2007, Journal of chemical theory and computation.

[43]  A. Khinchin Mathematical foundations of statistical mechanics , 1949 .