Girsanov reweighting for path ensembles and Markov state models.

The sensitivity of molecular dynamics on changes in the potential energy function plays an important role in understanding the dynamics and function of complex molecules. We present a method to obtain path ensemble averages of a perturbed dynamics from a set of paths generated by a reference dynamics. It is based on the concept of path probability measure and the Girsanov theorem, a result from stochastic analysis to estimate a change of measure of a path ensemble. Since Markov state models (MSMs) of the molecular dynamics can be formulated as a combined phase-space and path ensemble average, the method can be extended to reweight MSMs by combining it with a reweighting of the Boltzmann distribution. We demonstrate how to efficiently implement the Girsanov reweighting in a molecular dynamics simulation program by calculating parts of the reweighting factor "on the fly" during the simulation, and we benchmark the method on test systems ranging from a two-dimensional diffusion process and an artificial many-body system to alanine dipeptide and valine dipeptide in implicit and explicit water. The method can be used to study the sensitivity of molecular dynamics on external perturbations as well as to reweight trajectories generated by enhanced sampling schemes to the original dynamics.

[1]  Kyle A. Beauchamp,et al.  Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1-39). , 2010, Journal of the American Chemical Society.

[2]  Michael R. Shirts,et al.  Dynamical reweighting: improved estimates of dynamical properties from simulations at multiple temperatures. , 2011, The Journal of chemical physics.

[3]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[4]  M. Karplus,et al.  Stochastic boundary conditions for molecular dynamics simulations of ST2 water , 1984 .

[5]  Frank Noé,et al.  Variational tensor approach for approximating the rare-event kinetics of macromolecular systems. , 2016, The Journal of chemical physics.

[6]  C. Schütte,et al.  Efficient rare event simulation by optimal nonequilibrium forcing , 2012, 1208.3232.

[7]  É. Pardoux,et al.  Méthodes de Monte-Carlo pour les équations de transport et de diffusion , 1998 .

[8]  Gianni De Fabritiis,et al.  Kinetic modulation of a disordered protein domain by phosphorylation , 2014, Nature Communications.

[9]  Bettina Keller,et al.  An Analysis of the Validity of Markov State Models for Emulating the Dynamics of Classical Molecular Systems and Ensembles. , 2011, Journal of chemical theory and computation.

[10]  Frank Noé,et al.  Variational Approach to Molecular Kinetics. , 2014, Journal of chemical theory and computation.

[11]  Frank Noé,et al.  Optimal use of data in parallel tempering simulations for the construction of discrete-state Markov models of biomolecular dynamics. , 2011, The Journal of chemical physics.

[12]  B. Keller,et al.  Density-based cluster algorithms for the identification of core sets. , 2016, The Journal of chemical physics.

[13]  Walter Thiel,et al.  Bridging the gap between thermodynamic integration and umbrella sampling provides a novel analysis method: "Umbrella integration". , 2005, The Journal of chemical physics.

[14]  Christof Schütte,et al.  Markov state models and molecular alchemy , 2015 .

[15]  C. Oostenbrink,et al.  On using oscillating time-dependent restraints in MD simulation , 2007, Journal of biomolecular NMR.

[16]  Peter Deuflhard,et al.  Transfer Operator Approach to Conformational Dynamics in Biomolecular Systems , 2001 .

[17]  Xuhui Huang,et al.  Elucidation of the Dynamics of Transcription Elongation by RNA Polymerase II using Kinetic Network Models. , 2016, Accounts of chemical research.

[18]  Gerhard Stock,et al.  Multidimensional Langevin modeling of biomolecular dynamics. , 2009, The Journal of chemical physics.

[19]  R. Elber,et al.  Computing time scales from reaction coordinates by milestoning. , 2004, The Journal of chemical physics.

[20]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[21]  Kathryn M Hart,et al.  Discovery of multiple hidden allosteric sites by combining Markov state models and experiments , 2015, Proceedings of the National Academy of Sciences.

[22]  G. Hummer,et al.  Reaction coordinates and rates from transition paths. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Frank Noé,et al.  Statistically optimal analysis of state-discretized trajectory data from multiple thermodynamic states. , 2014, The Journal of chemical physics.

[24]  David D. L. Minh,et al.  Optimal estimators and asymptotic variances for nonequilibrium path-ensemble averages. , 2009, The Journal of chemical physics.

[25]  Andrew E. Torda,et al.  Local elevation: A method for improving the searching properties of molecular dynamics simulation , 1994, J. Comput. Aided Mol. Des..

[26]  Yannis Pantazis,et al.  Parametric Sensitivity Analysis for Stochastic Molecular Systems using Information Theoretic Metrics , 2014, The Journal of chemical physics.

[27]  Diwakar Shukla,et al.  OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. , 2013, Journal of chemical theory and computation.

[28]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[29]  L. Rey-Bellet,et al.  Efficient estimators for likelihood ratio sensitivity indices of complex stochastic dynamics. , 2015, The Journal of chemical physics.

[30]  Hao Wu,et al.  xTRAM: Estimating equilibrium expectations from time-correlated simulation data at multiple thermodynamic states , 2014, 1407.0138.

[31]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[32]  F. Noé,et al.  A Basis Set for Peptides for the Variational Approach to Conformational Kinetics. , 2015, Journal of chemical theory and computation.

[33]  Hao Wu,et al.  Multiensemble Markov models of molecular thermodynamics and kinetics , 2016, Proceedings of the National Academy of Sciences.

[34]  G. Hummer,et al.  Coarse master equations for peptide folding dynamics. , 2008, The journal of physical chemistry. B.

[35]  G. Torrie,et al.  Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling , 1977 .

[36]  Pietro Faccioli,et al.  Dominant reaction pathways in protein folding: A direct validation against molecular dynamics simulations. , 2010, The Journal of chemical physics.

[37]  David Chandler,et al.  Transition path sampling: throwing ropes over rough mountain passes, in the dark. , 2002, Annual review of physical chemistry.

[38]  P. Deuflhard,et al.  A Direct Approach to Conformational Dynamics Based on Hybrid Monte Carlo , 1999 .

[39]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[40]  I. V. Girsanov On Transforming a Certain Class of Stochastic Processes by Absolutely Continuous Substitution of Measures , 1960 .

[41]  Charles H. Bennett,et al.  Efficient estimation of free energy differences from Monte Carlo data , 1976 .

[42]  C. Simmerling,et al.  ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. , 2015, Journal of chemical theory and computation.

[43]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[44]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 2. Example Applications to Alanine Dipeptide and a β-Hairpin Peptide† , 2004 .

[45]  F. Noé,et al.  Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models , 2015, Nature Communications.

[46]  F. Noé,et al.  Dynamic properties of force fields. , 2015, The Journal of chemical physics.

[47]  Jesús A. Izaguirre,et al.  Multiscale Dynamics of Macromolecules Using Normal Mode Langevin , 2010, Pacific Symposium on Biocomputing.

[48]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[49]  Wilfred F van Gunsteren,et al.  Comparing geometric and kinetic cluster algorithms for molecular simulation data. , 2010, The Journal of chemical physics.

[50]  R. Zwanzig High‐Temperature Equation of State by a Perturbation Method. I. Nonpolar Gases , 1954 .

[51]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[52]  Yuji Sugita,et al.  Replica-exchange multicanonical algorithm and multicanonical replica-exchange method for simulating systems with rough energy landscape , 2000, cond-mat/0009119.

[53]  P. Deuflhard,et al.  Identification of almost invariant aggregates in reversible nearly uncoupled Markov chains , 2000 .

[54]  Giovanni Bussi,et al.  Combining Simulations and Solution Experiments as a Paradigm for RNA Force Field Refinement. , 2016, Journal of chemical theory and computation.

[55]  D. Case,et al.  Exploring protein native states and large‐scale conformational changes with a modified generalized born model , 2004, Proteins.

[56]  Frank Noé,et al.  Markov state models based on milestoning. , 2011, The Journal of chemical physics.

[57]  Michael R. Shirts,et al.  Statistically optimal analysis of samples from multiple equilibrium states. , 2008, The Journal of chemical physics.

[58]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.