Reinforced dynamics for enhanced sampling in large atomic and molecular systems. I. Basic Methodology

A new approach for efficiently exploring the configuration space and computing the free energy of large atomic and molecular systems is proposed, motivated by an analogy with reinforcement learning. There are two major components in this new approach. Like metadynamics, it allows for an efficient exploration of the configuration space by adding an adaptively computed biasing potential to the original dynamics. Like deep reinforcement learning, this biasing potential is trained on the fly using deep neural networks, with data collected judiciously from the exploration and an uncertainty indicator from the neural network model playing the role of the reward function. Parameterization using neural networks makes it feasible to handle cases with a large set of collective variables. This has the potential advantage that selecting precisely the right set of collective variables has now become less critical for capturing the structural transformations of the system. The method is illustrated by studying the full-atom explicit solvent models of alanine dipeptide and tripeptide, as well as the system of a polyalanine-10 molecule with 20 collective variables.

[1]  Wilfred F van Gunsteren,et al.  Multiple free energies from a single simulation: extending enveloping distribution sampling to nonoverlapping phase-space distributions. , 2008, The Journal of chemical physics.

[2]  Benjamin Trendelkamp-Schroer,et al.  Efficient estimation of rare-event kinetics , 2014, 1409.6439.

[3]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[4]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[5]  M. Tuckerman,et al.  Efficient and direct generation of multidimensional free energy surfaces via adiabatic dynamics without coordinate transformations. , 2008, The journal of physical chemistry. B.

[6]  Noam Bernstein,et al.  Exploration, Sampling, And Reconstruction of Free Energy Surfaces with Gaussian Process Regression. , 2016, Journal of chemical theory and computation.

[7]  P. Tavan,et al.  The "Hot-Solvent/Cold-Solute" Problem Revisited. , 2008, Journal of chemical theory and computation.

[8]  M. Parrinello,et al.  Well-tempered metadynamics: a smoothly converging and tunable free-energy method. , 2008, Physical review letters.

[9]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[10]  E. Vanden-Eijnden,et al.  Single-sweep methods for free energy calculations. , 2007, The Journal of chemical physics.

[11]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Yuji Sugita,et al.  Neural Network and Nearest Neighbor Algorithms for Enhancing Sampling of Molecular Dynamics. , 2017, Journal of chemical theory and computation.

[13]  Yi Qin Gao,et al.  Self-adaptive enhanced sampling in the energy and trajectory spaces: accelerated thermodynamics and kinetic calculations. , 2008, The Journal of chemical physics.

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17]  Mark E Tuckerman,et al.  Stochastic Neural Network Approach for Learning High-Dimensional Free Energy Surfaces. , 2017, Physical review letters.

[18]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[19]  E. Weinan,et al.  Deep Potential: a general representation of a many-body potential energy surface , 2017, 1707.01478.

[20]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[21]  Noam Bernstein,et al.  Free Energy Surface Reconstruction from Umbrella Samples Using Gaussian Process Regression. , 2013, Journal of chemical theory and computation.

[22]  R. Fletcher Practical Methods of Optimization , 1988 .

[23]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[24]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[25]  Mark E Tuckerman,et al.  Temperature-accelerated method for exploring polymorphism in molecular crystals based on free energy. , 2011, Physical review letters.

[26]  KumarShankar,et al.  The weighted histogram analysis method for free-energy calculations on biomolecules. I , 1992 .

[27]  A. Voter Hyperdynamics: Accelerated Molecular Dynamics of Infrequent Events , 1997 .

[28]  Klaus-Robert Müller,et al.  Efficient BackProp , 2012, Neural Networks: Tricks of the Trade.

[29]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[30]  G. Ciccotti,et al.  Blue moon sampling, vectorial reaction coordinates, and unbiased constrained dynamics. , 2005, Chemphyschem : a European journal of chemical physics and physical chemistry.

[31]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[32]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[33]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[34]  Joost VandeVondele,et al.  Efficient multidimensional free energy calculations for ab initio molecular dynamics using classical bias potentials , 2000 .

[35]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[36]  E. Vanden-Eijnden,et al.  A temperature accelerated method for sampling free energy and determining reaction pathways in rare events simulations , 2006 .

[37]  E Weinan,et al.  Deep Potential Molecular Dynamics: a scalable model with the accuracy of quantum mechanics , 2017, Physical review letters.

[38]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[39]  Massimiliano Bonomi,et al.  PLUMED 2: New feathers for an old bird , 2013, Comput. Phys. Commun..

[40]  P. Kollman Advances and Continuing Challenges in Achieving Realistic and Predictive Simulations of the Properties of Organic and Biological Molecules , 1996 .

[41]  Shankar Kumar,et al.  Multidimensional free‐energy calculations using the weighted histogram analysis method , 1995, J. Comput. Chem..

[42]  Eric Vanden-Eijnden,et al.  Simplified and improved string method for computing the minimum energy paths in barrier-crossing events. , 2007, The Journal of chemical physics.

[43]  E. Vanden-Eijnden,et al.  String method for the study of rare events , 2002, cond-mat/0205527.

[44]  Michael W Deem,et al.  Parallel tempering: theory, applications, and new perspectives. , 2005, Physical chemistry chemical physics : PCCP.