Neural Network and Nearest Neighbor Algorithms for Enhancing Sampling of Molecular Dynamics.

The free energy calculations of complex chemical and biological systems with molecular dynamics (MD) are inefficient due to multiple local minima separated by high-energy barriers. The minima can be escaped using an enhanced sampling method such as metadynamics, which apply bias (i.e., importance sampling) along a set of collective variables (CV), but the maximum number of CVs (or dimensions) is severely limited. We propose a high-dimensional bias potential method (NN2B) based on two machine learning algorithms: the nearest neighbor density estimator (NNDE) and the artificial neural network (ANN) for the bias potential approximation. The bias potential is constructed iteratively from short biased MD simulations accounting for correlation among CVs. Our method is capable of achieving ergodic sampling and calculating free energy of polypeptides with up to 8-dimensional bias potential.

[1]  B. Berne,et al.  Spectral gap optimization of order parameters for sampling complex molecular systems , 2015, Proceedings of the National Academy of Sciences.

[2]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[3]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[4]  Michele Parrinello,et al.  Well-tempered metadynamics converges asymptotically. , 2014, Physical review letters.

[5]  Michele Parrinello,et al.  A self-learning algorithm for biased molecular dynamics , 2010, Proceedings of the National Academy of Sciences.

[6]  A. Laio,et al.  Substrate binding mechanism of HIV-1 protease from explicit-solvent atomistic simulations. , 2009, Journal of the American Chemical Society.

[7]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Michele Parrinello,et al.  Demonstrating the Transferability and the Descriptive Power of Sketch-Map. , 2013, Journal of chemical theory and computation.

[9]  Vojtěch Spiwok,et al.  Metadynamics in the conformational space nonlinearly dimensionally reduced by Isomap. , 2011, The Journal of chemical physics.

[10]  Martin Karplus,et al.  Gaussian-mixture umbrella sampling. , 2009, The journal of physical chemistry. B.

[11]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997 .

[12]  Massimiliano Bonomi,et al.  PLUMED 2: New feathers for an old bird , 2013, Comput. Phys. Commun..

[13]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[14]  M. Parrinello,et al.  Metadynamics with Adaptive Gaussians. , 2012, Journal of chemical theory and computation.

[15]  Jean-Paul Watson,et al.  Algorithmic dimensionality reduction for molecular structure analysis. , 2008, The Journal of chemical physics.

[16]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  Grant M. Rotskoff,et al.  Transition-Tempered Metadynamics: Robust, Convergent Metadynamics via On-the-Fly Transition Barrier Estimation. , 2014, Journal of chemical theory and computation.

[19]  Michele Parrinello,et al.  Locating binding poses in protein-ligand systems using reconnaissance metadynamics , 2012, Proceedings of the National Academy of Sciences.

[20]  Michele Parrinello,et al.  Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint. , 2016, Annual review of physical chemistry.

[21]  Francesco Luigi Gervasio,et al.  Comparing the Efficiency of Biased and Unbiased Molecular Dynamics in Reconstructing the Free Energy Landscape of Met-Enkephalin , 2010 .

[22]  Massimiliano Bonomi,et al.  PLUMED: A portable plugin for free-energy calculations with molecular dynamics , 2009, Comput. Phys. Commun..

[23]  G. Torrie,et al.  Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling , 1977 .

[24]  Harshinder Singh,et al.  Nearest Neighbor Estimates of Entropy , 2003 .

[25]  Ron O. Dror,et al.  Mechanism of Voltage Gating in Potassium Channels , 2012, Science.

[26]  Hua Guo,et al.  Permutation invariant polynomial neural network approach to fitting potential energy surfaces. , 2013, The Journal of chemical physics.

[27]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[28]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[29]  Jun Li,et al.  Communication: An accurate full 15 dimensional permutationally invariant potential energy surface for the OH + CH4 → H2O + CH3 reaction. , 2015, The Journal of chemical physics.

[30]  Michele Parrinello,et al.  Simplifying the representation of complex free-energy landscapes using sketch-map , 2011, Proceedings of the National Academy of Sciences.

[31]  Marino Arroyo,et al.  Topological obstructions in the way of data-driven collective variables. , 2015, The Journal of chemical physics.

[32]  Alessandro Laio,et al.  Advillin folding takes place on a hypersurface of small dimensionality. , 2008, Physical review letters.

[33]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[34]  Vojtěch Spiwok,et al.  Continuous metadynamics in essential coordinates as a tool for free energy modelling of conformational changes , 2008, Journal of molecular modeling.

[35]  Jiří Vondrášek,et al.  Gyration- and inertia-tensor-based collective coordinates for metadynamics. Application on the conformational behavior of polyalanine peptides and Trp-cage folding. , 2011, The journal of physical chemistry. A.

[36]  Harshinder Singh,et al.  Nearest‐neighbor nonparametric method for estimating the configurational entropy of complex molecules , 2007, J. Comput. Chem..

[37]  Hao Wu,et al.  Multiensemble Markov models of molecular thermodynamics and kinetics , 2016, Proceedings of the National Academy of Sciences.

[38]  Michele Parrinello,et al.  Generalized neural-network representation of high-dimensional potential-energy surfaces. , 2007, Physical review letters.

[39]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[40]  Frank Noé,et al.  Statistically optimal analysis of state-discretized trajectory data from multiple thermodynamic states. , 2014, The Journal of chemical physics.

[41]  A. Laio,et al.  A bias-exchange approach to protein folding. , 2007, The journal of physical chemistry. B.

[42]  Noam Bernstein,et al.  Free Energy Surface Reconstruction from Umbrella Samples Using Gaussian Process Regression. , 2013, Journal of chemical theory and computation.

[43]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[44]  Alan M. Ferrenberg,et al.  New Monte Carlo technique for studying phase transitions. , 1988, Physical review letters.

[45]  L. Devroye,et al.  A weighted k-nearest neighbor density estimate for geometric inference , 2011 .

[46]  Jana Pazúriková,et al.  Nonlinear vs. linear biasing in Trp-cage folding simulations. , 2015, The Journal of chemical physics.

[47]  Michael R. Shirts,et al.  Statistically optimal analysis of samples from multiple equilibrium states. , 2008, The Journal of chemical physics.

[48]  Kevin J. Bowers,et al.  A maximum likelihood method for linking particle-in-cell and Monte-Carlo transport simulations , 2004, Comput. Phys. Commun..

[49]  Michele Parrinello,et al.  Variational approach to enhanced sampling and free energy calculations. , 2014, Physical review letters.

[50]  Alexander D. MacKerell,et al.  Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles. , 2012, Journal of chemical theory and computation.

[51]  Marino Arroyo,et al.  Modeling and enhanced sampling of molecular systems with smooth and nonlinear data-driven collective variables. , 2013, The Journal of chemical physics.

[52]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[53]  Eric T. Kim,et al.  How does a drug molecule find its target binding site? , 2011, Journal of the American Chemical Society.

[54]  Massimiliano Bonomi,et al.  Efficient Sampling of High-Dimensional Free-Energy Landscapes with Parallel Bias Metadynamics. , 2015, Journal of chemical theory and computation.

[55]  M. Parrinello,et al.  Well-tempered metadynamics: a smoothly converging and tunable free-energy method. , 2008, Physical review letters.

[56]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[57]  Giovanni Bussi,et al.  Enhanced Conformational Sampling Using Replica Exchange with Collective-Variable Tempering , 2015, Journal of chemical theory and computation.

[58]  Michele Parrinello,et al.  Well-Tempered Variational Approach to Enhanced Sampling. , 2015, Journal of chemical theory and computation.

[59]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[60]  Yasuhiro Matsunaga,et al.  Dimensionality of Collective Variables for Describing Conformational Changes of a Multi-Domain Protein. , 2016, The journal of physical chemistry letters.

[61]  Francesco Luigi Gervasio,et al.  New advances in metadynamics , 2012 .

[62]  Rui Sun,et al.  Exploring Valleys without Climbing Every Peak: More Efficient and Forgiving Metabasin Metadynamics via Robust On-the-Fly Bias Domain Restriction , 2015, Journal of chemical theory and computation.

[63]  Vojtech Spiwok,et al.  Metadynamics in essential coordinates: free energy simulation of conformational changes. , 2007, The journal of physical chemistry. B.

[64]  G. Matheron The intrinsic random functions and their applications , 1973, Advances in Applied Probability.

[65]  K. Lindorff-Larsen,et al.  Atomic-level description of ubiquitin folding , 2013, Proceedings of the National Academy of Sciences.

[66]  Jörg Behler,et al.  Constructing high‐dimensional neural network potentials: A tutorial review , 2015 .

[67]  D. van der Spoel,et al.  A temperature predictor for parallel tempering simulations. , 2008, Physical chemistry chemical physics : PCCP.

[68]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .

[69]  M. Gastegger,et al.  High-Dimensional Neural Network Potentials for Organic Reactions and an Improved Training Algorithm. , 2015, Journal of chemical theory and computation.

[70]  Jun Li,et al.  Permutation invariant polynomial neural network approach to fitting potential energy surfaces. II. Four-atom systems. , 2013, The Journal of chemical physics.

[71]  Michele Parrinello,et al.  Enhanced, targeted sampling of high-dimensional free-energy landscapes using variationally enhanced sampling, with an application to chignolin , 2016, Proceedings of the National Academy of Sciences.

[72]  Jing Huang,et al.  CHARMM36 all‐atom additive protein force field: Validation based on comparison to NMR data , 2013, J. Comput. Chem..

[73]  Michele Parrinello,et al.  Probing the Unfolded Configurations of a β-Hairpin Using Sketch-Map. , 2015, Journal of chemical theory and computation.