Using sketch-map coordinates to analyze and bias molecular dynamics simulations

When examining complex problems, such as the folding of proteins, coarse grained descriptions of the system drive our investigation and help us to rationalize the results. Oftentimes collective variables (CVs), derived through some chemical intuition about the process of interest, serve this purpose. Because finding these CVs is the most difficult part of any investigation, we recently developed a dimensionality reduction algorithm, sketch-map, that can be used to build a low-dimensional map of a phase space of high-dimensionality. In this paper we discuss how these machine-generated CVs can be used to accelerate the exploration of phase space and to reconstruct free-energy landscapes. To do so, we develop a formalism in which high-dimensional configurations are no longer represented by low-dimensional position vectors. Instead, for each configuration we calculate a probability distribution, which has a domain that encompasses the entirety of the low-dimensional space. To construct a biasing potential, we exploit an analogy with metadynamics and use the trajectory to adaptively construct a repulsive, history-dependent bias from the distributions that correspond to the previously visited configurations. This potential forces the system to explore more of phase space by making it desirable to adopt configurations whose distributions do not overlap with the bias. We apply this algorithm to a small model protein and succeed in reproducing the free-energy surface that we obtain from a parallel tempering calculation.

[1]  Stéphane Lafon,et al.  Diffusion maps , 2006 .

[2]  Michele Parrinello,et al.  Simplifying the representation of complex free-energy landscapes using sketch-map , 2011, Proceedings of the National Academy of Sciences.

[3]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[4]  Alessandro Laio,et al.  Advillin folding takes place on a hypersurface of small dimensionality. , 2008, Physical review letters.

[5]  Frédéric Legoll,et al.  Free energy calculations: an efficient adaptive biasing potential method. , 2009, The journal of physical chemistry. B.

[6]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[7]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[8]  Sergei V. Krivov,et al.  Free energy disconnectivity graphs: Application to peptide models , 2002 .

[9]  Andrew L. Ferguson,et al.  Systematic determination of order parameters for chain dynamics using diffusion maps , 2010, Proceedings of the National Academy of Sciences.

[10]  Gerhard Stock,et al.  How complex is the dynamics of Peptide folding? , 2007, Physical review letters.

[11]  M. Parrinello,et al.  Well-tempered metadynamics: a smoothly converging and tunable free-energy method. , 2008, Physical review letters.

[12]  David J. Wales,et al.  Energy landscapes of model polyalanines , 2002 .

[13]  Peter A. Kollman Advances and Continuing Challenges in Achieving Realistic and Predictive Simulations of the Properties of Organic and Biological Molecules , 1997 .

[14]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[15]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[16]  Michele Parrinello,et al.  A self-learning algorithm for biased molecular dynamics , 2010, Proceedings of the National Academy of Sciences.

[17]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[18]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[19]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[20]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[21]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: multiscale methods. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  J. E. Glynn,et al.  Numerical Recipes: The Art of Scientific Computing , 1989 .

[23]  Bradley M Dickson Approaching a parameter-free metadynamics. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  P. Kollman Advances and Continuing Challenges in Achieving Realistic and Predictive Simulations of the Properties of Organic and Biological Molecules , 1996 .

[26]  Michele Parrinello,et al.  Exploring the free energy surfaces of clusters using reconnaissance metadynamics. , 2011, The Journal of chemical physics.

[27]  William H. Press,et al.  Numerical Recipes: The Art of Scientific Computing , 1987 .

[28]  Massimiliano Bonomi,et al.  PLUMED: A portable plugin for free-energy calculations with molecular dynamics , 2009, Comput. Phys. Commun..

[29]  Vojtěch Spiwok,et al.  Metadynamics in the conformational space nonlinearly dimensionally reduced by Isomap. , 2011, The Journal of chemical physics.

[30]  A. Laio,et al.  Free-energy landscape for beta hairpin folding from combined parallel tempering and metadynamics. , 2006, Journal of the American Chemical Society.

[31]  B. Nadler,et al.  Diffusion maps, spectral clustering and reaction coordinates of dynamical systems , 2005, math/0503445.

[32]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Francesco Luigi Gervasio,et al.  From A to B in free energy space. , 2007, The Journal of chemical physics.

[34]  KumarShankar,et al.  The weighted histogram analysis method for free-energy calculations on biomolecules. I , 1992 .

[35]  Christoph Dellago,et al.  Kinetic pathways of ion pair dissociation in water , 1999 .

[36]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[37]  Giovanni Bussi,et al.  Colored-Noise Thermostats à la Carte , 2010, 1204.0822.

[38]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[39]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[40]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[41]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[42]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[43]  D. Donoho,et al.  Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Massimiliano Bonomi,et al.  Metadynamics , 2019, ioChem-BD Computational Chemistry Datasets.

[45]  Massimiliano Bonomi,et al.  Reconstructing the equilibrium Boltzmann distribution from well‐tempered metadynamics , 2009, J. Comput. Chem..

[46]  García,et al.  Large-amplitude nonlinear motions in proteins. , 1992, Physical review letters.

[47]  P. Rios,et al.  Complex network analysis of free-energy landscapes , 2007, Proceedings of the National Academy of Sciences.