Simplifying the representation of complex free-energy landscapes using sketch-map

A new scheme, sketch-map, for obtaining a low-dimensional representation of the region of phase space explored during an enhanced dynamics simulation is proposed. We show evidence, from an examination of the distribution of pairwise distances between frames, that some features of the free-energy surface are inherently high-dimensional. This makes dimensionality reduction problematic because the data does not satisfy the assumptions made in conventional manifold learning algorithms We therefore propose that when dimensionality reduction is performed on trajectory data one should think of the resultant embedding as a quickly sketched set of directions rather than a road map. In other words, the embedding tells one about the connectivity between states but does not provide the vectors that correspond to the slow degrees of freedom. This realization informs the development of sketch-map, which endeavors to reproduce the proximity information from the high-dimensionality description in a space of lower dimensionality even when a faithful embedding is not possible.

[1]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[2]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: multiscale methods. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Peter A. Kollman Advances and Continuing Challenges in Achieving Realistic and Predictive Simulations of the Properties of Organic and Biological Molecules , 1997 .

[4]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[5]  John Langerholc Volumes of diced hyperspheres: resumming the Tam-Zardecki formula , 1989 .

[6]  Gerhard Stock,et al.  How complex is the dynamics of Peptide folding? , 2007, Physical review letters.

[7]  Garegin A Papoian,et al.  Deconstructing the native state: energy landscapes, function, and dynamics of globular proteins. , 2009, The journal of physical chemistry. B.

[8]  Lydia E Kavraki,et al.  Fast and reliable analysis of molecular motion using proximity relations and dimensionality reduction , 2007, Proteins.

[9]  Michele Parrinello,et al.  A self-learning algorithm for biased molecular dynamics , 2010, Proceedings of the National Academy of Sciences.

[10]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[11]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[12]  Amit Singer,et al.  Detecting intrinsic slow variables in stochastic dynamical systems by anisotropic diffusion maps , 2009, Proceedings of the National Academy of Sciences.

[13]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[14]  Gregory E Sims,et al.  Protein conformational space in higher order phi-Psi maps. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[15]  García,et al.  Large-amplitude nonlinear motions in proteins. , 1992, Physical review letters.

[16]  David J. Wales,et al.  Energy landscapes of model polyalanines , 2002 .

[17]  Alessandro Laio,et al.  Advillin folding takes place on a hypersurface of small dimensionality. , 2008, Physical review letters.

[18]  Cecilia Clementi,et al.  Polymer reversal rate calculated via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[19]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[20]  Massimiliano Bonomi,et al.  PLUMED: A portable plugin for free-energy calculations with molecular dynamics , 2009, Comput. Phys. Commun..

[21]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[22]  Alexander M. Bronstein,et al.  Nonlinear Dimensionality Reduction by Topologically Constrained Isometric Embedding , 2010, International Journal of Computer Vision.

[23]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[24]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[25]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[26]  Guillermo Sapiro,et al.  A Gromov-Hausdorff Framework with Diffusion Geometry for Topologically-Robust Non-rigid Shape Matching , 2010, International Journal of Computer Vision.

[27]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[28]  Stéphane Lafon,et al.  Diffusion maps , 2006 .

[29]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[30]  P. Kollman Advances and Continuing Challenges in Achieving Realistic and Predictive Simulations of the Properties of Organic and Biological Molecules , 1996 .

[31]  A. Bronstein,et al.  A MULTIGRID APPROACH FOR MULTI-DIMENSIONAL SCALING∗ , 2004 .

[32]  Berend Smit,et al.  Chapter 3 – Monte Carlo Simulations , 2002 .

[33]  Jing Wang,et al.  Local linear transformation embedding , 2009, Neurocomputing.

[34]  D. Donoho,et al.  Hessian eigenmaps: Locally linear embedding techniques for high-dimensional data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Andrew L. Ferguson,et al.  Systematic determination of order parameters for chain dynamics using diffusion maps , 2010, Proceedings of the National Academy of Sciences.

[36]  Sung-Hou Kim,et al.  Protein conformational space in higher order-maps , 2005 .

[37]  B. Schölkopf,et al.  Advances in kernel methods: support vector learning , 1999 .

[38]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[39]  Hernan F. Stamati,et al.  Application of nonlinear dimensionality reduction to characterize the conformational landscape of small peptides , 2010, Proteins.

[40]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[41]  David B. Shmoys,et al.  A Best Possible Heuristic for the k-Center Problem , 1985, Math. Oper. Res..

[42]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.