Topological obstructions in the way of data-driven collective variables.

Nonlinear dimensionality reduction (NLDR) techniques are increasingly used to visualize molecular trajectories and to create data-driven collective variables for enhanced sampling simulations. The success of these methods relies on their ability to identify the essential degrees of freedom characterizing conformational changes. Here, we show that NLDR methods face serious obstacles when the underlying collective variables present periodicities, e.g., arising from proper dihedral angles. As a result, NLDR methods collapse very distant configurations, thus leading to misinterpretations and inefficiencies in enhanced sampling. Here, we identify this largely overlooked problem and discuss possible approaches to overcome it. We also characterize the geometry and topology of conformational changes of alanine dipeptide, a benchmark system for testing new methods to identify collective variables.

[1]  Alessandro Laio,et al.  Metadynamics convergence law in a multidimensional system. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Kazuhiko Kinosita,et al.  Direct observation of the rotation of F1-ATPase , 1997, Nature.

[3]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[4]  E. Coutsias,et al.  Topology of cyclo-octane energy landscape. , 2010, The Journal of chemical physics.

[5]  B. L. de Groot,et al.  Molecular dynamics in principal component space. , 2012, The journal of physical chemistry. B.

[6]  Michele Parrinello,et al.  Demonstrating the Transferability and the Descriptive Power of Sketch-Map. , 2013, Journal of chemical theory and computation.

[7]  Gerhard Stock,et al.  How complex is the dynamics of Peptide folding? , 2007, Physical review letters.

[8]  Lydia E. Kavraki,et al.  Understanding Protein Flexibility through Dimensionality Reduction , 2003, J. Comput. Biol..

[9]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[10]  Alessandro Laio,et al.  A Collective Variable for the Efficient Exploration of Protein Beta-Sheet Structures: Application to SH3 and GB1. , 2009, Journal of chemical theory and computation.

[11]  Ioannis G Kevrekidis,et al.  Integrating diffusion maps with umbrella sampling: application to alanine dipeptide. , 2011, The Journal of chemical physics.

[12]  García,et al.  Large-amplitude nonlinear motions in proteins. , 1992, Physical review letters.

[13]  Michel Verleysen,et al.  Nonlinear Dimensionality Reduction , 2021, Computer Vision.

[14]  Alessandro Laio,et al.  Advillin folding takes place on a hypersurface of small dimensionality. , 2008, Physical review letters.

[15]  Michele Parrinello,et al.  Using sketch-map coordinates to analyze and bias molecular dynamics simulations , 2012, Proceedings of the National Academy of Sciences.

[16]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[17]  Michel Verleysen,et al.  Nonlinear dimensionality reduction of data manifolds with essential loops , 2005, Neurocomputing.

[18]  Marino Arroyo,et al.  Modeling and enhanced sampling of molecular systems with smooth and nonlinear data-driven collective variables. , 2013, The Journal of chemical physics.

[19]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[20]  Charles C. David,et al.  Principal component analysis: a method for determining the essential dynamics of proteins. , 2014, Methods in molecular biology.

[21]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[22]  S. Huo,et al.  Euclidean sections of protein conformation space and their implications in dimensionality reduction , 2014, Proteins.

[23]  Oliver F. Lange,et al.  Can principal components yield a dimension reduced description of protein dynamics on long time scales? , 2006, The journal of physical chemistry. B.

[24]  B. L. de Groot,et al.  Essential dynamics of reversible peptide folding: memory-free conformational dynamics governed by internal hydrogen bonds. , 2001, Journal of molecular biology.

[25]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[26]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[27]  Vojtech Spiwok,et al.  Metadynamics in essential coordinates: free energy simulation of conformational changes. , 2007, The journal of physical chemistry. B.

[28]  Yuzhen Xue,et al.  State reduction in molecular simulations , 2013, Comput. Chem. Eng..

[29]  Michele Parrinello,et al.  Simplifying the representation of complex free-energy landscapes using sketch-map , 2011, Proceedings of the National Academy of Sciences.

[30]  Vojtěch Spiwok,et al.  Metadynamics in the conformational space nonlinearly dimensionally reduced by Isomap. , 2011, The Journal of chemical physics.

[31]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[32]  Andrew L. Ferguson,et al.  Systematic determination of order parameters for chain dynamics using diffusion maps , 2010, Proceedings of the National Academy of Sciences.

[33]  C. Malsburg Self-organization of orientation sensitive cells in the striate cortex , 2004, Kybernetik.

[34]  Alessandro Laio,et al.  The conformational free energy landscape of beta-D-glucopyranose. Implications for substrate preactivation in beta-glucoside hydrolases. , 2007, Journal of the American Chemical Society.

[35]  Jean-Paul Watson,et al.  Algorithmic dimensionality reduction for molecular structure analysis. , 2008, The Journal of chemical physics.

[36]  A. Liwo,et al.  Principal component analysis for protein folding dynamics. , 2009, Journal of molecular biology.

[37]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[38]  Imre G. Csizmadia,et al.  Variation of conformational properties at a glance. True graphical visualization of the Ramachandran surface topology as a periodic potential energy surface , 2012 .

[39]  Hernan F. Stamati,et al.  Application of nonlinear dimensionality reduction to characterize the conformational landscape of small peptides , 2010, Proteins.

[40]  David E. Shaw,et al.  The future of molecular dynamics simulations in drug discovery , 2011, Journal of Computer-Aided Molecular Design.