Modeling Molecular Kinetics with tICA and the Kernel Trick

The allure of a molecular dynamics simulation is that, given a sufficiently accurate force field, it can provide an atomic-level view of many interesting phenomena in biology. However, the result of a simulation is a large, high-dimensional time series that is difficult to interpret. Recent work has introduced the time-structure based Independent Components Analysis (tICA) method for analyzing MD, which attempts to find the slowest decorrelating linear functions of the molecular coordinates. This method has been used in conjunction with Markov State Models (MSMs) to provide estimates of the characteristic eigenprocesses contained in a simulation (e.g., protein folding, ligand binding). Here, we extend the tICA method using the kernel trick to arrive at nonlinear solutions. This is a substantial improvement as it allows for kernel-tICA (ktICA) to provide estimates of the characteristic eigenprocesses directly without building an MSM.

[1]  Oliver F. Lange,et al.  Evaluation and optimization of discrete state models of protein folding. , 2012, The journal of physical chemistry. B.

[2]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[3]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[4]  Vijay S. Pande,et al.  Screen Savers of the World Unite! , 2000, Science.

[5]  Vijay S. Pande,et al.  Accelerating molecular dynamic simulation on graphics processing units , 2009, J. Comput. Chem..

[6]  Thomas J Lane,et al.  MSMBuilder2: Modeling Conformational Dynamics at the Picosecond to Millisecond Scale. , 2011, Journal of chemical theory and computation.

[7]  B. Roux,et al.  Explaining why Gleevec is a specific and potent inhibitor of Abl kinase , 2013, Proceedings of the National Academy of Sciences.

[8]  Frank Noé,et al.  Variational Approach to Molecular Kinetics. , 2014, Journal of chemical theory and computation.

[9]  Diwakar Shukla,et al.  OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. , 2013, Journal of chemical theory and computation.

[10]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[11]  Duncan Poole,et al.  Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born , 2012, Journal of chemical theory and computation.

[12]  K. Müller,et al.  Location of saddle points and minimum energy paths by a constrained simplex optimization procedure , 1979 .

[13]  Albert C. Pan,et al.  Recovery from Slow Inactivation in K+ Channels is Controlled by Water Molecules , 2013, Nature.

[14]  Bart Kosko,et al.  Neural networks for signal processing , 1992 .

[15]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[16]  Vincent A. Voelz,et al.  Slow unfolded-state structuring in Acyl-CoA binding protein folding revealed by simulation and experiment. , 2012, Journal of the American Chemical Society.

[17]  R. Altman,et al.  Cloud-based simulations on Google Exacycle reveal ligand-modulation of GPCR activation pathways , 2013, Nature chemistry.

[18]  Dahlia R. Weiss,et al.  Millisecond dynamics of RNA polymerase II translocation at atomic resolution , 2014, Proceedings of the National Academy of Sciences.

[19]  J. Berg,et al.  Molecular dynamics simulations of biomolecules , 2002, Nature Structural Biology.

[20]  Peter Deuflhard,et al.  Transfer Operator Approach to Conformational Dynamics in Biomolecular Systems , 2001 .

[21]  Schuster,et al.  Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.

[22]  M. Levitt The birth of computational structural biology , 2001, Nature Structural Biology.

[23]  Albert C. Pan,et al.  Pathway and mechanism of drug binding to G-protein-coupled receptors , 2011, Proceedings of the National Academy of Sciences.

[24]  Kenji Fukumizu,et al.  Statistical Consistency of Kernel Canonical Correlation Analysis , 2007 .

[25]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[26]  Vijay S Pande,et al.  Statistical model selection for Markov models of biomolecular dynamics. , 2014, The journal of physical chemistry. B.

[27]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[28]  Kyle A. Beauchamp,et al.  Markov state model reveals folding and functional dynamics in ultra-long MD trajectories. , 2011, Journal of the American Chemical Society.

[29]  V. Pande,et al.  Activation pathway of Src kinase reveals intermediate states as novel targets for drug design , 2014, Nature Communications.

[30]  David P. Anderson,et al.  High-Throughput All-Atom Molecular Dynamics Simulations Using Distributed Computing , 2010, J. Chem. Inf. Model..

[31]  Amedeo Caflisch,et al.  Distribution of Reciprocal of Interatomic Distances: a Fast Structural Metric , 2022 .

[32]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[33]  G. de Fabritiis,et al.  Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations , 2011, Proceedings of the National Academy of Sciences.

[34]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[35]  Diwakar Shukla,et al.  To milliseconds and beyond: challenges in the simulation of protein folding. , 2013, Current opinion in structural biology.

[36]  Vijay S Pande,et al.  Learning Kinetic Distance Metrics for Markov State Models of Protein Conformational Dynamics. , 2013, Journal of chemical theory and computation.

[37]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003 .

[38]  Frank Noé,et al.  A Variational Approach to Modeling Slow Processes in Stochastic Dynamical Systems , 2012, Multiscale Model. Simul..

[39]  Gregory R. Bowman,et al.  Extensive Conformational Heterogeneity within Protein Cores , 2014, The journal of physical chemistry. B.

[40]  Sotaro Fuchigami,et al.  Slow dynamics in protein fluctuations revealed by time-structure based independent component analysis: the case of domain motions. , 2011, The Journal of chemical physics.

[41]  Laurenz Wiskott,et al.  What Is the Relation Between Slow Feature Analysis and Independent Component Analysis? , 2006, Neural Computation.

[42]  Diwakar Shukla,et al.  Complex pathways in folding of protein G explored by simulation and experiment. , 2014, Biophysical journal.

[43]  Vincent A. Voelz,et al.  Computational Screening and Selection of Cyclic Peptide Hairpin Mimetics by Molecular Simulation and Kinetic Network Models , 2014, J. Chem. Inf. Model..

[44]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[45]  J Andrew McCammon,et al.  Elucidating the inhibition mechanism of HIV-1 non-nucleoside reverse transcriptase inhibitors through multicopy molecular dynamics simulations. , 2009, Journal of molecular biology.

[46]  Eric T. Kim,et al.  How does a drug molecule find its target binding site? , 2011, Journal of the American Chemical Society.