Identification of slow molecular order parameters for Markov model construction.

A goal in the kinetic characterization of a macromolecular system is the description of its slow relaxation processes via (i) identification of the structural changes involved in these processes and (ii) estimation of the rates or timescales at which these slow processes occur. Most of the approaches to this task, including Markov models, master-equation models, and kinetic network models, start by discretizing the high-dimensional state space and then characterize relaxation processes in terms of the eigenvectors and eigenvalues of a discrete transition matrix. The practical success of such an approach depends very much on the ability to finely discretize the slow order parameters. How can this task be achieved in a high-dimensional configuration space without relying on subjective guesses of the slow order parameters? In this paper, we use the variational principle of conformation dynamics to derive an optimal way of identifying the "slow subspace" of a large set of prior order parameters - either generic internal coordinates or a user-defined set of parameters. Using a variational formulation of conformational dynamics, it is shown that an existing method-the time-lagged independent component analysis-provides the optional solution to this problem. In addition, optimal indicators-order parameters indicating the progress of the slow transitions and thus may serve as reaction coordinates-are readily identified. We demonstrate that the slow subspace is well suited to construct accurate kinetic models of two sets of molecular dynamics simulations, the 6-residue fluorescent peptide MR121-GSGSW and the 30-residue intrinsically disordered peptide kinase inducible domain (KID). The identified optimal indicators reveal the structural changes associated with the slow processes of the molecular system under analysis.

[1]  F. Rao,et al.  The protein folding network. , 2004, Journal of molecular biology.

[2]  M J Harvey,et al.  ACEMD: Accelerating Biomolecular Dynamics in the Microsecond Time Scale. , 2009, Journal of chemical theory and computation.

[3]  M. Rief,et al.  The Complex Folding Network of Single Calmodulin Molecules , 2011, Science.

[4]  H. Dyson,et al.  Mechanism of coupled folding and binding of an intrinsically disordered protein , 2007, Nature.

[5]  Benjamin A. Shoemaker,et al.  Speeding molecular recognition by using the folding funnel: the fly-casting mechanism. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Jeremy C. Smith,et al.  Dynamical fingerprints for probing individual relaxation processes in biomolecular dynamics with simulations and kinetic experiments , 2011, Proceedings of the National Academy of Sciences.

[7]  V. Pande,et al.  Calculation of the distribution of eigenvalues and eigenvectors in Markovian state models for molecular dynamics. , 2007, The Journal of chemical physics.

[8]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.

[9]  Philipp Metzner,et al.  Mechanisms of protein-ligand association and its modulation by protein mutations. , 2011, Biophysical journal.

[10]  C. Schütte,et al.  Supplementary Information for “ Constructing the Equilibrium Ensemble of Folding Pathways from Short Off-Equilibrium Simulations ” , 2009 .

[11]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[12]  Philip M. Long,et al.  Performance guarantees for hierarchical clustering , 2002, J. Comput. Syst. Sci..

[13]  Frank Noé,et al.  A Variational Approach to Modeling Slow Processes in Stochastic Dynamical Systems , 2012, Multiscale Model. Simul..

[14]  Berk Hess,et al.  Improving efficiency of large time‐scale molecular dynamics simulations of hydrogen‐rich systems , 1999, Journal of computational chemistry.

[15]  Kyle A. Beauchamp,et al.  Molecular simulation of ab initio protein folding for a millisecond folder NTL9(1-39). , 2010, Journal of the American Chemical Society.

[16]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[17]  Max Löhning,et al.  Chung Folding Transition Path Times Single-Molecule Fluorescence Experiments Determine Protein , 2012 .

[18]  Schuster,et al.  Separation of a mixture of independent signals using time delayed correlations. , 1994, Physical review letters.

[19]  Frank Noé,et al.  On the Approximation Quality of Markov State Models , 2010, Multiscale Model. Simul..

[20]  F. Noé,et al.  Transition networks for modeling the kinetics of conformational change in macromolecules. , 2008, Current opinion in structural biology.

[21]  Frank Noé,et al.  EMMA: A Software Package for Markov Model Building and Analysis. , 2012, Journal of chemical theory and computation.

[22]  Amedeo Caflisch,et al.  The Free Energy Landscape of Small Molecule Unbinding , 2011, PLoS Comput. Biol..

[23]  G. Bowman,et al.  Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites , 2012, Proceedings of the National Academy of Sciences.

[24]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Thomas J Lane,et al.  MSMBuilder2: Modeling Conformational Dynamics at the Picosecond to Millisecond Scale. , 2011, Journal of chemical theory and computation.

[26]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[27]  B. L. de Groot,et al.  Essential dynamics of reversible peptide folding: memory-free conformational dynamics governed by internal hydrogen bonds. , 2001, Journal of molecular biology.

[28]  Eric J. Deeds,et al.  Understanding ensemble protein folding at atomic detail , 2006, Proceedings of the National Academy of Sciences.

[29]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[30]  A. Mitsutake,et al.  Relaxation mode analysis of a peptide system: comparison with principal component analysis. , 2011, The Journal of chemical physics.

[31]  E. Oja,et al.  Independent Component Analysis , 2013 .

[32]  A. Berezhkovskii,et al.  Reactive flux and folding pathways in network models of coarse-grained protein dynamics. , 2009, The Journal of chemical physics.

[33]  Jörg Langowski,et al.  Nucleosome disassembly intermediates characterized by single-molecule FRET , 2009, Proceedings of the National Academy of Sciences.

[34]  A. Caflisch,et al.  Kinetic analysis of molecular dynamics simulations reveals changes in the denatured state and switch of folding pathways upon single‐point mutation of a β‐sheet miniprotein , 2008, Proteins.

[35]  Francesca Fanelli,et al.  Wordom: A User-Friendly Program for the Analysis of Molecular Structures, Trajectories, and Free Energy Surfaces , 2010, J. Comput. Chem..

[36]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[37]  M. Karplus,et al.  Hidden complexity of free energy surfaces for peptide (protein) folding. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Frank Noé,et al.  Solvent electrostriction-driven peptide folding revealed by quasi-Gaussian entropy theory and molecular dynamics simulation. , 2008, The journal of physical chemistry. B.

[39]  V. Pande,et al.  Error analysis and efficient sampling in Markovian state models for molecular dynamics. , 2005, The Journal of chemical physics.

[40]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[41]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[42]  G. Hummer,et al.  Coarse master equations for peptide folding dynamics. , 2008, The journal of physical chemistry. B.

[43]  L. Kay,et al.  Intrinsic dynamics of an enzyme underlies catalysis , 2005, Nature.

[44]  G. Nienhaus,et al.  Ligand binding and conformational motions in myoglobin , 2000, Nature.

[45]  Sören Doose,et al.  Dynamics of unfolded polypeptide chains in crowded environment studied by fluorescence correlation spectroscopy. , 2007, Journal of molecular biology.

[46]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[47]  Gerhard Stock,et al.  Construction of the free energy landscape of biomolecules via dihedral angle principal component analysis. , 2008, The Journal of chemical physics.

[48]  T. Cheatham,et al.  Determination of Alkali and Halide Monovalent Ion Parameters for Use in Explicitly Solvated Biomolecular Simulations , 2008, The journal of physical chemistry. B.

[49]  Sotaro Fuchigami,et al.  Slow dynamics in protein fluctuations revealed by time-structure based independent component analysis: the case of domain motions. , 2011, The Journal of chemical physics.

[50]  Vijay S Pande,et al.  Simple few-state models reveal hidden complexity in protein folding , 2012, Proceedings of the National Academy of Sciences.

[51]  Matthias Rief,et al.  Full distance-resolved folding energy landscape of one single protein molecule , 2010, Proceedings of the National Academy of Sciences.

[52]  Jeremy C. Smith,et al.  Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. , 2007, The Journal of chemical physics.

[53]  Albert C. Pan,et al.  Building Markov state models along pathways to determine free energies and rates of transitions. , 2008, The Journal of chemical physics.

[54]  J. Torella,et al.  Conformational transitions in DNA polymerase I revealed by single-molecule FRET , 2009, Proceedings of the National Academy of Sciences.

[55]  G. Nienhaus,et al.  Mg2+-dependent folding of a Diels-Alderase ribozyme probed by single-molecule FRET analysis , 2007, Nucleic acids research.

[56]  R. Liu,et al.  AMUSE: a new blind identification algorithm , 1990, IEEE International Symposium on Circuits and Systems.

[57]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 1. Theory , 2004 .

[58]  Yan Zhang,et al.  Structure-function-folding relationship in a WW domain. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[59]  X. Xie,et al.  Observation of a power-law memory kernel for fluctuations within a single protein molecule. , 2005, Physical review letters.

[60]  Jeremy C. Smith,et al.  Hydrogen-Bond Driven Loop-Closure Kinetics in Unfolded Polypeptide Chains , 2010, PLoS Comput. Biol..

[61]  Vincent A. Voelz,et al.  Atomistic folding simulations of the five-helix bundle protein λ(6−85). , 2011, Journal of the American Chemical Society.

[62]  Alessandro Laio,et al.  METAGUI. A VMD interface for analyzing metadynamics and molecular dynamics simulations , 2012, Comput. Phys. Commun..

[63]  G. de Fabritiis,et al.  Complete reconstruction of an enzyme-inhibitor binding process by molecular dynamics simulations , 2011, Proceedings of the National Academy of Sciences.

[64]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[65]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[66]  I. Kevrekidis,et al.  Coarse master equation from Bayesian analysis of replica molecular dynamics simulations. , 2005, The journal of physical chemistry. B.

[67]  Jane Clarke,et al.  Experimental evidence for a frustrated energy landscape in a 3-helix bundle protein family , 2009, Nature.

[68]  Frank Noé,et al.  Kinetic characterization of the critical step in HIV-1 protease maturation , 2012, Proceedings of the National Academy of Sciences.

[69]  Eric Vanden-Eijnden,et al.  Transition Path Theory for Markov Jump Processes , 2009, Multiscale Model. Simul..

[70]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[71]  Lawryn H Kasper,et al.  Conditional Knockout Mice Reveal Distinct Functions for the Global Transcriptional Coactivators CBP and p300 in T-Cell Development , 2006, Molecular and Cellular Biology.

[72]  Stefan Fischer,et al.  Structural mechanism of the recovery stroke in the myosin molecular motor. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[73]  Peter E Wright,et al.  Solution Structure of the KIX Domain of CBP Bound to the Transactivation Domain of CREB: A Model for Activator:Coactivator Interactions , 1997, Cell.

[74]  W. Ritz Über eine neue Methode zur Lösung gewisser Variationsprobleme der mathematischen Physik. , 1909 .

[75]  J. Chodera,et al.  Spectral Rate Theory for Two-State Kinetics. , 2012, Physical review. X.

[76]  Andreas Volkmer,et al.  Orientational and dynamical heterogeneity of rhodamine 6G terminally attached to a DNA helix revealed by NMR and single-molecule fluorescence spectroscopy. , 2007, Journal of the American Chemical Society.

[77]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[78]  Jan Kubelka,et al.  Dynamics of protein folding: probing the kinetic network of folding-unfolding transitions with experiment and theory. , 2011, Biochimica et biophysica acta.

[79]  Vladimir N Uversky,et al.  Intrinsically disordered proteins from A to Z. , 2011, The international journal of biochemistry & cell biology.

[80]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[81]  G. Ziv,et al.  Single-molecule fluorescence spectroscopy maps the folding landscape of a large protein. , 2011, Nature communications.

[82]  Diego Prada-Gracia,et al.  Accounting for the kinetics in order parameter analysis: lessons from theoretical models and a disordered peptide. , 2012, The Journal of chemical physics.

[83]  P. Deuflhard,et al.  Robust Perron cluster analysis in conformation dynamics , 2005 .

[84]  Marcus Weber,et al.  A coarse graining method for the identification of transition rates between molecular conformations. , 2007, The Journal of chemical physics.

[85]  Christof Schütte,et al.  Estimating the Eigenvalue Error of Markov State Models , 2012, Multiscale Model. Simul..

[86]  Frank Noé,et al.  Markov models and dynamical fingerprints: Unraveling the complexity of molecular kinetics , 2012 .

[87]  Jeremy C. Smith,et al.  Transition Networks for the Comprehensive Characterization of Complex Conformational Change in Proteins. , 2006, Journal of chemical theory and computation.

[88]  Paul Tavan,et al.  Extracting Markov Models of Peptide Conformational Dynamics from Simulation Data. , 2005, Journal of chemical theory and computation.

[89]  P. Deuflhard,et al.  A Direct Approach to Conformational Dynamics Based on Hybrid Monte Carlo , 1999 .

[90]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[91]  David P. Anderson,et al.  High-Throughput All-Atom Molecular Dynamics Simulations Using Distributed Computing , 2010, J. Chem. Inf. Model..

[92]  Marcus Weber Improved Perron Cluster Analysis , 2003 .

[93]  C. Brooks,et al.  Statistical clustering techniques for the analysis of long molecular dynamics trajectories: analysis of 2.2-ns trajectories of YPGDV. , 1993, Biochemistry.

[94]  Gerhard Stock,et al.  Hidden Complexity of Protein Free-Energy Landscapes Revealed by Principal Component Analysis by Parts , 2010 .

[95]  F. Noé Probability distributions of molecular observables computed from Markov models. , 2008, The Journal of chemical physics.