Systematic characterization of protein folding pathways using diffusion maps: application to Trp-cage miniprotein.

Understanding the mechanisms by which proteins fold from disordered amino-acid chains to spatially ordered structures remains an area of active inquiry. Molecular simulations can provide atomistic details of the folding dynamics which complement experimental findings. Conventional order parameters, such as root-mean-square deviation and radius of gyration, provide structural information but fail to capture the underlying dynamics of the protein folding process. It is therefore advantageous to adopt a method that can systematically analyze simulation data to extract relevant structural as well as dynamical information. The nonlinear dimensionality reduction technique known as diffusion maps automatically embeds the high-dimensional folding trajectories in a lower-dimensional space from which one can more easily visualize folding pathways, assuming the data lie approximately on a lower-dimensional manifold. The eigenvectors that parametrize the low-dimensional space, furthermore, are determined systematically, rather than chosen heuristically, as is done with phenomenological order parameters. We demonstrate that diffusion maps can effectively characterize the folding process of a Trp-cage miniprotein. By embedding molecular dynamics simulation trajectories of Trp-cage folding in diffusion maps space, we identify two folding pathways and intermediate structures that are consistent with the previous studies, demonstrating that this technique can be employed as an effective way of analyzing and constructing protein folding pathways from molecular simulations.

[1]  Ioannis G Kevrekidis,et al.  Diffusion maps, clustering and fuzzy Markov modeling in peptide folding transitions. , 2014, The Journal of chemical physics.

[2]  S. Hagen,et al.  Folding Dynamics and Pathways of the Trp-Cage Miniproteins , 2014, Biochemistry.

[3]  Harold W. Hatch,et al.  Computational study of the stability of the miniprotein trp-cage, the GB1 β-hairpin, and the AK16 peptide, under negative pressure. , 2014, The journal of physical chemistry. B.

[4]  Ronald M Levy,et al.  How kinetics within the unfolded state affects protein folding: an analysis based on markov state models and an ultra-long MD trajectory. , 2013, The journal of physical chemistry. B.

[5]  Peter G Bolhuis,et al.  Folding dynamics of the Trp-cage miniprotein: evidence for a native-like intermediate from combined time-resolved vibrational spectroscopy and molecular dynamics simulations. , 2013, The journal of physical chemistry. B.

[6]  Cecilia Clementi,et al.  Rapid exploration of configuration space with diffusion-map-directed molecular dynamics. , 2013, The journal of physical chemistry. B.

[7]  Li Han,et al.  Evaluation of Dimensionality-reduction Methods from Peptide Folding-unfolding Simulations. , 2013, Journal of chemical theory and computation.

[8]  C. Clementi,et al.  Discovering mountain passes via torchlight: methods for the definition of reaction coordinates and pathways in complex macromolecular reactions. , 2013, Annual review of physical chemistry.

[9]  Jiye Shi,et al.  Enhanced sampling molecular dynamics simulation captures experimentally suggested intermediate and unfolded states in the folding pathway of Trp-cage miniprotein. , 2012, The Journal of chemical physics.

[10]  A. Liwo,et al.  Conformational dynamics of the trp-cage miniprotein at its folding temperature. , 2012, The journal of physical chemistry. B.

[11]  R. Best,et al.  Force-field dependence of chignolin folding and misfolding: comparison with experiment and redesign. , 2012, Biophysical journal.

[12]  Rhiju Das,et al.  Are Protein Force Fields Getting Better? A Systematic Benchmark on 524 Diverse NMR Measurements. , 2012, Journal of chemical theory and computation.

[13]  R. Best,et al.  Residue-specific α-helix propensities from molecular simulation. , 2012, Biophysical journal.

[14]  A. Garcia,et al.  Influence of water-protein hydrogen bonding on the stability of Trp-cage miniprotein. A comparison between the TIP3P and TIP4P-Ew water models. , 2011, Physical chemistry chemical physics : PCCP.

[15]  A. Caflisch,et al.  Delineation of folding pathways of a β-sheet miniprotein. , 2011, The journal of physical chemistry. B.

[16]  Petra Rovó,et al.  Cooperativity network of Trp‐cage miniproteins: probing salt‐bridges , 2011, Journal of peptide science : an official publication of the European Peptide Society.

[17]  Michele Parrinello,et al.  Simplifying the representation of complex free-energy landscapes using sketch-map , 2011, Proceedings of the National Academy of Sciences.

[18]  Ioannis G. Kevrekidis,et al.  Nonlinear dimensionality reduction in molecular simulation: The diffusion map approach , 2011 .

[19]  Ioannis G Kevrekidis,et al.  Integrating diffusion maps with umbrella sampling: application to alanine dipeptide. , 2011, The Journal of chemical physics.

[20]  M. Maggioni,et al.  Determination of reaction coordinates via locally scaled diffusion map. , 2011, The Journal of chemical physics.

[21]  Tobin R Sosnick,et al.  The folding of single domain proteins--have we reached a consensus? , 2011, Current opinion in structural biology.

[22]  Michael Andrec,et al.  Kinetic network study of the diversity and temperature dependence of Trp-Cage folding pathways: combining transition path theory with stochastic simulations. , 2011, The journal of physical chemistry. B.

[23]  I. Kevrekidis,et al.  Think Globally, Move Locally: Coarse Graining of Effective Free Energy Surfaces , 2011 .

[24]  Dirk Roose,et al.  Coping with complexity : model reduction and data analysis , 2011 .

[25]  Andrew L. Ferguson,et al.  An experimental and computational investigation of spontaneous lasso formation in microcin J25. , 2010, Biophysical journal.

[26]  R. Best,et al.  Protein simulations with an optimized water model: cooperative helix formation and temperature-induced unfolded state collapse. , 2010, The journal of physical chemistry. B.

[27]  Conrad Sanderson,et al.  Armadillo: An Open Source C++ Linear Algebra Library for Fast Prototyping and Computationally Intensive Experiments , 2010 .

[28]  Ernesto E. Borrero,et al.  Kinetics and mechanism of the unfolding native-to-loop transition of Trp-cage in explicit solvent via optimized forward flux sampling simulations. , 2010, The Journal of chemical physics.

[29]  Andrew L. Ferguson,et al.  Systematic determination of order parameters for chain dynamics using diffusion maps , 2010, Proceedings of the National Academy of Sciences.

[30]  A. Garcia,et al.  Microsecond simulations of the folding/unfolding thermodynamics of the Trp‐cage miniprotein , 2010, Proteins.

[31]  Martin Zacharias,et al.  Folding simulations of Trp‐cage mini protein in explicit solvent using biasing potential replica‐exchange molecular dynamics simulations , 2009, Proteins.

[32]  Ioannis G Kevrekidis,et al.  Exploration of effective potential landscapes using coarse reverse integration. , 2008, The Journal of chemical physics.

[33]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[34]  A. Garcia,et al.  Computing the stability diagram of the Trp-cage miniprotein , 2008, Proceedings of the National Academy of Sciences.

[35]  Ronald R. Coifman,et al.  Graph Laplacian Tomography From Unknown Random Projections , 2008, IEEE Transactions on Image Processing.

[36]  Xu Zhang,et al.  Dynamics and cooperativity of Trp-cage folding. , 2008, Archives of biochemistry and biophysics.

[37]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[38]  Berk Hess,et al.  P-LINCS:  A Parallel Linear Constraint Solver for Molecular Simulation. , 2008, Journal of chemical theory and computation.

[39]  K. H. Mok,et al.  A pre-existing hydrophobic collapse in the unfolded state of an ultrafast folding protein , 2007, Nature.

[40]  P. Bolhuis,et al.  Sampling the multiple folding mechanisms of Trp-cage in explicit solvent , 2006, Proceedings of the National Academy of Sciences.

[41]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[42]  C. Vega,et al.  A general purpose model for the condensed phases of water: TIP4P/2005. , 2005, The Journal of chemical physics.

[43]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[44]  Sören Doose,et al.  A microscopic view of miniprotein folding: enhanced folding efficiency through formation of an intermediate. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  S. Asher,et al.  UV-resonance raman thermal unfolding study of Trp-cage shows that it is not a simple two-state miniprotein. , 2005, Journal of the American Chemical Society.

[46]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[47]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[48]  P. R. ten Wolde,et al.  Sampling rare switching events in biochemical networks. , 2004, Physical review letters.

[49]  Y. Duan,et al.  Characterizing the rate-limiting step of Trp-cage folding by all-atom molecular dynamics simulations , 2004 .

[50]  R. Zhou Trp-cage: Folding free energy landscape in explicit water , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[52]  William Swope,et al.  Understanding folding and design: Replica-exchange simulations of ``Trp-cage'' miniproteins , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[54]  Claudio Soto,et al.  Unfolding the role of protein misfolding in neurodegenerative diseases , 2003, Nature Reviews Neuroscience.

[55]  V. Pande,et al.  The Trp cage: folding kinetics and unfolded state topology via molecular dynamics simulations. , 2002, Journal of the American Chemical Society.

[56]  A. Roitberg,et al.  Smaller and faster: the 20-residue Trp-cage protein folds in 4 micros. , 2002, Journal of the American Chemical Society.

[57]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[58]  J. W. Neidigh,et al.  Designing a 20-residue protein , 2002, Nature Structural Biology.

[59]  Berk Hess,et al.  GROMACS 3.0: a package for molecular simulation and trajectory analysis , 2001 .

[60]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[61]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[62]  A. Voter,et al.  Temperature-accelerated dynamics for simulation of infrequent events , 2000 .

[63]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[64]  C. Dellago,et al.  Transition path sampling and the calculation of rate constants , 1998 .

[65]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[66]  A. Fersht Nucleation mechanisms in protein folding. , 1997, Current opinion in structural biology.

[67]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[68]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[69]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[70]  E I Shakhnovich,et al.  Specific nucleus as the transition state for protein folding: evidence from the lattice model. , 1994, Biochemistry.

[71]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[72]  D. E. Anderson,et al.  pH-induced denaturation of proteins: a single salt bridge contributes 3-5 kcal/mol to the free energy of folding of T4 lysozyme. , 1990, Biochemistry.

[73]  C. Brooks Computer simulation of liquids , 1989 .

[74]  Hoover,et al.  Canonical dynamics: Equilibrium phase-space distributions. , 1985, Physical review. A, General physics.

[75]  S. Nosé A unified formulation of the constant temperature molecular dynamics methods , 1984 .

[76]  S. Nosé,et al.  Constant pressure molecular dynamics for molecular systems , 1983 .

[77]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[78]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[79]  G. Torrie,et al.  Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling , 1977 .

[80]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[81]  M. Karplus,et al.  Protein-folding dynamics , 1976, Nature.