Identifying Metastable States of Folding Proteins.

Recent molecular dynamics simulations of biopolymers have shown that in many cases the global features of the free energy landscape can be characterized in terms of the metastable conformational states of the system. To identify these states, a conceptionally and computationally simple approach is proposed. It consists of (i) an initial preprocessing via principal component analysis to reduce the dimensionality of the data, followed by k-means clustering to generate up to 10(4) microstates, (ii) the most probable path algorithm to identify the metastable states of the system, and (iii) boundary corrections of these states via the introduction of cluster cores in order to obtain the correct dynamics. By adopting two well-studied model problems, hepta-alanine and the villin headpiece protein, the potential and the performance of the approach are demonstrated.

[1]  Vijay S Pande,et al.  Progress and challenges in the automated construction of Markov state models for full protein systems. , 2009, The Journal of chemical physics.

[2]  P. Nguyen,et al.  Complexity of free energy landscapes of peptides revealed by nonlinear principal component analysis , 2006, Proteins.

[3]  Gerhard Stock,et al.  How complex is the dynamics of Peptide folding? , 2007, Physical review letters.

[4]  Amedeo Caflisch,et al.  One-dimensional barrier-preserving free-energy projections of a beta-sheet miniprotein: new insights into the folding process. , 2008, The journal of physical chemistry. B.

[5]  R. Elber,et al.  Computing time scales from reaction coordinates by milestoning. , 2004, The Journal of chemical physics.

[6]  Gerhard Stock,et al.  Construction of the free energy landscape of biomolecules via dihedral angle principal component analysis. , 2008, The Journal of chemical physics.

[7]  H. Berendsen,et al.  Essential dynamics of proteins , 1993, Proteins.

[8]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[9]  F. Rao,et al.  The protein folding network. , 2004, Journal of molecular biology.

[10]  J. Hofrichter,et al.  Experimental tests of villin subdomain folding simulations. , 2003, Journal of molecular biology.

[11]  R. Hegger,et al.  Dihedral angle principal component analysis of molecular dynamics simulations. , 2007, The Journal of chemical physics.

[12]  G. Hummer,et al.  Coarse master equations for peptide folding dynamics. , 2008, The journal of physical chemistry. B.

[13]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[14]  Klaus Schulten,et al.  Going beyond Clustering in MD Trajectory Analysis: An Application to Villin Headpiece Folding , 2010, PloS one.

[15]  Vijay S Pande,et al.  Protein folded states are kinetic hubs , 2010, Proceedings of the National Academy of Sciences.

[16]  P. Deuflhard,et al.  A Direct Approach to Conformational Dynamics Based on Hybrid Monte Carlo , 1999 .

[17]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[18]  Frank Noé,et al.  Markov state models based on milestoning. , 2011, The Journal of chemical physics.

[19]  K. Dill,et al.  Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. , 2007, The Journal of chemical physics.

[20]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[21]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[22]  P. Nguyen,et al.  Energy landscape of a small peptide revealed by dihedral angle principal component analysis , 2004, Proteins.

[23]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[24]  P. Henklein,et al.  An unlocking/relocking barrier in conformational fluctuations of villin headpiece subdomain , 2010, Proceedings of the National Academy of Sciences.

[25]  Ariel Fernández,et al.  Large-scale context in protein folding: villin headpiece. , 2003, Biochemistry.

[26]  C. Schütte,et al.  Supplementary Information for “ Constructing the Equilibrium Ensemble of Folding Pathways from Short Off-Equilibrium Simulations ” , 2009 .

[27]  V. Pande,et al.  Heterogeneity even at the speed limit of folding: large-scale molecular dynamics study of a fast-folding variant of the villin headpiece. , 2007, Journal of molecular biology.

[28]  Oliver F. Lange,et al.  Full correlation analysis of conformational protein dynamics , 2007, Proteins.

[29]  Shai Carmi,et al.  Partition of networks into basins of attraction. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Kyle A. Beauchamp,et al.  Quantitative comparison of villin headpiece subdomain simulations and triplet–triplet energy transfer experiments , 2011, Proceedings of the National Academy of Sciences.

[31]  Gerhard Stock,et al.  Free-energy landscape of RNA hairpins constructed via dihedral angle principal component analysis. , 2009, The journal of physical chemistry. B.

[32]  Jeremy C. Smith,et al.  Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. , 2007, The Journal of chemical physics.

[33]  Francesco Rao,et al.  Protein dynamics investigated by inherent structure analysis , 2010, Proceedings of the National Academy of Sciences.

[34]  J. Onuchic,et al.  Theory of protein folding: the energy landscape perspective. , 1997, Annual review of physical chemistry.

[35]  F. Stillinger,et al.  Packing Structures and Transitions in Liquids and Solids , 1984, Science.

[36]  Francesco Rao,et al.  Local Transition Gradients Indicating the Global Attributes of Protein Energy Landscapes , 2010 .

[37]  Bettina Keller,et al.  An Analysis of the Validity of Markov State Models for Emulating the Dynamics of Classical Molecular Systems and Ensembles. , 2011, Journal of chemical theory and computation.

[38]  Frank Noé,et al.  Markov models of molecular kinetics: generation and validation. , 2011, The Journal of chemical physics.

[39]  Y. Duan,et al.  Folding free-energy landscape of villin headpiece subdomain from molecular dynamics simulations , 2007, Proceedings of the National Academy of Sciences.

[40]  William Swope,et al.  Describing Protein Folding Kinetics by Molecular Dynamics Simulations. 1. Theory , 2004 .

[41]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[42]  Gerhard Stock,et al.  Hidden Complexity of Protein Free-Energy Landscapes Revealed by Principal Component Analysis by Parts , 2010 .

[43]  Wilfred F van Gunsteren,et al.  Comparing geometric and kinetic cluster algorithms for molecular simulation data. , 2010, The Journal of chemical physics.

[44]  V. Pande,et al.  Absolute comparison of simulated and experimental protein-folding dynamics , 2002, Nature.

[45]  Alessandro Laio,et al.  Which similarity measure is better for analyzing protein structures in a molecular dynamics trajectory? , 2011, Physical chemistry chemical physics : PCCP.

[46]  H. Scheraga,et al.  Global optimization of clusters, crystals, and biomolecules. , 1999, Science.

[47]  H. Schwalbe,et al.  Structure and dynamics of the homologous series of alanine peptides: a joint molecular dynamics/NMR study. , 2007, Journal of the American Chemical Society.

[48]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.