Assessment and optimization of collective variables for protein conformational landscape: GB1 β-hairpin as a case study.

Collective variables (CVs), when chosen judiciously, can play an important role in recognizing rate-limiting processes and rare events in any biomolecular systems. However, high dimensionality and inherent complexities associated with such biochemical systems render the identification of an optimal CV a challenging task, which in turn precludes the elucidation of an underlying conformational landscape in sufficient details. In this context, a relevant model system is presented by a 16-residue β-hairpin of GB1 protein. Despite being the target of numerous theoretical and computational studies for understanding the protein folding, the set of CVs optimally characterizing the conformational landscape of the β-hairpin of GB1 protein has remained elusive, resulting in a lack of consensus on its folding mechanism. Here we address this by proposing a pair of optimal CVs which can resolve the underlying free energy landscape of the GB1 hairpin quite efficiently. Expressed as a linear combination of a number of traditional CVs, the optimal CV for this system is derived by employing the recently introduced time-structured independent component analysis approach on a large number of independent unbiased simulations. By projecting the replica-exchange simulated trajectories along these pair of optimized CVs, the resulting free energy landscape of this system is able to resolve four distinct well-separated metastable states encompassing the extensive ensembles of folded, unfolded, and molten globule states. Importantly, the optimized CVs were found to be capable of automatically recovering a novel partial helical state of this protein, without needing to explicitly invoke helicity as a constituent CV. Furthermore, a quantitative sensitivity analysis of each constituent in the optimized CV provided key insights on the relative contributions of the constituent CVs in the overall free energy landscapes. Finally, the kinetic pathways connecting these metastable states, constructed using a Markov state model, provide an optimum description of the underlying folding mechanism of the peptide. Taken together, this work offers a quantitatively robust approach toward comprehensive mapping of the underlying folding landscape of a quintessential model system along its optimized CV.

[1]  Frank Noé,et al.  PyEMMA 2: A Software Package for Estimation, Validation, and Analysis of Markov Models. , 2015, Journal of chemical theory and computation.

[2]  Andrew L. Ferguson,et al.  Nonlinear reconstruction of single-molecule free-energy surfaces from univariate time series. , 2016, Physical review. E.

[3]  Hao Wu,et al.  VAMPnets for deep learning of molecular kinetics , 2017, Nature Communications.

[4]  Anthony K. Felts,et al.  Distinguishing among structural ensembles of the GB1 peptide: REMD simulations and NMR experiments. , 2007, Journal of the American Chemical Society.

[5]  Robert B Best,et al.  Microscopic events in β-hairpin folding from alternative unfolded ensembles , 2011, Proceedings of the National Academy of Sciences.

[6]  Mark E Tuckerman,et al.  Stochastic Neural Network Approach for Learning High-Dimensional Free Energy Surfaces. , 2017, Physical review letters.

[7]  P. Bolhuis Kinetic Pathways of β-Hairpin (Un)folding in Explicit Solvent , 2005 .

[8]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[9]  G. Hummer,et al.  Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. , 2009, The journal of physical chemistry. B.

[10]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[11]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[12]  Frank Noé,et al.  Variational Approach to Molecular Kinetics. , 2014, Journal of chemical theory and computation.

[13]  Michele Parrinello,et al.  Probing the Unfolded Configurations of a β-Hairpin Using Sketch-Map. , 2015, Journal of chemical theory and computation.

[14]  C. Schütte,et al.  Supplementary Information for “ Constructing the Equilibrium Ensemble of Folding Pathways from Short Off-Equilibrium Simulations ” , 2009 .

[15]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[16]  W. E,et al.  Towards a Theory of Transition Paths , 2006 .

[17]  R. Best,et al.  Free‐energy landscape of the GB1 hairpin in all‐atom explicit solvent simulations with different force fields: Similarities and differences , 2011, Proteins.

[18]  R. McGibbon,et al.  Variational cross-validation of slow dynamical modes in molecular kinetics. , 2014, The Journal of chemical physics.

[19]  Peter G Bolhuis,et al.  Effects of a mutation on the folding mechanism of a beta-hairpin. , 2009, The journal of physical chemistry. B.

[20]  R. Best,et al.  Folding Kinetics and Unfolded State Dynamics of the GB1 Hairpin from Molecular Simulation. , 2013, Journal of chemical theory and computation.

[21]  G. Saladino,et al.  Metadynamics study of a β-hairpin stability in mixed solvents. , 2011, Journal of the American Chemical Society.

[22]  A. Laio,et al.  Free-energy landscape for beta hairpin folding from combined parallel tempering and metadynamics. , 2006, Journal of the American Chemical Society.

[23]  Hongbin Wan,et al.  A Maximum-Caliber Approach to Predicting Perturbed Folding Kinetics Due to Mutations. , 2016, Journal of chemical theory and computation.

[24]  B. Berne,et al.  The free energy landscape for β hairpin folding in explicit water , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Michele Parrinello,et al.  Refining Collective Coordinates and Improving Free Energy Representation in Variational Enhanced Sampling. , 2018, Journal of chemical theory and computation.

[26]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[27]  J. Gumbart,et al.  Folding free energy landscapes of β-sheets with non-polarizable and polarizable CHARMM force fields. , 2018, The Journal of chemical physics.

[28]  Michele Parrinello,et al.  A variational conformational dynamics approach to the selection of collective variables in metadynamics. , 2017, The Journal of chemical physics.

[29]  J. P. Grossman,et al.  Biomolecular simulation: a computational microscope for molecular biology. , 2012, Annual review of biophysics.

[30]  A. Irbäck,et al.  Markov modeling of peptide folding in the presence of protein crowders. , 2018, The Journal of chemical physics.

[31]  Marcus Weber,et al.  Fuzzy spectral clustering by PCCA+: application to Markov state models and data classification , 2013, Advances in Data Analysis and Classification.

[32]  V. Muñoz,et al.  Folding dynamics and mechanism of β-hairpin formation , 1997, Nature.

[33]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[34]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[35]  Vincent A. Voelz,et al.  Kinetic network models of tryptophan mutations in β-hairpins reveal the importance of non-native interactions. , 2015, Journal of chemical theory and computation.

[36]  Eric Vanden-Eijnden,et al.  Transition Path Theory for Markov Jump Processes , 2009, Multiscale Model. Simul..

[37]  Ken A Dill,et al.  Use of the Weighted Histogram Analysis Method for the Analysis of Simulated and Parallel Tempering Simulations. , 2007, Journal of chemical theory and computation.

[38]  A. Gronenborn,et al.  A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. , 1993, Science.

[39]  P. Deuflhard,et al.  Robust Perron cluster analysis in conformation dynamics , 2005 .

[40]  Vijay S Pande,et al.  Identification of simple reaction coordinates from complex dynamics. , 2016, The Journal of chemical physics.

[41]  Matthews Cr PATHWAYS OF PROTEIN FOLDING , 1993 .

[42]  Y. Sugita,et al.  Replica-exchange molecular dynamics method for protein folding , 1999 .

[43]  Toni Giorgino,et al.  Identification of slow molecular order parameters for Markov model construction. , 2013, The Journal of chemical physics.

[44]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[45]  Mohammad M. Sultan,et al.  MSMBuilder: Statistical Models for Biomolecular Dynamics , 2016, bioRxiv.

[46]  B. Berne,et al.  Spectral gap optimization of order parameters for sampling complex molecular systems , 2015, Proceedings of the National Academy of Sciences.

[47]  Massimiliano Bonomi,et al.  PLUMED: A portable plugin for free-energy calculations with molecular dynamics , 2009, Comput. Phys. Commun..

[48]  G. Torrie,et al.  Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling , 1977 .

[49]  R. Fesinmeyer,et al.  Enhanced hairpin stability through loop design: the case of the protein G B1 domain hairpin. , 2004, Journal of the American Chemical Society.

[50]  M. Parrinello,et al.  Well-tempered metadynamics: a smoothly converging and tunable free-energy method. , 2008, Physical review letters.

[51]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[52]  R. Levy,et al.  Protein folding pathways from replica exchange simulations and a kinetic network model. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Pratyush Tiwary,et al.  Reweighted autoencoded variational Bayes for enhanced sampling (RAVE). , 2018, The Journal of chemical physics.

[54]  Vijay S Pande,et al.  Improvements in Markov State Model Construction Reveal Many Non-Native Interactions in the Folding of NTL9. , 2013, Journal of chemical theory and computation.

[55]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[56]  Michele Parrinello,et al.  Effect of urea on the β-hairpin conformational ensemble and protein denaturation mechanism. , 2011, Journal of the American Chemical Society.

[57]  Vijay S Pande,et al.  tICA-Metadynamics: Accelerating Metadynamics by Using Kinetically Selected Collective Variables. , 2017, Journal of chemical theory and computation.

[58]  A. Bax,et al.  SPARTA+: a modest improvement in empirical NMR chemical shift prediction by means of an artificial neural network , 2010, Journal of biomolecular NMR.

[59]  Michele Parrinello,et al.  Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint. , 2016, Annual review of physical chemistry.