Using Local States To Drive the Sampling of Global Conformations in Proteins

Conformational changes associated with protein function often occur beyond the time scale currently accessible to unbiased molecular dynamics (MD) simulations, so that different approaches have been developed to accelerate their sampling. Here we investigate how the knowledge of backbone conformations preferentially adopted by protein fragments, as contained in precalculated libraries known as structural alphabets (SA), can be used to explore the landscape of protein conformations in MD simulations. We find that (a) enhancing the sampling of native local states in both metadynamics and steered MD simulations allows the recovery of global folded states in small proteins; (b) folded states can still be recovered when the amount of information on the native local states is reduced by using a low-resolution version of the SA, where states are clustered into macrostates; and (c) sequences of SA states derived from collections of structural motifs can be used to sample alternative conformations of preselected protein regions. The present findings have potential impact on several applications, ranging from protein model refinement to protein folding and design.

[1]  P. Pascutti,et al.  Free Energy Profiles along Consensus Normal Modes Provide Insight into HIV-1 Protease Flap Opening. , 2011, Journal of chemical theory and computation.

[2]  M. Vendruscolo,et al.  Statistical mechanics of the denatured state of a protein using replica-averaged metadynamics. , 2014, Journal of the American Chemical Society.

[3]  Robert B Best,et al.  Microscopic events in β-hairpin folding from alternative unfolded ensembles , 2011, Proceedings of the National Academy of Sciences.

[4]  Alessandro Pandini,et al.  MinSet: a general approach to derive maximally representative database subsets by using fragment dictionaries and its application to the SCOP database , 2007, Bioinform..

[5]  Alessandro Pandini,et al.  GSATools: analysis of allosteric communication and functional local motions using a structural alphabet , 2013, Bioinform..

[6]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[7]  N. Greenfield Using circular dichroism spectra to estimate protein secondary structure , 2007, Nature Protocols.

[8]  P. Bolhuis Kinetic pathways of beta-hairpin (un)folding in explicit solvent. , 2005, Biophysical journal.

[9]  P. Deschavanne,et al.  Enhanced protein fold recognition using a structural alphabet , 2009, Proteins.

[10]  A. Laio,et al.  The inverted free energy landscape of an intrinsically disordered peptide by simulations and experiments , 2015, Scientific Reports.

[11]  Bohdan Schneider,et al.  Protein flexibility in the light of structural alphabets , 2015, Front. Mol. Biosci..

[12]  Zhiyong Wang,et al.  Protein 8-class secondary structure prediction using Conditional Neural Fields , 2010, 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[13]  P Bork,et al.  The immunoglobulin fold. Structural classification, sequence patterns and common core. , 1994, Journal of molecular biology.

[14]  R. Best,et al.  Dependence of Internal Friction on Folding Mechanism , 2015, Journal of the American Chemical Society.

[15]  Gianluca Pollastri,et al.  Structural alphabets for protein structure classification: a comparison study. , 2009, Journal of molecular biology.

[16]  A. Fornili,et al.  In silico phosphorylation of the autoinhibited form of p47(phox): insights into the mechanism of activation. , 2010, Biophysical journal.

[17]  J. Changeux 50 years of allosteric interactions: the twists and turns of the models , 2013, Nature Reviews Molecular Cell Biology.

[18]  A Maritan,et al.  Recurrent oligomers in proteins: An optimal scheme reconciling accurate and concise backbone representations in automated folding and design studies , 2000, Proteins.

[19]  I. Pivkin,et al.  A kMC-MD method with generalized move-sets for the simulation of folding of α-helical and β-stranded peptides. , 2015, The Journal of chemical physics.

[20]  K. Dill,et al.  Assessment of the protein‐structure refinement category in CASP8 , 2009, Proteins.

[21]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[22]  Hui Lu,et al.  Specialized Dynamical Properties of Promiscuous Residues Revealed by Simulated Conformational Ensembles , 2013, Journal of chemical theory and computation.

[23]  Yang Zhang,et al.  Atomic-level protein structure refinement using fragment-guided molecular dynamics conformation sampling. , 2011, Structure.

[24]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Jane Clarke,et al.  Take home lessons from studies of related proteins , 2013, Current opinion in structural biology.

[26]  Pierrick Craveur,et al.  PredyFlexy: flexibility and local structure prediction from sequence , 2012, Nucleic Acids Res..

[27]  J. Onuchic,et al.  Folding a protein in a computer: An atomic description of the folding/unfolding of protein A , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[28]  P. Bolhuis Kinetic Pathways of β-Hairpin (Un)folding in Explicit Solvent , 2005 .

[29]  Lars Konermann,et al.  Hydrogen exchange mass spectrometry for studying protein structure and dynamics. , 2011, Chemical Society reviews.

[30]  V Muñoz,et al.  Folding dynamics and mechanism of beta-hairpin formation. , 1997, Nature.

[31]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[32]  A. Fornili,et al.  Energy landscapes associated with macromolecular conformational changes from endpoint structures. , 2010, Journal of the American Chemical Society.

[33]  Francesco Luigi Gervasio,et al.  Effects of oncogenic mutations on the conformational free-energy landscape of EGFR kinase , 2013, Proceedings of the National Academy of Sciences.

[34]  Amedeo Caflisch,et al.  New insights into the folding of a β-sheet miniprotein in a reduced space of collective hydrogen bond variables: application to a hydrodynamic analysis of the folding flow. , 2013, The journal of physical chemistry. B.

[35]  Francesco Luigi Gervasio,et al.  From A to B in free energy space. , 2007, The Journal of chemical physics.

[36]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[37]  R. Nussinov,et al.  The origin of allosteric functional modulation: multiple pre-existing pathways. , 2009, Structure.

[38]  V. Pande,et al.  The Trp cage: folding kinetics and unfolded state topology via molecular dynamics simulations. , 2002, Journal of the American Chemical Society.

[39]  Alessandro Pandini,et al.  Structural alphabets derived from attractors in conformational space , 2010, BMC Bioinformatics.

[40]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[41]  Alexandre G. de Brevern,et al.  Improving protein fold recognition with hybrid profiles combining sequence and structure evolution , 2015, Bioinform..

[42]  A. Valencia,et al.  Emerging methods in protein co-evolution , 2013, Nature Reviews Genetics.

[43]  P. Kollman,et al.  Settle: An analytical version of the SHAKE and RATTLE algorithm for rigid water models , 1992 .

[44]  Pierre Tufféry,et al.  SA-Search: a web tool for protein structure mining based on a Structural Alphabet , 2004, Nucleic Acids Res..

[45]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[46]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[47]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[48]  F. Noé,et al.  Transition networks for modeling the kinetics of conformational change in macromolecules. , 2008, Current opinion in structural biology.

[49]  A. Laio,et al.  A bias-exchange approach to protein folding. , 2007, The journal of physical chemistry. B.

[50]  Julien Rey,et al.  De novo peptide structure prediction: an overview. , 2015, Methods in molecular biology.

[51]  A C Camproux,et al.  A hidden markov model derived structural alphabet for proteins. , 2004, Journal of molecular biology.

[52]  A. Roitberg,et al.  Smaller and faster: the 20-residue Trp-cage protein folds in 4 micros. , 2002, Journal of the American Chemical Society.

[53]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[54]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[55]  Leo S. D. Caves,et al.  Bio3d: An R Package , 2022 .

[56]  A. Caflisch,et al.  Evolutionary conserved Tyr169 stabilizes the β2-α2 loop of the prion protein. , 2015, Journal of the American Chemical Society.

[57]  Anne-Claude Camproux,et al.  SA-Mot: a web server for the identification of motifs of interest extracted from protein loops , 2011, Nucleic Acids Res..

[58]  A. Pandini,et al.  Detection of allosteric signal transmission by information-theoretic analysis of protein dynamics , 2012, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[59]  A. Laio,et al.  Free-energy landscape for beta hairpin folding from combined parallel tempering and metadynamics. , 2006, Journal of the American Chemical Society.

[60]  Ruth Nussinov,et al.  Protein dynamics and conformational selection in bidirectional signal transduction , 2012, BMC Biology.

[61]  Massimiliano Bonomi,et al.  PLUMED: A portable plugin for free-energy calculations with molecular dynamics , 2009, Comput. Phys. Commun..

[62]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.

[63]  V. Muñoz,et al.  Folding dynamics and mechanism of β-hairpin formation , 1997, Nature.

[64]  Alessandro Laio,et al.  A Collective Variable for the Efficient Exploration of Protein Beta-Sheet Structures: Application to SH3 and GB1. , 2009, Journal of chemical theory and computation.

[65]  A. Laio,et al.  Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science , 2008 .

[66]  Vahid Mirjalili,et al.  Physics‐based protein structure refinement through multiple molecular dynamics trajectories and structure averaging , 2014, Proteins.

[67]  Baldomero Oliva,et al.  ArchDB 2014: structural classification of loops in proteins , 2013, Nucleic Acids Res..

[68]  Vahid Mirjalili,et al.  Protein structure refinement via molecular‐dynamics simulations: What works and what does not? , 2016, Proteins.

[69]  A. D. de Brevern,et al.  From local structure to a global framework: recognition of protein folds , 2014, Journal of The Royal Society Interface.

[70]  Christoph F. Schmidt,et al.  Moving into the cell: single-molecule studies of molecular motors in complex environments , 2011, Nature Reviews Molecular Cell Biology.

[71]  X. Daura,et al.  Peptide Folding: When Simulation Meets Experiment , 1999 .

[72]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[73]  Robin S. Dothager,et al.  Random-coil behavior and the dimensions of chemically unfolded proteins. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[74]  C. Etchebest,et al.  Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks , 2000, Proteins.

[75]  Alessandro Laio,et al.  A Kinetic Model of Trp-Cage Folding from Multiple Biased Molecular Dynamics Simulations , 2009, PLoS Comput. Biol..

[76]  M. Tyagi,et al.  Local Protein Structures , 2007 .

[77]  Raffaello Potestio,et al.  Knotted vs. Unknotted Proteins: Evidence of Knot-Promoting Loops , 2010, PLoS Comput. Biol..

[78]  K. Schulten,et al.  Steered molecular dynamics and mechanical functions of proteins. , 2001, Current opinion in structural biology.

[79]  Daniel W. A. Buchan,et al.  Scalable web services for the PSIPRED Protein Analysis Workbench , 2013, Nucleic Acids Res..

[80]  Christian Cole,et al.  JPred4: a protein secondary structure prediction server , 2015, Nucleic Acids Res..

[81]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[82]  Jean-Christophe Gelly,et al.  mulPBA: an efficient multiple protein structure alignment method based on a structural alphabet , 2014, Journal of biomolecular structure & dynamics.

[83]  Adrian E Roitberg,et al.  Smaller and faster: the 20-residue Trp-cage protein folds in 4 micros. , 2002, Journal of the American Chemical Society.

[84]  Alessandro Laio,et al.  Exploring the Universe of Protein Structures beyond the Protein Data Bank , 2010, PLoS Comput. Biol..