From coarse‐grain to all‐atom: Toward multiscale analysis of protein landscapes

Multiscale methods are becoming increasingly promising as a way to characterize the dynamics of large protein systems on biologically relevant time‐scales. The underlying assumption in multiscale simulations is that it is possible to move reliably between different resolutions. We present a method that efficiently generates realistic all‐atom protein structures starting from the Cα atom positions, as obtained for instance from extensive coarse‐grain simulations. The method, a reconstruction algorithm for coarse‐grain structures (RACOGS), is validated by reconstructing ensembles of coarse‐grain structures obtained during folding simulations of the proteins src‐SH3 and S6. The results show that RACOGS consistently produces low energy, all‐atom structures. A comparison of the free energy landscapes calculated using the coarse‐grain structures versus the all‐atom structures shows good correspondence and little distortion in the protein folding landscape. Proteins 2007. © 2007 Wiley‐Liss, Inc.

[1]  Z. Xiang,et al.  Extending the accuracy limits of prediction for side-chain conformations. , 2001, Journal of molecular biology.

[2]  P. Payne,et al.  Reconstruction of protein conformations from estimated positions of the Cα coordinates , 1993, Protein science : a publication of the Protein Society.

[3]  D. Case,et al.  Exploring protein native states and large‐scale conformational changes with a modified generalized born model , 2004, Proteins.

[4]  R. Bruccoleri,et al.  Application of a directed conformational search for generating 3‐D coordinates for protein structures from α‐carbon coordinates , 1992, Proteins.

[5]  Peter G Wolynes,et al.  P versus Q: structural reaction coordinates capture protein folding on smooth landscapes. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[6]  S. Subbiah,et al.  Prediction of protein side-chain conformation by packing optimization. , 1991, Journal of molecular biology.

[7]  Luis Serrano,et al.  A thermodynamic and kinetic analysis of the folding pathway of an SH3 domain entropically stabilised by a redesigned hydrophobic core. , 2003, Journal of molecular biology.

[8]  Luis Serrano,et al.  Unspecific hydrophobic stabilization of folding transition states , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[10]  Mariusz Milik,et al.  Algorithm for rapid reconstruction of protein backbone from alpha carbon coordinates , 1997 .

[11]  David Baker,et al.  Experiment and theory highlight role of native state topology in SH3 folding , 1999, Nature Structural Biology.

[12]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[13]  L. Serrano,et al.  Bergerac-SH3: "frustation" induced by stabilizing the folding nucleus. , 2001, Journal of molecular biology.

[14]  M Feig,et al.  Accurate reconstruction of all‐atom protein representations from side‐chain‐based low‐resolution models , 2000, Proteins.

[15]  Mona Singh,et al.  Solving and analyzing side-chain positioning problems using linear and integer programming , 2005, Bioinform..

[16]  Tatsuya Akutsu,et al.  Protein Side-chain Packing Problem: A Maximum Edge-weight Clique Algorithmic Approach , 2005, APBC.

[17]  Arieh Warshel,et al.  Using simplified protein representation as a reference potential for all-atom calculations of folding free energy , 1999 .

[18]  J. Mendes,et al.  Improved modeling of side‐chains in proteins with rotamer‐based methods: A flexible rotamer model , 1999, Proteins.

[19]  Junmei Wang,et al.  How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? , 2000, J. Comput. Chem..

[20]  J. Williamson,et al.  Structure of the S15,S6,S18-rRNA complex: assembly of the 30S ribosome central domain. , 2000, Science.

[21]  C. Micheletti,et al.  Study of the Villin headpiece folding dynamics by combining coarse‐grained Monte Carlo evolution and all‐atom molecular dynamics , 2004 .

[22]  Wilfred F van Gunsteren,et al.  Multigraining: an algorithm for simultaneous fine-grained and coarse-grained simulation of molecular systems. , 2006, The Journal of chemical physics.

[23]  R Rein,et al.  Finding the global minimum: a fuzzy end elimination implementation. , 1995, Protein engineering.

[24]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[25]  R. Swendsen,et al.  THE weighted histogram analysis method for free‐energy calculations on biomolecules. I. The method , 1992 .

[26]  A. Liljas,et al.  Crystal structure of the ribosomal protein S6 from Thermus thermophilus. , 1994, The EMBO journal.

[27]  Mona Singh,et al.  A Semidefinite Programming Approach to Side Chain Positioning with New Rounding Strategies , 2004, INFORMS J. Comput..

[28]  N. Grishin,et al.  Side‐chain modeling with an optimized scoring function , 2002, Protein science : a publication of the Protein Society.

[29]  I. Lasters,et al.  Fast and accurate side‐chain topology and energy refinement (FASTER) as a new method for protein structure optimization , 2002, Proteins.

[30]  H. Stanley,et al.  Molecular dynamics simulation of the SH3 domain aggregation suggests a generic amyloidogenesis mechanism. , 2002, Journal of molecular biology.

[31]  Jeffrey Skolnick,et al.  Efficient algorithm for the reconstruction of a protein backbone from the α‐carbon coordinates , 1992 .

[32]  K. Kremer,et al.  Adaptive resolution molecular-dynamics simulation: changing the degrees of freedom on the fly. , 2005, The Journal of chemical physics.

[33]  Juan C. Meza,et al.  OPT++: An object-oriented class library for nonlinear optimization , 1994 .

[34]  Roland L. Dunbrack Rotamer libraries in the 21st century. , 2002, Current opinion in structural biology.

[35]  L L Looger,et al.  Generalized dead-end elimination algorithms make large-scale protein side-chain structure prediction tractable: implications for protein design and structural genomics. , 2001, Journal of molecular biology.

[36]  D. Otzen,et al.  Designed protein tetramer zipped together with a hydrophobic Alzheimer homology: a structural clue to amyloid assembly. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Lydia E Kavraki,et al.  Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction , 2006, Proc. Natl. Acad. Sci. USA.

[38]  Qiang Shi,et al.  Mixed atomistic and coarse-grained molecular dynamics: simulation of a membrane-bound ion channel. , 2006, The journal of physical chemistry. B.

[39]  R. Goldstein Efficient rotamer elimination applied to protein side-chains and related spin glasses. , 1994, Biophysical journal.

[40]  Jörg Gsponer,et al.  Molecular dynamics simulations of protein folding from the transition state , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Olivier Gascuel,et al.  Proceedings of the First International Workshop on Algorithms in Bioinformatics , 2001 .

[42]  KumarShankar,et al.  The weighted histogram analysis method for free-energy calculations on biomolecules. I , 1992 .

[43]  D B Gordon,et al.  Branch-and-terminate: a combinatorial optimization algorithm for protein design. , 1999, Structure.

[44]  Tony Pawson,et al.  Protein modules and signalling networks , 1995, Nature.

[45]  J Moult,et al.  Genetic algorithms for protein structure prediction. , 1996, Current opinion in structural biology.

[46]  E. Alm,et al.  Prediction of protein-folding mechanisms from free-energy landscapes derived from native structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[48]  J. Onuchic,et al.  Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? An investigation for small globular proteins. , 2000, Journal of molecular biology.

[49]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[50]  Cecilia Clementi,et al.  Optimal combination of theory and experiment for the characterization of the protein folding landscape of S6: how far can a minimalist model go? , 2004, Journal of molecular biology.

[51]  Scott Brown,et al.  Influence of denatured and intermediate states of folding on protein aggregation , 2005, Protein science : a publication of the Protein Society.

[52]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[53]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[54]  Stephen L. Mayo,et al.  Conformational splitting: A more powerful criterion for dead-end elimination , 2000, J. Comput. Chem..

[55]  P Argos,et al.  A method to configure protein side-chains from the main-chain trace in homology modelling. , 1993, Journal of molecular biology.

[56]  A R Leach,et al.  Exploring the conformational space of protein side chains using dead‐end elimination and the A* algorithm , 1998, Proteins.

[57]  Mariusz Milik,et al.  Algorithm for rapid reconstruction of protein backbone from alpha carbon coordinates , 1997, J. Comput. Chem..

[58]  Cecilia Clementi,et al.  Minimalist protein model as a diagnostic tool for misfolding and aggregation. , 2006, Journal of molecular biology.

[59]  S. Rackovsky,et al.  Calculation of protein backbone geometry from α‐carbon coordinates based on peptide‐group dipole alignment , 1993, Protein science : a publication of the Protein Society.

[60]  A. Finkelstein,et al.  A theoretical search for folding/unfolding nuclei in three-dimensional protein structures. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[61]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[62]  R Samudrala,et al.  A graph-theoretic algorithm for comparative modeling of protein structure. , 1998, Journal of molecular biology.

[63]  M. Levitt,et al.  Accuracy of side‐chain prediction upon near‐native protein backbones generated by ab initio folding methods , 1998, Proteins.

[64]  Edward Lyman,et al.  Resolution exchange simulation. , 2005, Physical review letters.

[65]  Y Wang,et al.  A new procedure for constructing peptides into a given Calpha chain. , 1998, Folding & design.

[66]  Arne Elofsson,et al.  All are not equal: A benchmark of different homology modeling programs , 2005, Protein science : a publication of the Protein Society.

[67]  D. Bedrov,et al.  Multiscale Modeling of Poly(ethylene oxide)-Poly(propylene oxide)-Poly(ethylene oxide) Triblock Copolymer Micelles in Aqueous Solution. , 2006, Journal of chemical theory and computation.

[68]  D Thirumalai,et al.  Probing the initial stage of aggregation of the Abeta(10-35)-protein: assessing the propensity for peptide dimerization. , 2005, Journal of molecular biology.

[69]  Cecilia Clementi,et al.  The effects of nonnative interactions on protein folding rates: Theory and simulation , 2004, Protein science : a publication of the Protein Society.

[70]  C. Lee,et al.  Predicting protein mutant energetics by self-consistent ensemble optimization. , 1994, Journal of molecular biology.

[71]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[72]  Janet M. Thornton,et al.  Rebuilding flavodoxin from Cα coordinates: A test study , 1989 .

[73]  Dmitry M Korzhnev,et al.  Dramatic acceleration of protein folding by stabilization of a nonnative backbone conformation. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[74]  Kurt Kremer,et al.  Long time atomistic polymer trajectories from coarse grained simulations: bisphenol-A polycarbonate. , 2006, Soft matter.

[75]  S. Wodak,et al.  Modelling the polypeptide backbone with 'spare parts' from known protein structures. , 1989, Protein engineering.

[76]  Joan-Emma Shea,et al.  Self-assembly of peptides into a β-barrel motif , 2004 .

[77]  Matej Praprotnik,et al.  Adaptive resolution scheme for efficient hybrid atomistic-mesoscale molecular dynamics simulations of dense liquids. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[78]  D Baker,et al.  Folding dynamics of the src SH3 domain. , 1997, Biochemistry.

[79]  M. Proctor,et al.  Structural changes in the transition state of protein folding: alternative interpretations of curved chevron plots. , 1999, Biochemistry.

[80]  Joan-Emma Shea,et al.  Probing the folding free energy landscape of the src-SH3 protein domain , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[81]  Ulrich H E Hansmann,et al.  Efficient sampling of protein structures by model hopping. , 2005, Physical review letters.

[82]  V. Pande,et al.  On the transition coordinate for protein folding , 1998 .

[83]  Niles A Pierce,et al.  Protein design is NP-hard. , 2002, Protein engineering.

[84]  Arne Elofsson,et al.  Side Chain-Positioning as an Integer Programming Problem , 2001, WABI.

[85]  Marilisa Neri,et al.  Coarse-grained model of proteins incorporating atomistic detail of the active site. , 2005, Physical review letters.

[86]  Johan Desmet,et al.  The dead-end elimination theorem and its use in protein side-chain positioning , 1992, Nature.

[87]  Feng Ding,et al.  Reconstruction of the src-SH3 protein domain transition state ensemble using multiscale molecular dynamics simulations. , 2005, Journal of molecular biology.

[88]  Daniel E. Otzen,et al.  Conformational plasticity in folding of the split β-α-β protein S6: evidence for burst-phase disruption of the native state , 2002 .

[89]  Cecilia Clementi,et al.  Balancing energy and entropy: a minimalist model for the characterization of protein folding landscapes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[90]  D Baker,et al.  Long-range order in the src SH3 folding transition state. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[91]  O. Schueler‐Furman,et al.  Improved side‐chain modeling for protein–protein docking , 2005, Protein science : a publication of the Protein Society.

[92]  Temperature dependence of the free energy landscape of the src‐SH3 protein domain , 2004, Proteins.

[93]  I Lasters,et al.  All in one: a highly detailed rotamer library improves both accuracy and speed in the modelling of sidechains by dead-end elimination. , 1997, Folding & design.