Native structure-based modeling and simulation of biomolecular systems per mouse click

BackgroundMolecular dynamics (MD) simulations provide valuable insight into biomolecular systems at the atomic level. Notwithstanding the ever-increasing power of high performance computers current MD simulations face several challenges: the fastest atomic movements require time steps of a few femtoseconds which are small compared to biomolecular relevant timescales of milliseconds or even seconds for large conformational motions. At the same time, scalability to a large number of cores is limited mostly due to long-range interactions. An appealing alternative to atomic-level simulations is coarse-graining the resolution of the system or reducing the complexity of the Hamiltonian to improve sampling while decreasing computational costs. Native structure-based models, also called Gō-type models, are based on energy landscape theory and the principle of minimal frustration. They have been tremendously successful in explaining fundamental questions of, e.g., protein folding, RNA folding or protein function. At the same time, they are computationally sufficiently inexpensive to run complex simulations on smaller computing systems or even commodity hardware. Still, their setup and evaluation is quite complex even though sophisticated software packages support their realization.ResultsHere, we establish an efficient infrastructure for native structure-based models to support the community and enable high-throughput simulations on remote computing resources via GridBeans and UNICORE middleware. This infrastructure organizes the setup of such simulations resulting in increased comparability of simulation results. At the same time, complete workflows for advanced simulation protocols can be established and managed on remote resources by a graphical interface which increases reusability of protocols and additionally lowers the entry barrier into such simulations for, e.g., experimental scientists who want to compare their results against simulations. We demonstrate the power of this approach by illustrating it for protein folding simulations for a range of proteins.ConclusionsWe present software enhancing the entire workflow for native structure-based simulations including exception-handling and evaluations. Extending the capability and improving the accessibility of existing simulation packages the software goes beyond the state of the art in the domain of biomolecular simulations. Thus we expect that it will stimulate more individuals from the community to employ more confidently modeling in their research.

[1]  A. Schug,et al.  Simulating Biomolecular Folding and Function by Native‐Structure‐Based/Go‐Type Models , 2014 .

[2]  José N Onuchic,et al.  Accommodation of aminoacyl-tRNA into the ribosome involves reversible excursions along multiple pathways. , 2010, RNA.

[3]  J. Onuchic,et al.  Biomolecular dynamics: order–disorder transitions and energy landscapes , 2012, Reports on progress in physics. Physical Society.

[4]  Cecilia Clementi,et al.  The effects of nonnative interactions on protein folding rates: Theory and simulation , 2004, Protein science : a publication of the Protein Society.

[5]  Heiko Lammert,et al.  Allostery in the ferredoxin protein motif does not involve a conformational switch , 2011, Proceedings of the National Academy of Sciences.

[6]  Glenn Cater,et al.  Service Oriented Architecture (SOA) , 2011, Encyclopedia of Information Assurance.

[7]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[8]  J. Onuchic,et al.  The Many Faces of Structure-Based Potentials: From Protein Folding Landscapes to Structural Characterization of Complex Biomolecules , 2012 .

[9]  Wayne Boucher,et al.  The CCPN data model for NMR spectroscopy: Development of a software pipeline , 2005, Proteins.

[10]  S. Plotkin Speeding protein folding beyond the Gō model: How a little frustration sometimes helps , 2000, Proteins.

[11]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[12]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[13]  Thomas Steinke,et al.  A Data Driven Science Gateway for Computational Workflows , 2012 .

[14]  Björn Hagemeier,et al.  UNICORE 6 — Recent and Future Advancements , 2010, Ann. des Télécommunications.

[15]  Wolfgang Wenzel,et al.  UNICORE-based integrated application services for multiscale materials modeling , 2011 .

[16]  Borja Sotomayor,et al.  Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses , 2014, J. Biomed. Informatics.

[17]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[18]  Rasmus H. Fogh,et al.  Structure Simulation with Calculated NMR Parameters - Integrating COSMOS into the CCPN Framework , 2012, HealthGrid.

[19]  Thomas Steinke,et al.  MoSGrid: efficient data management and a standardized data exchange format for molecular simulations in a grid environment , 2012, Journal of Cheminformatics.

[20]  Andreas Prlic,et al.  BioJava: an open-source framework for bioinformatics in 2012 , 2012, Bioinform..

[21]  Eugene I. Shakhnovich,et al.  Kinetics, thermodynamics and evolution of non-native interactions in a protein folding nucleus , 2000, Nature Structural Biology.

[22]  José M. García,et al.  A Pipeline Pilot based SOAP implementation of FlexScreen for High-Throughput Virtual Screening , 2011, IWSG-Life.

[23]  C. Clementi,et al.  The effects of non-native interactions on protein folding rates: Theory and simulation , 2004, q-bio/0407023.

[24]  Thomas Steinke,et al.  Standards‐based metadata management for molecular simulations , 2014, Concurr. Comput. Pract. Exp..

[25]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[26]  Y. Levy,et al.  Effect of glycosylation on protein folding: A close look at thermodynamic stabilization , 2008, Proceedings of the National Academy of Sciences.

[27]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[28]  Wolfgang Wenzel,et al.  UNICORE based workflows for the simulation of organic light-emitting diodes , 2012 .

[29]  Cecilia Clementi,et al.  Quantifying the roughness on the free energy landscape: entropic bottlenecks and protein folding rates. , 2004, Journal of the American Chemical Society.

[30]  Changbong Hyeon,et al.  Revealing the bifurcation in the unfolding pathways of GFP by using single-molecule experiments and simulations , 2007, Proceedings of the National Academy of Sciences.

[31]  J. Onuchic,et al.  An all‐atom structure‐based potential for proteins: Bridging minimal models with all‐atom empirical forcefields , 2009, Proteins.

[32]  T. N. Bhat,et al.  A framework for scientific data modeling and automated software development , 2005, Bioinform..

[33]  N. Go Protein folding as a stochastic process , 1983 .

[34]  Bernd Schuller,et al.  The UNICORE Rich Client: Facilitating the Automated Execution of Scientific Workflows , 2010, 2010 IEEE Sixth International Conference on e-Science.

[35]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[36]  Yaakov Levy,et al.  Mutations as trapdoors to two competing native conformations of the Rop-dimer , 2007, Proceedings of the National Academy of Sciences.

[37]  Martin Weigt,et al.  Structural basis of histidine kinase autophosphorylation deduced by integrating genomics, molecular dynamics, and mutagenesis , 2012, Proceedings of the National Academy of Sciences.

[38]  Alexander Schug,et al.  From protein folding to protein function and biomolecular binding by energy landscape theory. , 2010, Current opinion in pharmacology.

[39]  J. Onuchic,et al.  Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? An investigation for small globular proteins. , 2000, Journal of molecular biology.

[40]  J. Onuchic,et al.  Robustness and generalization of structure‐based models for protein folding and function , 2009, Proteins.

[41]  Alexandre M J J Bonvin,et al.  A Flexible, Grid-Enabled Web Portal for GROMACS Molecular Dynamics Simulations. , 2012, Journal of chemical theory and computation.

[42]  John Karanicolas,et al.  Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. , 2003, Journal of molecular biology.

[43]  Bernd Schuller,et al.  Chemomentum - UNICORE 6 Based Infrastructure for Complex Applications in Science and Technology , 2007, Euro-Par Workshops.

[44]  Gerhard Hummer,et al.  Native contacts determine protein folding mechanisms in atomistic simulations , 2013, Proceedings of the National Academy of Sciences.

[45]  Peter Klein,et al.  A structure-based model for ligand binding and dimerization of EGF receptors. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Terence Hwa,et al.  High-resolution protein complexes from integrating genomic information with molecular simulation , 2009, Proceedings of the National Academy of Sciences.

[47]  M. Fernanda Rey-Stolle,et al.  Topology‐based models and NMR structures in protein folding simulations , 2009, J. Comput. Chem..

[48]  Jeffrey K. Noel,et al.  SMOG@ctbp: simplified deployment of structure-based models in GROMACS , 2010, Nucleic Acids Res..

[49]  W F Drew Bennett,et al.  Improved Parameters for the Martini Coarse-Grained Protein Force Field. , 2013, Journal of chemical theory and computation.

[50]  J. Lindemann,et al.  Advanced Resource Connector middleware for lightweight computational Grids , 2007, Future Gener. Comput. Syst..

[51]  H. Grubmüller,et al.  Energy barriers and driving forces in tRNA translocation through the ribosome , 2013, Nature Structural &Molecular Biology.

[52]  Andreas Prlic,et al.  Sequence analysis , 2003 .

[53]  G. Voth Coarse-Graining of Condensed Phase and Biomolecular Systems , 2008 .

[54]  Alexander Schug,et al.  Differences between cotranscriptional and free riboswitch folding , 2013, Nucleic acids research.

[55]  Alexander Schug,et al.  Geometrical features of the protein folding mechanism are a robust property of the energy landscape: a detailed investigation of several reduced models. , 2008, The journal of physical chemistry. B.

[56]  Daniel Bayer,et al.  Integrating ARC grid middleware with Taverna workflows , 2008, Bioinform..

[57]  Cecilia Clementi,et al.  Coarse-grained models of protein folding: toy models or predictive tools? , 2008, Current opinion in structural biology.

[58]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[59]  A. Schug,et al.  Coarse-Grained Structure-Based Simulations of Proteins and RNA , 2008 .

[60]  Andreas Bender,et al.  A Service-oriented Framework for Integration of Domain-specific Data Models in Scientific Workflows , 2013, ICCS.

[61]  J. Onuchic,et al.  Prediction of folding mechanism for circular-permuted proteins. , 2001, Journal of molecular biology.

[62]  Abhinav Verma,et al.  eSBMTools 1.0: enhanced native structure-based modeling tools , 2013, Bioinform..

[63]  Morris Riedel,et al.  GridBeans: Support e-Science and Grid Applications , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[64]  Peter G Wolynes,et al.  Localizing frustration in native proteins and protein assemblies , 2007, Proceedings of the National Academy of Sciences.

[65]  Dmitri I. Svergun,et al.  WeNMR: Structural Biology on the Grid , 2011, Journal of Grid Computing.