GSAFold: A new application of GSA to protein structure prediction

The folding process defines three‐dimensional protein structures from their amino acid chains. A protein's structure determines its activity and properties; thus knowing such conformation on an atomic level is essential for both basic and applied studies of protein function and dynamics. However, the acquisition of such structures by experimental methods is slow and expensive, and current computational methods mostly depend on previously known structures to determine new ones. Here we present a new software called GSAFold that applies the generalized simulated annealing (GSA) algorithm on ab initio protein structure prediction. The GSA is a stochastic search algorithm employed in energy minimization and used in global optimization problems, especially those that depend on long‐range interactions, such as gravity models and conformation optimization of small molecules. This new implementation applies, for the first time in ab initio protein structure prediction, an analytical inverse for the Visitation function of GSA. It also employs the broadly used NAMD Molecular Dynamics package to carry out energy calculations, allowing the user to select different force fields and parameterizations. Moreover, the software also allows the execution of several simulations simultaneously. Applications that depend on protein structures include rational drug design and structure‐based protein function prediction. Applying GSAFold in a test peptide, it was possible to predict the structure of mastoparan‐X to a root mean square deviation of 3.00 Å. Proteins 2012; © 2012 Wiley Periodicals, Inc.

[1]  H. Szu Fast simulated annealing , 1987 .

[2]  C. Tsallis,et al.  Geometry optimization and conformational analysis through generalized simulated annealing , 1998 .

[3]  M Kokkinidis,et al.  Protein plasticity to the extreme: changing the topology of a 4-alpha-helical bundle with a single amino acid substitution. , 1999, Structure.

[4]  R. Kolodny,et al.  Sequence-similar, structure-dissimilar protein pairs in the PDB , 2007, Proteins.

[5]  M. Bruch,et al.  Characterization of the structure and dynamics of mastoparan‐X during folding in aqueous TFE by CD and NMR spectroscopy , 2008, Biopolymers.

[6]  K. C. Mundim,et al.  Atomic basis sets optimization using the generalized simulated annealing approach: New basis sets for the first row elements , 2008 .

[7]  V. Ingram,et al.  Gene Mutations in Human Hæmoglobin: the Chemical Difference Between Normal and Sickle Cell Hæmoglobin , 1957, Nature.

[8]  T. Kohno,et al.  G protein-bound conformation of mastoparan-X: heteronuclear multidimensional transferred nuclear overhauser effect analysis of peptide uniformly enriched with 13C and 15N. , 1998, Biochemistry.

[9]  Torsten Schwede,et al.  BIOINFORMATICS Bioinformatics Advance Access published November 12, 2005 The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling , 2022 .

[10]  C. Chothia,et al.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Laxmikant V. Kale,et al.  NAMD2: Greater Scalability for Parallel Molecular Dynamics , 1999 .

[12]  J. Onuchic,et al.  The energy landscape theory of protein folding: insights into folding mechanisms and scenarios. , 2000, Advances in protein chemistry.

[13]  Renio S. Mendes,et al.  Is re-association in folded proteins a case of nonextensivity? , 1999 .

[14]  M. Moret,et al.  New stochastic strategy to analyze helix folding. , 2002, Biophysical journal.

[15]  C. Tsallis Possible generalization of Boltzmann-Gibbs statistics , 1988 .

[16]  Akbar Nayeem,et al.  A comparative study of available software for high‐accuracy homology modeling: From sequence alignments to structural models , 2006, Protein science : a publication of the Protein Society.

[17]  Alexander D. MacKerell,et al.  Extending the treatment of backbone energetics in protein force fields: Limitations of gas‐phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations , 2004, J. Comput. Chem..

[18]  Torsten Schwede,et al.  The SWISS-MODEL Repository of annotated three-dimensional protein structure homology models , 2004, Nucleic Acids Res..

[19]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[20]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[21]  C. Tsallis,et al.  Generalized simulated annealing , 1995, cond-mat/9501047.

[22]  M. Moret,et al.  Stochastic strategy to analyze protein folding , 2005 .

[23]  W. Wenzel,et al.  Scaling behavior of stochastic minimization algorithms in a perfect funnel landscape , 1999 .

[24]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[25]  Marcelo A. Moret,et al.  Stochastic molecular optimization using generalized simulated annealing , 1998, J. Comput. Chem..

[26]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[27]  C. Chothia Proteins. One thousand families for the molecular biologist. , 1992, Nature.

[28]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[29]  Marcelo A. Moret,et al.  Generalized simulated annealing applied to protein folding studies , 2006, J. Comput. Chem..

[30]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[31]  Juno Choe,et al.  Protein tolerance to random amino acid change. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[32]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[33]  K. C. Mundim,et al.  Optimization of non-linear gravity models through generalized simulated annealing , 1998 .

[34]  Richard W. Aldrich,et al.  Statistical Limits to the Identification of Ion Channel Domains by Sequence Similarity , 2006, The Journal of general physiology.

[35]  Pedro G. Pascutti,et al.  Studies of molecular docking between fibroblast growth factor and heparin using generalized simulated annealing , 2008 .

[36]  W. DeGrado,et al.  Role of helix nucleation in the kinetics of binding of mastoparan X to phospholipid bilayers. , 2007, Biochemistry.

[37]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[38]  T. Kameda,et al.  Combined use of replica-exchange molecular dynamics and magic-angle-spinning solid-state NMR spectral simulations for determining the structure and orientation of membrane-bound peptide. , 2011, The journal of physical chemistry. B.

[39]  Thomas Schanze,et al.  An exact D-dimensional Tsallis random number generator for generalized simulated annealing , 2006, Comput. Phys. Commun..

[40]  Daniel R. Ripoll,et al.  A parallel Monte Carlo search algorithm for the conformational analysis of polypeptides , 1992, The Journal of Supercomputing.