A database assisted protein structure prediction method via a swarm intelligence algorithm

The complex and rugged potential energy landscape has made protein structure prediction a challenging task in computational biology. Here, we propose an efficient protein structure prediction method combining both template-based and template-free methods. Specifically, the initial protein conformations can be built by a non-redundant protein database and random sampling method with constraints of the secondary structure of the proteins. Three different structure evolution methods including improved particle swarm optimization (PSO) algorithm, random perturbation and fragment substitution are employed to update the protein structures while keeping the secondary structures the same. The present method is benchmarked on several known protein structures with distinct folding patterns, including α proteins, β proteins and αβ proteins. The high success rate and the accuracy of the results demonstrate the reliability of this method.

[1]  Dong Xu,et al.  Toward optimal fragment generations for ab initio protein structure assembly , 2013, Proteins.

[2]  Yang Zhang,et al.  Ab initio protein structure assembly using continuous structure fragments and optimized knowledge‐based force field , 2012, Proteins.

[3]  J. Skolnick,et al.  Development and large scale benchmark testing of the PROSPECTOR_3 threading algorithm , 2004, Proteins.

[4]  Yanchao Wang,et al.  Particle-swarm structure prediction on clusters. , 2012, The Journal of chemical physics.

[5]  R. Othman,et al.  Computational identification of self‐inhibitory peptides from envelope proteins , 2012, Proteins.

[6]  Yang Zhang Protein structure prediction: when is it useful? , 2009, Current opinion in structural biology.

[7]  Yang Zhang,et al.  Template‐based modeling and free modeling by I‐TASSER in CASP7 , 2007, Proteins.

[8]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[9]  A. Kukol,et al.  Molecular Modeling of Proteins , 2008, Methods Molecular Biology™.

[10]  George Karypis,et al.  Introduction to Protein Structure Prediction: Methods and Algorithms , 2010 .

[11]  Yang Zhang,et al.  How significant is a protein structure similarity with TM-score = 0.5? , 2010, Bioinform..

[12]  Yanming Ma,et al.  Structure Prediction of Atoms Adsorbed on Two-Dimensional Layer Materials: Method and Applications , 2015 .

[13]  R. Friesner,et al.  Evaluation and Reparametrization of the OPLS-AA Force Field for Proteins via Comparison with Accurate Quantum Chemical Calculations on Peptides† , 2001 .

[14]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[15]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[16]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[17]  Michael Habeck,et al.  HHfrag: HMM-based fragment detection using HHpred , 2011, Bioinform..

[18]  J. Skolnick,et al.  Ab initio modeling of small proteins by iterative TASSER simulations , 2007, BMC Biology.

[19]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[20]  Li Zhu,et al.  CALYPSO: A method for crystal structure prediction , 2012, Comput. Phys. Commun..

[21]  C. Floudas,et al.  ASTRO-FOLD: a combinatorial and global optimization framework for Ab initio prediction of three-dimensional structures of proteins from the amino acid sequence. , 2003, Biophysical journal.

[22]  A. Schug,et al.  Basin hopping simulations for all-atom protein folding. , 2006, The Journal of chemical physics.

[23]  Arnim Hellweg,et al.  TmoleX—A graphical user interface for TURBOMOLE , 2010, J. Comput. Chem..

[24]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[25]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[26]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[27]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[28]  Gerhard Klimeck,et al.  Coherent Electron Transport by Adiabatic Passage in an Imperfect Donor Chain , 2010, 1008.1494.

[29]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[30]  J. Skolnick,et al.  Ab initio protein structure prediction using chunk-TASSER. , 2007, Biophysical journal.

[31]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[32]  Mohammed J. Zaki,et al.  Protein Structure Prediction , 2008, Methods in Molecular Biology™.

[33]  Yang Zhang,et al.  A Novel Side-Chain Orientation Dependent Potential Derived from Random-Walk Reference State for Protein Fold Selection and Structure Prediction , 2010, PloS one.

[34]  Yanming Ma,et al.  Self-assembled ultrathin nanotubes on diamond (100) surface , 2014, Nature Communications.

[35]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[36]  Helio J. C. Barbosa,et al.  A multiple minima genetic algorithm for protein structure prediction , 2014, Appl. Soft Comput..

[37]  Y Xu,et al.  Protein threading using PROSPECT: Design and evaluation , 2000, Proteins.

[38]  C. Floudas,et al.  Ab initio prediction of the three‐dimensional structure of a de novo designed protein: A double‐blind case study , 2004, Proteins.

[39]  David C. Jones Predicting novel protein folds by using FRAGFOLD , 2001, Proteins.

[40]  Wilfred F. van Gunsteren,et al.  An improved GROMOS96 force field for aliphatic hydrocarbons in the condensed phase , 2001, J. Comput. Chem..

[41]  Yanming Ma,et al.  An effective structure prediction method for layered materials based on 2D particle swarm optimization algorithm. , 2012, The Journal of chemical physics.

[42]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[43]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[44]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[45]  M. Karplus,et al.  Simulation of activation free energies in molecular systems , 1996 .

[46]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[47]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[48]  A. Liwo,et al.  Protein structure prediction by global optimization of a potential energy function. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[49]  A. Liwo,et al.  Ab initio simulations of protein-folding pathways by molecular dynamics with the united-residue model of polypeptide chains. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[50]  J. Skolnick,et al.  Automated structure prediction of weakly homologous proteins on a genomic scale. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Anna Tramontano,et al.  The assessment of methods for protein structure prediction. , 2008, Methods in molecular biology.