Novel use of a genetic algorithm for protein structure prediction: Searching template and sequence alignment space

A novel genetic algorithm was applied to all CASP5 targets. The algorithm simultaneously searches template and alignment space. Results show that the current implementation of the method is perhaps most useful in recognizing and refining remote homology targets. This new method is briefly described and results are analyzed. Strengths and weaknesses of the current implementation of the algorithm are discussed. Proteins 2003;53:424–429. © 2003 Wiley‐Liss, Inc.

[1]  H A Scheraga,et al.  Improved genetic algorithm for the protein folding problem by use of a Cartesian combination operator , 1996, Protein science : a publication of the Protein Society.

[2]  Paul W. Fitzjohn,et al.  Comparative modelling: an essential methodology for protein structure prediction in the post-genomic era. , 2002, Applied bioinformatics.

[3]  Chris Sander,et al.  Thermitase, a thermostable subtilisin: Comparison of predicted and experimental structures and the molecular cause of thermostability , 1989, Proteins.

[4]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[5]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[6]  Paul W. Fitzjohn,et al.  In silico protein recombination: enhancing template and sequence alignment selection for comparative protein modelling. , 2003, Journal of molecular biology.

[7]  T L Blundell,et al.  FUGUE: sequence-structure homology recognition using environment-specific substitution tables and structure-dependent gap penalties. , 2001, Journal of molecular biology.

[8]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[9]  Berk Hess,et al.  GROMACS 3.0: a package for molecular simulation and trajectory analysis , 2001 .

[10]  C. Lambert,et al.  ESyPred 3 D : Prediction of proteins 3 D structures , 2002 .

[11]  William L. Jorgensen,et al.  OPLS all‐atom force field for carbohydrates , 1997 .

[12]  Paul A. Bates,et al.  Domain Fishing: a first step in protein comparative modelling , 2002, Bioinform..

[13]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[14]  Marc A. Martí-Renom,et al.  EVA: continuous automatic evaluation of protein structure prediction servers , 2001, Bioinform..

[15]  M. Sternberg,et al.  Enhanced genome annotation using structural profiles in the program 3D-PSSM. , 2000, Journal of molecular biology.

[16]  M J Sippl,et al.  Assessment of the CASP4 fold recognition category , 2001, Proteins.

[17]  Michael Levitt,et al.  Roles of mutation and recombination in the evolution of protein thermodynamics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  D J Osguthorpe,et al.  Refined models for computer simulation of protein folding. Applications to the study of conserved secondary structure and flexible hinge points during the folding of pancreatic trypsin inhibitor. , 1979, Journal of molecular biology.

[19]  Sean R. Eddy,et al.  Pfam: multiple sequence alignments and HMM-profiles of protein domains , 1998, Nucleic Acids Res..

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  M J Sternberg,et al.  Model building by comparison at CASP3: Using expert knowledge and computer automation , 1999, Proteins.

[22]  R Leplae,et al.  Analysis and assessment of comparative modeling predictions in CASP4 , 2001, Proteins.

[23]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[24]  David S. Goodsell,et al.  Distributed automated docking of flexible ligands to proteins: Parallel applications of AutoDock 2.4 , 1996, J. Comput. Aided Mol. Des..

[25]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[26]  Christophe G. Lambert,et al.  ESyPred3D: Prediction of proteins 3D structures , 2002, Bioinform..

[27]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[28]  A C May,et al.  Improved genetic algorithm-based protein structure comparisons: pairwise and multiple superpositions. , 1995, Protein engineering.

[29]  H. Umeyama,et al.  An automatic homology modeling method consisting of database searches and simulated annealing. , 2000, Journal of molecular graphics & modelling.

[30]  J Lundström,et al.  Pcons: A neural‐network–based consensus predictor that improves fold recognition , 2001, Protein science : a publication of the Protein Society.

[31]  D Fischer,et al.  Hybrid fold recognition: combining sequence derived properties with evolutionary information. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.