Using genetic algorithm to design protein sequence

Abstract In this work, genetic algorithms concepts along with a rotamer library for proteins side chains are used to optimize the tertiary structure of the hydrophobic core of Cytochrome b 562 starting from the known PDB structure of its backbone which is kept fixed while the side chains of the hydrophobic core are allowed to adopt the conformations present in the rotamer library. The atoms of the side chains forming the core interact via van der Waals energy. Besides the prediction of the native core structure, it is also suggested a set of different amino acid sequences for this core. Comparison between these new cores and the native are made in terms of their volumes, van der Waals energies values and the numbers of contacts made by the side chains forming the cores. This paper proves that genetic algorithms area efficient to design new sequence for the protein core.

[1]  J R Desjarlais,et al.  Side-chain and backbone flexibility in protein core design. , 1999, Journal of molecular biology.

[2]  Christopher T. Saunders,et al.  Evaluation of structural and evolutionary contributions to deleterious mutation prediction. , 2002, Journal of molecular biology.

[3]  R Samudrala,et al.  Constructing side chains on near-native main chains for ab initio protein structure prediction. , 2000, Protein engineering.

[4]  Steven M. Muskal,et al.  Predicting protein secondary structure content. A tandem neural network approach. , 1992, Journal of molecular biology.

[5]  M. Palumbo,et al.  Patterns, structures, and amino acid frequencies in structural building blocks, a protein secondary structure classification scheme , 1997, Proteins.

[6]  Martin Fowler,et al.  Patterns , 2021, IEEE Software.

[7]  H. Qian,et al.  Prediction of α-Helices in Proteins Based on Thermodynamic Parameters from Solution Chemistry , 1996 .

[8]  N. Grishin,et al.  Side‐chain modeling with an optimized scoring function , 2002, Protein science : a publication of the Protein Society.

[9]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[10]  Marc L. Pusey,et al.  Artificial neural network prediction of tetragonal lysozyme face growth rates , 1996 .

[11]  Adam Zemla,et al.  Critical assessment of methods of protein structure prediction (CASP)‐round V , 2005, Proteins.

[12]  Jaap Heringa,et al.  Protein secondary structure prediction. , 2010, Methods in molecular biology.

[13]  S. Sun,et al.  A genetic algorithm that seeks native states of peptides and proteins. , 1995, Biophysical journal.

[14]  R. Friesner,et al.  Tertiary structure prediction of mixed α/β proteins via energy minimization , 1998 .

[15]  P. Argos,et al.  Incorporation of non-local interactions in protein secondary structure prediction from the amino acid sequence. , 1996, Protein engineering.

[16]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[17]  C Sander,et al.  Third generation prediction of secondary structures. , 2000, Methods in molecular biology.

[18]  M. Karplus,et al.  Use of quantitative structure‐property relationships to predict the folding ability of model proteins , 1998, Proteins.

[19]  P Argos,et al.  Folding the main chain of small proteins with the genetic algorithm. , 1994, Journal of molecular biology.

[20]  H Kono,et al.  Energy minimization method using automata network for sequence and side‐chain conformation prediction from given backbone geometry , 1994, Proteins.

[21]  Richard Bonneau,et al.  Contact order and ab initio protein structure prediction , 2002, Protein science : a publication of the Protein Society.

[22]  D. Shortle,et al.  Prediction of protein structure by emphasizing local side‐chain/backbone interactions in ensembles of turn fragments , 2003, Proteins.

[23]  Jin Li,et al.  On Evaluating Molecular-Docking Methods for Pose Prediction and Enrichment Factors , 2006, J. Chem. Inf. Model..

[24]  Y. Cui,et al.  Protein folding simulation with genetic algorithm and supersecondary structure constraints , 1998, Proteins.

[25]  Luis P. B. Scott,et al.  Use of genetic algorithms and solvation potential to study peptides structure , 2008, Appl. Math. Comput..

[26]  Lecture 21 Side-chain & Backbone Flexibility in Protein Core Design , 2022 .

[27]  J M Chandonia,et al.  The importance of larger data sets for protein secondary structure prediction with neural networks , 1996, Protein science : a publication of the Protein Society.

[28]  Y D Cai,et al.  Using neural networks for prediction of domain structural classes. , 2000, Biochimica et biophysica acta.

[29]  Andrea Cavalli,et al.  A Comparative Study on the Application of Hierarchical-Agglomerative Clustering Approaches to Organize Outputs of Reiterated Docking Runs , 2006, J. Chem. Inf. Model..

[30]  Kuo-Chen Chou,et al.  Classification and prediction of ߖturn types by neural network , 1999 .

[31]  Giovanni Soda,et al.  Exploiting the past and the future in protein secondary structure prediction , 1999, Bioinform..

[32]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[33]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[34]  Xavier Llorà,et al.  ENPDA: an evolutionary structure-based de novo peptide design algorithm , 2005, J. Comput. Aided Mol. Des..

[35]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[36]  Junwei Zhang,et al.  VISCANA: Visualized Cluster Analysis of Protein-Ligand Interaction Based on the ab Initio Fragment Molecular Orbital Method for Virtual Ligand Screening , 2006, J. Chem. Inf. Model..

[37]  B. Rost,et al.  Combining evolutionary information and neural networks to predict protein secondary structure , 1994, Proteins.

[38]  M. Ejtehadi,et al.  Conservation of statistical results under the reduction of pair-contact interactions to solvation interactions. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  George A. Kaminski,et al.  Force Field Validation Using Protein Side Chain Prediction , 2002 .

[40]  J Moult,et al.  Protein folding simulations with genetic algorithms and a detailed molecular description. , 1997, Journal of molecular biology.

[41]  D. Baker,et al.  Prediction of local structure in proteins using a library of sequence-structure motifs. , 1998, Journal of molecular biology.

[42]  S. Brunak,et al.  Protein secondary structure and homology by neural networks The α‐helices in rhodopsin , 1988 .

[43]  K. Leonhard,et al.  Solvent–amino acid interaction energies in three‐dimensional‐lattice Monte Carlo simulations of a model 27‐mer protein: Folding thermodynamics and kinetics , 2004, Protein science : a publication of the Protein Society.

[44]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[45]  A Joshua Wand,et al.  Improved side‐chain prediction accuracy using an ab initio potential energy function and a very large rotamer library , 2004, Protein science : a publication of the Protein Society.

[46]  J. Skolnick,et al.  A reduced model of short range interactions in polypeptide chains , 1995 .

[47]  J M Chandonia,et al.  New methods for accurate prediction of protein secondary structure , 1999, Proteins.