Extended HP Model for Protein Structure Prediction

This paper describes a detailed investigation of a lattice-based HP (hydrophobic-hydrophilic) model for ab initio protein structure prediction (PSP). The outcome of the simplified HP lattice model has high degeneracy, which could mislead the prediction. The HPNX model was proposed to address the degeneracy problem as well as to avoid the conformational deformity with the hydrophilic (P) residues. We have experimentally shown that it is necessary to further improve the existing HPNX model. We have found and solved the critical error of another existing YhHX model. By extracting the significant features from the YhHX for the HPNX model, we have proposed a novel hHPNX model. Hybrid Genetic Algorithm (HGA) has been used to compare the predictability of these models and hHPNX outperformed other models. We preferred 3D face-centered-cube (FCC) lattice configuration to have closest resemblance to the real folded 3D protein.

[1]  Mihalis Yannakakis,et al.  On the complexity of protein folding (extended abstract) , 1998, STOC '98.

[2]  T. Hales The Kepler conjecture , 1998, math/9811078.

[3]  Ram Samudrala,et al.  A Combined Approach for Ab Initio Construction of Low Resolution Protein Tertiary Structures from Sequence , 1999, Pacific Symposium on Biocomputing.

[4]  Yong Wang,et al.  Optimal HP configurations of proteins by combining local search with elastic net algorithm. , 2007, Journal of biochemical and biophysical methods.

[5]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[6]  Rolf Backofen,et al.  Algorithmic approach to quantifying the hydrophobic force contribution in protein folding , 1999, German Conference on Bioinformatics.

[7]  David Baker,et al.  Simple physical models connect theory and experiment in protein folding kinetics. , 2002, Journal of molecular biology.

[8]  Anthony J. Guttmann,et al.  Self-avoiding walks in constrained and random geometries: Series studies , 2005 .

[9]  S Karlin,et al.  How are close residues of protein structures distributed in primary sequence? , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Nicolas E. Buchler,et al.  Effect of alphabet size and foldability requirements on protein structure designability , 1999, Proteins.

[11]  Michael Bachmann,et al.  Exact enumeration of three-dimensional lattice proteins , 2005, Comput. Phys. Commun..

[12]  K. Lin,et al.  Universal amplitude ratios for three-dimensional self-avoiding walks , 2002 .

[13]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[14]  Rolf Backofen,et al.  Application of constraint programming techniques for structure prediction of lattice proteins with extended alphabets , 1999, Bioinform..

[15]  Madhu Chetty,et al.  A new guided genetic algorithm for 2D hydrophobic-hydrophilic model to predict protein folding , 2005, 2005 IEEE Congress on Evolutionary Computation.

[16]  Richard A. Goldstein,et al.  Surveying determinants of protein structure designability across different energy models and amino-acid alphabets: A consensus , 2000 .

[17]  Erich Bornberg-Bauer,et al.  Comparing folding codes in simple heteropolymer models of protein evolutionary landscape: robustness of the superfunnel paradigm. , 2005, Biophysical journal.

[18]  E. Koonin,et al.  A universal trend of amino acid gain and loss in protein evolution , 2005, Nature.

[19]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[20]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[21]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[22]  Yong Wang,et al.  Exploration of two-dimensional hydrophobic-polar lattice model by combining local search with elastic net algorithm. , 2006, The Journal of chemical physics.

[23]  R Samudrala,et al.  Ab initio construction of protein tertiary structures using a hierarchical approach. , 2000, Journal of molecular biology.

[24]  D. Baker,et al.  Prediction and design of macromolecular structures and interactions , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[25]  O. Schueler‐Furman,et al.  Progress in Modeling of Protein Structures and Interactions , 2005, Science.

[26]  Konstantinos G. Margaritis,et al.  An Experimental Study of Benchmarking Functions for Genetic Algorithms , 2002, Int. J. Comput. Math..

[27]  Erich Bornberg-Bauer,et al.  Chain growth algorithms for HP-type lattice proteins , 1997, RECOMB '97.

[28]  Madhu Chetty,et al.  A Hybrid Genetic Algorithm for 2D FCC Hydrophobic-Hydrophilic Lattice Model to Predict Protein Folding , 2006, Australian Conference on Artificial Intelligence.

[29]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[30]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[31]  André Bellemans,et al.  Self-avoiding walks on the simple cubic lattice , 1973 .

[32]  Dominik Gront,et al.  A simple lattice model that exhibits a protein-like cooperative all-or-none folding transition. , 2003, Biopolymers.

[33]  R. Jernigan,et al.  Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. , 1996, Journal of molecular biology.

[34]  Steven Skiena,et al.  Local rules for protein folding on a triangular lattice and generalized hydrophobicity in the HP model , 1997, RECOMB '97.

[35]  Lars Malmström,et al.  Automated prediction of CASP‐5 structures using the Robetta server , 2003, Proteins.

[36]  Michael Levitt,et al.  A brighter future for protein structure prediction , 1999, Nature Structural Biology.

[37]  Abdul Sattar,et al.  Protein folding prediction in 3D FCC HP lattice model using genetic algorithm , 2007, 2007 IEEE Congress on Evolutionary Computation.

[38]  R L Jernigan,et al.  Ideal architecture of residue packing and its observation in protein structures , 1997, Protein science : a publication of the Protein Society.

[39]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[40]  G. Crippen Prediction of protein folding from amino acid sequence over discrete conformation spaces. , 1991, Biochemistry.

[41]  Gui-Rong Liu,et al.  Quantifying the parameters of Prusiner's heterodimer model for prion replication , 2005 .

[42]  Gerik Scheuermann,et al.  Visualization of Lattice-Based Protein Folding Simulations , 2006, Tenth International Conference on Information Visualisation (IV'06).

[43]  Madhu Chetty,et al.  A Guided Genetic Algorithm for Protein Folding Prediction Using 3D Hydrophobic-Hydrophilic Model , 2006, 2006 IEEE International Conference on Evolutionary Computation.