On Lattice Protein Structure Prediction Revisited

Protein structure prediction is regarded as a highly challenging problem both for the biology and for the computational communities. In recent years, many approaches have been developed, moving to increasingly complex lattice models and off-lattice models. This paper presents a Large Neighborhood Search (LNS) to find the native state for the Hydrophobic-Polar (HP) model on the Face-Centered Cubic (FCC) lattice or, in other words, a self-avoiding walk on the FCC lattice having a maximum number of H-H contacts. The algorithm starts with a tabu-search algorithm, whose solution is then improved by a combination of constraint programming and LNS. The flexible framework of this hybrid algorithm allows an adaptation to the Miyazawa-Jernigan contact potential, in place of the HP model, thus suggesting its potential for tertiary structure prediction. Benchmarking statistics are given for our method against the hydrophobic core threading program HPstruct, an exact method which can be viewed as complementary to our method.

[1]  Ján Manuch,et al.  Inverse protein folding in 3D hexagonal prism lattice under HP model , 2008, BIOCOMP.

[2]  Salil P. Vadhan,et al.  Computational Complexity , 2005, Encyclopedia of Cryptography and Security.

[3]  N. Madras,et al.  THE SELF-AVOIDING WALK , 2006 .

[4]  R. Schulz,et al.  Protein Structure Prediction , 2020, Methods in Molecular Biology.

[5]  Alessandro Dal Palù,et al.  Constraint Logic Programming approach to protein structure prediction , 2004, BMC Bioinformatics.

[6]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[7]  P. Clote,et al.  Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency. , 2005, RNA.

[8]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[9]  Wei Zhang,et al.  A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculations , 2003, J. Comput. Chem..

[10]  H. Bernstein Recent changes to RasMol, recombining the variants. , 2000, Trends in biochemical sciences.

[11]  Yue,et al.  Sequence-structure relationships in proteins and copolymers. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[12]  Wing Hung Wong,et al.  A study of density of states and ground states in hydrophobic-hydrophilic protein folding models by equi-energy sampling. , 2006, The Journal of chemical physics.

[13]  Peter Clote,et al.  DiANNA 1.1: an extension of the DiANNA web server for ternary cysteine classification , 2006, Nucleic Acids Res..

[14]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[15]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[16]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[17]  Richard M. Jackson,et al.  An evaluation of automated homology modelling methods at low target-template sequence similarity , 2007, Bioinform..

[18]  Pascal Van Hentenryck,et al.  Parallelizing Constraint Programs Transparently , 2007, CP.

[19]  Rolf Backofen,et al.  Algorithmic approach to quantifying the hydrophobic force contribution in protein folding , 1999, German Conference on Bioinformatics.

[20]  Paul Shaw,et al.  Using Constraint Programming and Local Search Methods to Solve Vehicle Routing Problems , 1998, CP.

[21]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[22]  Yang Zhang,et al.  I-TASSER server for protein 3D structure prediction , 2008, BMC Bioinformatics.

[23]  Peter Clote,et al.  Disulfide connectivity prediction using secondary structure information and diresidue frequencies , 2005, Bioinform..

[24]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[25]  C A Floudas,et al.  Computational methods in protein structure prediction. , 2007, Biotechnology and bioengineering.

[26]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[27]  A. Sali,et al.  Comparative protein structure modeling by iterative alignment, model building and model assessment. , 2003, Nucleic Acids Research.

[28]  Pawel Winter,et al.  Reconstructing protein structure from solvent exposure using tabu search , 2006, Algorithms for Molecular Biology.

[29]  Rolf Backofen The Protein Structure Prediction Problem: A Constraint Optimization Approach using a New Lower Bound , 2004, Constraints.

[30]  E I Shakhnovich,et al.  A test of lattice protein folding algorithms. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Daniel Fischer,et al.  Convergent evolution of protein structure prediction and computer chess tournaments: CASP, Kasparov, and CAFASP , 2001, IBM Syst. J..

[32]  Rolf Backofen,et al.  Application of constraint programming techniques for structure prediction of lattice proteins with extended alphabets , 1999, Bioinform..

[33]  Ian M. Sander,et al.  Cotranslational folding promotes beta-helix formation and avoids aggregation in vivo. , 2008, Journal of molecular biology.

[34]  Sebastian Will Constraint-Based Hydrophobic Core Construction for Protein Structure Prediction in the Face-Centered-Cubic Lattice , 2002, Pacific Symposium on Biocomputing.

[35]  K Yue,et al.  Folding proteins with a simple energy function and extensive conformational searching , 1996, Protein science : a publication of the Protein Society.

[36]  John L. Klepeis,et al.  Prediction of β‐sheet topology and disulfide bridges in polypeptides , 2003, J. Comput. Chem..

[37]  J. Borreguero,et al.  Mechanism for the α‐helix to β‐hairpin transition , 2003, Proteins.

[38]  Rolf Backofen,et al.  A Constraint-Based Approach to Structure Prediction for Simplified Protein Models That Outperforms Other Existing Methods , 2003, ICLP.

[39]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[40]  Sebastian Will Exact, constraint-based structure prediction in simple protein models , 2005 .

[41]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[42]  PatrickY.-S. Lam,et al.  Rational design of potent, bioavailable, nonpeptide cyclic ureas as HIV protease inhibitors. , 1994, Science.

[43]  Torsten Schwede,et al.  The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling , 2006, Bioinform..

[44]  P. Baldi,et al.  Prediction of coordination number and relative solvent accessibility in proteins , 2002, Proteins.

[45]  Temple F. Smith,et al.  Global optimum protein threading with gapped alignment and empirical pair score functions. , 1996, Journal of molecular biology.

[46]  N. J. A. Sloane,et al.  Sphere Packings, Lattices and Groups , 1987, Grundlehren der mathematischen Wissenschaften.

[47]  Glennie Helles,et al.  A comparative study of the reported performance of ab initio protein structure prediction algorithms , 2008, Journal of The Royal Society Interface.

[48]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[49]  Barry Cipra,et al.  Packing Challenge Mastered At Last , 1998, Science.

[50]  M. Karplus,et al.  Kinetics of protein folding. A lattice model study of the requirements for folding to the native state. , 1994, Journal of molecular biology.

[51]  A. Kolinski,et al.  Simulations of the Folding of a Globular Protein , 1990, Science.

[52]  R. Lathrop The protein threading problem with sequence amino acid interaction preferences is NP-complete. , 1994, Protein engineering.

[53]  Andrzej Kloczkowski,et al.  Inferring ideal amino acid interaction forms from statistical protein contact potentials , 2005, Proteins.

[54]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[55]  D. Covell,et al.  Conformations of folded proteins in restricted spaces. , 1990, Biochemistry.

[56]  Pascal Van Hentenryck,et al.  Protein Structure Prediction on the Face Centered Cubic Lattice by Local Search , 2008, AAAI.

[57]  Thomas Wüst,et al.  Versatile approach to access the low temperature thermodynamics of lattice polymers and proteins. , 2009, Physical review letters.

[58]  Erich Bornberg-Bauer,et al.  Chain growth algorithms for HP-type lattice proteins , 1997, RECOMB '97.

[59]  S. Altschul,et al.  Significance of nucleotide sequence alignments: a method for random sequence permutation that preserves dinucleotide and codon usage. , 1985, Molecular biology and evolution.

[60]  J. Skolnick,et al.  Ab initio modeling of small proteins by iterative TASSER simulations , 2007, BMC Biology.

[61]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[62]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[63]  Pascal Van Hentenryck,et al.  Protein Structure Prediction with Large Neighborhood Constraint Programming Search , 2008, CP.

[64]  R. Jernigan,et al.  Self‐consistent estimation of inter‐residue protein contact energies based on an equilibrium mixture approximation of residues , 1999, Proteins.