Constraint-Based Evolutionary Local Search for Protein Structures with Secondary Motifs

On-lattice protein structure prediction with empirical energy minimisation has drawn significant research effort. However, energy minimisation with free-modelling not necessarily leads to structures that are similar to the native structure of the given protein. In this paper, we show that energy minimisation has a positive correlation with structural similarity measures if we consider secondary motifs. We then present a constraint-based evolutionary local search framework for on-lattice protein structure prediction using secondary structural information. We approximate secondary motifs such as α-helix and β-strands on the lattice and propose a set of neighbourhood generation operators that respect those motifs. Our experimental results show significant improvement over the state-of-the-art methods in terms of similarity with the native structures determined by laboratory methods.

[1]  Kathleen Steinhöfel,et al.  A hybrid approach to protein folding problem integrating constraint programming with local search , 2010, BMC Bioinformatics.

[2]  Alessandro Dal Palù,et al.  Heuristics, optimizations, and parallelism for protein structure prediction in CLP(FD) , 2005, PPDP '05.

[3]  Abdul Sattar,et al.  An efficient encoding for simplified protein structure prediction using genetic algorithms , 2013, 2013 IEEE Congress on Evolutionary Computation.

[4]  Andrzej Kolinski,et al.  A minimal proteinlike lattice model: an alpha-helix motif. , 2005, The Journal of chemical physics.

[5]  Sue Whitesides,et al.  A complete and effective move set for simplified protein folding , 2003, RECOMB '03.

[6]  M. Levitt,et al.  The complexity and accuracy of discrete state models of protein structure. , 1995, Journal of molecular biology.

[7]  Federico Fogolari,et al.  Amino acid empirical contact energy definitions for fold recognition in the space of contact maps , 2003, BMC Bioinformatics.

[8]  Roberto Rossi,et al.  Cost-Based Filtering for Stochastic Inventory Control , 2006, CSCLP.

[9]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[10]  Agostino Dovier,et al.  Using Secondary Structure Information for Protein Folding in CLP(FD) , 2002, Electron. Notes Theor. Comput. Sci..

[11]  J. Skolnick,et al.  A minimal physically realistic protein-like lattice model: designing an energy landscape that ensures all-or-none folding to a unique native state. , 2003, Biophysical journal.

[12]  Alessandro Dal Palù,et al.  A constraint solver for discrete lattices, its parallelization, and application to protein structure prediction , 2007 .

[13]  Andrzej Kolinski,et al.  Protein fragment reconstruction using various modeling techniques , 2003, J. Comput. Aided Mol. Des..

[14]  M. Levitt,et al.  A lattice model for protein structure prediction at low resolution. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Fernando Niño,et al.  A novel ab-initio genetic-based approach for protein folding prediction , 2007, GECCO '07.

[16]  Michael J. Maher,et al.  Kangaroo: An Efficient Constraint-Based Local Search System Using Lazy Propagation , 2011, CP.

[17]  Charlotte M. Deane,et al.  Producing High-Accuracy Lattice Models from Protein Atomic Coordinates Including Side Chains , 2012, Adv. Bioinformatics.

[18]  Alessandro Dal Palù,et al.  Constraint Logic Programming approach to protein structure prediction , 2004, BMC Bioinformatics.

[19]  Kathleen Steinhöfel,et al.  Population-based local search for protein folding simulation in the MJ energy model and cubic lattices , 2009, Comput. Biol. Chem..

[20]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[21]  J. Skolnick,et al.  Reduced models of proteins and their applications , 2004 .

[22]  Jeffrey Skolnick,et al.  Fast procedure for reconstruction of full‐atom protein models from reduced representations , 2008, J. Comput. Chem..

[23]  J. Skolnick,et al.  Comparison of three Monte Carlo conformational search strategies for a proteinlike homopolymer model: Folding thermodynamics and identification of low-energy structures , 2000 .

[24]  Abdul Sattar,et al.  Mixed Heuristic Local Search for Protein Structure Prediction , 2013, AAAI.

[25]  Abdul Sattar,et al.  Simplified Lattice Models for Protein Structure Prediction: How Good Are They? , 2013, AAAI.

[26]  Alessandro Dal Palù,et al.  Protein Folding in CLP(FD) with Empirical Contact Energies , 2003, CSCLP.