A novel approach for protein structure prediction based on an estimation of distribution algorithm

Protein structure prediction is one of the major challenges in structural biology and has wide potential applications in biotechnology. However, the problem is faced with a difficult optimization requirement with particularly complex energy landscapes. The current article aims to present a novel approach namely AHEDA as an evolutionary-based solution to overcome the problem. AHEDA uses the hydrophobic-polar model to develop a robust and efficient evolutionary-based algorithm for protein structure prediction. The method utilizes an integrated estimation of distribution algorithm that attempts to optimize the search process and prevent the destruction of structural blocks. It also uses a stochastic local search to improve its accuracy. Based on a comprehensive comparison with other existing methods on 24 widely used benchmarks, AHEDA was shown to generate highly accurate predictions compared to the other similar methods.

[1]  Kuo-Chen Chou,et al.  Using supervised fuzzy clustering to predict protein structural classes. , 2005, Biochemical and biophysical research communications.

[2]  Ian W. Davis,et al.  RosettaLigand docking with full ligand and receptor flexibility. , 2009, Journal of molecular biology.

[3]  Antônio F. Pereira de Araújo Folding protein models with a simple hydrophobic energy function: The fundamental importance of monomer inside/outside segregation , 1999 .

[4]  Janusz M Bujnicki,et al.  Protein‐Structure Prediction by Recombination of Fragments , 2006, Chembiochem : a European journal of chemical biology.

[5]  Gang Li,et al.  Heuristic-based tabu search algorithm for folding two-dimensional AB off-lattice model proteins , 2013, Comput. Biol. Chem..

[6]  J. Santos,et al.  Differential Evolution for Protein Structure Prediction Using the HP Model , 2011, IWINAC.

[7]  Frank Thomson Leighton,et al.  Protein folding in the hydrophobic-hydrophilic (HP) is NP-complete , 1998, RECOMB '98.

[8]  Faisal N. Abu-Khzam,et al.  Particle swarm optimization approach for protein structure prediction in the 3D HP model , 2009, Interdisciplinary Sciences: Computational Life Sciences.

[9]  Jianlin Cheng,et al.  A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  Peter V. Coveney,et al.  Protein Structure Prediction as a Hard Optimization Problem: The Genetic Algorithm Approach , 1997, physics/9708012.

[11]  Holger H. Hoos,et al.  An ant colony optimisation algorithm for the 2D and 3D hydrophobic polar protein folding problem , 2005, BMC Bioinformatics.

[12]  Shahriar Lotfi,et al.  Social-Based Algorithm (SBA) , 2013, Appl. Soft Comput..

[13]  K. Dill,et al.  Cooperativity in protein-folding kinetics. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[14]  K. Chou,et al.  iACP: a sequence-based tool for identifying anticancer peptides , 2016, Oncotarget.

[15]  Gregorio Toscano Pulido,et al.  Constraint-handling through multi-objective optimization: The hydrophobic-polar model for protein structure prediction , 2015, Comput. Oper. Res..

[16]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[17]  Vincenzo Cutello,et al.  An Immune Algorithm for Protein Structure Prediction on Lattice Models , 2007, IEEE Transactions on Evolutionary Computation.

[18]  M. Karplus,et al.  Protein Folding: A Perspective from Theory and Experiment. , 1998, Angewandte Chemie.

[19]  Helio J. C. Barbosa,et al.  A multiple minima genetic algorithm for protein structure prediction , 2014, Appl. Soft Comput..

[20]  Jaya Sil,et al.  An Improved Harmony Search Algorithm for Protein Structure Prediction Using 3D Off-Lattice Model , 2017, ICHSA.

[21]  S. Toma,et al.  Contact interactions method: A new algorithm for protein folding simulations , 1996, Protein science : a publication of the Protein Society.

[22]  David Baker,et al.  Accurate Automated Protein NMR Structure Determination Using Unassigned NOESY Data , 2009, Journal of the American Chemical Society.

[23]  Erik D. Goodman,et al.  A Standard GA Approach to Native Protein Conformation Prediction , 1995 .

[24]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[25]  Christian N. S. Pedersen,et al.  Protein Folding in the 2D HP Model , 1999 .

[26]  E I Shakhnovich,et al.  A test of lattice protein folding algorithms. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[27]  P. Güntert Automated NMR structure calculation with CYANA. , 2004, Methods in molecular biology.

[28]  Andrea Tettamanzi,et al.  A Memetic Algorithm for Protein Structure Prediction in a 3D-Lattice HP Model , 2004, EvoWorkshops.

[29]  Keun Ho Ryu,et al.  A 9-state hidden Markov model using protein secondary structure information for protein fold recognition , 2009, Comput. Biol. Medicine.

[30]  D. Baker,et al.  De novo protein structure generation from incomplete chemical shift assignments , 2009, Journal of biomolecular NMR.

[31]  Jafar Razmara,et al.  A rapid protein structure alignment algorithm based on a text modeling technique , 2011, Bioinformation.

[32]  Zhang Yi,et al.  Protein secondary structure prediction by using deep learning method , 2017, Knowl. Based Syst..

[33]  Seyyed Ali Seyyedsalehi,et al.  Protein secondary structure prediction using modular reciprocal bidirectional recurrent neural networks , 2010, Comput. Methods Programs Biomed..

[34]  Erfan Khaji,et al.  3D protein structure prediction using Imperialist Competitive algorithm and half sphere exposure prediction. , 2016, Journal of theoretical biology.

[35]  S. Baskar,et al.  Protein structure prediction using diversity controlled self-adaptive differential evolution with local search , 2015, Soft Comput..

[36]  Mihalis Yannakakis,et al.  On the Complexity of Protein Folding , 1998, J. Comput. Biol..

[37]  Pedro Larrañaga,et al.  A Review on Estimation of Distribution Algorithms , 2002, Estimation of Distribution Algorithms.

[38]  Nashat Mansour,et al.  Evolutionary Algorithm for Protein Structure Prediction , 2010, 2008 International Conference on Advanced Computer Theory and Engineering.

[39]  Hailong Hu,et al.  Protein secondary structure prediction based on the fuzzy support vector machine with the hyperplane optimization. , 2018, Gene.

[40]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.