When the Lowest Energy Does Not Induce Native Structures: Parallel Minimization of Multi-Energy Values by Hybridizing Searching Intelligences

Background Protein structure prediction (PSP), which is usually modeled as a computational optimization problem, remains one of the biggest challenges in computational biology. PSP encounters two difficult obstacles: the inaccurate energy function problem and the searching problem. Even if the lowest energy has been luckily found by the searching procedure, the correct protein structures are not guaranteed to obtain. Results A general parallel metaheuristic approach is presented to tackle the above two problems. Multi-energy functions are employed to simultaneously guide the parallel searching threads. Searching trajectories are in fact controlled by the parameters of heuristic algorithms. The parallel approach allows the parameters to be perturbed during the searching threads are running in parallel, while each thread is searching the lowest energy value determined by an individual energy function. By hybridizing the intelligences of parallel ant colonies and Monte Carlo Metropolis search, this paper demonstrates an implementation of our parallel approach for PSP. 16 classical instances were tested to show that the parallel approach is competitive for solving PSP problem. Conclusions This parallel approach combines various sources of both searching intelligences and energy functions, and thus predicts protein conformations with good quality jointly determined by all the parallel searching threads and energy functions. It provides a framework to combine different searching intelligence embedded in heuristic algorithms. It also constructs a container to hybridize different not-so-accurate objective functions which are usually derived from the domain expertise.

[1]  Yang Zhang Progress and challenges in protein structure prediction. , 2008, Current opinion in structural biology.

[2]  Thomas Stützle,et al.  Ant Colony Optimization: Overview and Recent Advances , 2018, Handbook of Metaheuristics.

[3]  Joaquín Bautista,et al.  Multiobjective constructive heuristics for the 1/3 variant of the time and space assembly line balancing problem: ACO and random greedy search , 2010, Inf. Sci..

[4]  Thomas Stützle,et al.  Accelerating Molecular Docking Calculations Using Graphics Processing Units , 2011, J. Chem. Inf. Model..

[5]  Zhang Hai,et al.  SCIENCE CHINA Information Sciences , 2010 .

[6]  Michael Levitt,et al.  Generalized ensemble methods for de novo structure prediction , 2009, Proceedings of the National Academy of Sciences.

[7]  B. Yu,et al.  A parallel improved ant colony optimization for multi-depot vehicle routing problem , 2011, J. Oper. Res. Soc..

[8]  Michael W Deem,et al.  Parallel tempering: theory, applications, and new perspectives. , 2005, Physical chemistry chemical physics : PCCP.

[9]  R. Unger,et al.  Finding the lowest free energy conformation of a protein is an NP-hard problem: proof and implications. , 1993, Bulletin of mathematical biology.

[10]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[11]  Tao Wang,et al.  Parallelization and performance characterization of protein 3D structure prediction of Rosetta , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[12]  David Baker,et al.  Ranking predicted protein structures with support vector regression , 2007, Proteins.

[13]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[14]  Gary B. Lamont,et al.  Considerations in engineering parallel multiobjective evolutionary algorithms , 2003, IEEE Trans. Evol. Comput..

[15]  J. Deneubourg,et al.  The self-organizing exploratory pattern of the argentine ant , 1990, Journal of Insect Behavior.

[16]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[17]  Christian Blum,et al.  Metaheuristics in combinatorial optimization: Overview and conceptual comparison , 2003, CSUR.

[18]  Thomas Stützle,et al.  MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..

[19]  V. Cutello,et al.  A multi-objective evolutionary approach to the protein structure prediction problem , 2006, Journal of The Royal Society Interface.

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  Joaquín Bautista,et al.  A new diversity induction mechanism for a multi-objective ant colony algorithm to solve a real-world time and space assembly line balancing problem , 2011, Memetic Comput..

[22]  Carlos A. Coello Coello,et al.  An updated survey of GA-based multiobjective optimization techniques , 2000, CSUR.

[23]  Hartmut Schmeck,et al.  Multi Colony Ant Algorithms , 2002, J. Heuristics.

[24]  G. Theraulaz,et al.  Inspiration for optimization from social insect behaviour , 2000, Nature.

[25]  Thomas Stützle,et al.  An ant colony optimization approach to flexible protein–ligand docking , 2007, Swarm Intelligence.

[26]  Paul H. Calamai,et al.  Exchange strategies for multiple Ant Colony System , 2007, Inf. Sci..

[27]  William E. Hart,et al.  Robust Proofs of NP-Hardness for Protein Folding: General Lattices and Energy Potentials , 1997, J. Comput. Biol..

[28]  Martín Pedemonte,et al.  A survey on parallel ant colony optimization , 2011, Appl. Soft Comput..

[29]  Jianlin Cheng,et al.  Evaluating the absolute quality of a single protein model using structural features and support vector machines , 2009, Proteins.

[30]  Milan Tuba,et al.  Comparison of different topologies for island-based multi-colony ant algorithms for the minimum weight vertex cover problem , 2010 .

[31]  David Baker,et al.  Advances in Rosetta protein structure prediction on massively parallel systems , 2008, IBM J. Res. Dev..

[32]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[33]  Thomas Stützle,et al.  Stochastic Local Search: Foundations & Applications , 2004 .

[34]  Julio Ortega Lopera,et al.  Comparison of parallel multi-objective approaches to protein structure prediction , 2011, The Journal of Supercomputing.

[35]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[36]  Thomas Stützle,et al.  Empirical Scoring Functions for Advanced Protein-Ligand Docking with PLANTS , 2009, J. Chem. Inf. Model..

[37]  El-Ghazali Talbi,et al.  A parallel hybrid genetic algorithm for protein structure prediction on the computational grid , 2007, Future Gener. Comput. Syst..

[38]  Glennie Helles,et al.  A comparative study of the reported performance of ab initio protein structure prediction algorithms , 2008, Journal of The Royal Society Interface.

[39]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[40]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[41]  Guillermo A. Cecchi,et al.  When the Optimal Is Not the Best: Parameter Estimation in Complex Biological Models , 2010, PloS one.

[42]  Qiang Lv,et al.  Solving 2D HP Protein Folding Problem by Parallel Ant Colonies , 2009, 2009 2nd International Conference on Biomedical Engineering and Informatics.

[43]  Qiang Lv,et al.  A parallel ant colonies approach to de novo prediction of protein backbone in CASP8/9 , 2011, Science China Information Sciences.

[44]  J. Deneubourg,et al.  Self-organized shortcuts in the Argentine ant , 1989, Naturwissenschaften.

[45]  Thomas Stützle,et al.  An analysis of communication policies for homogeneous multi-colony ACO algorithms , 2010, Inf. Sci..

[46]  Samuel L. DeLuca,et al.  Practically Useful: What the Rosetta Protein Modeling Suite Can Do for You , 2010, Biochemistry.

[47]  Qiang Lv,et al.  A Parallel ACO Approach Based on One Pheromone Matrix , 2006, ANTS Workshop.

[48]  Dominik Gront,et al.  BioShell - a package of tools for structural biology computations , 2006, Bioinform..

[49]  Qiang Lv,et al.  Towards Termination Criteria of Ant Colony Optimization , 2007, Third International Conference on Natural Computation (ICNC 2007).

[50]  Julio Ortega Lopera,et al.  A Hybrid Scheme to Solve the Protein Structure Prediction Problem , 2010, IWPACBB.

[51]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[52]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[53]  Luca Maria Gambardella,et al.  Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..

[54]  Jasper A Vrugt,et al.  Improved evolutionary optimization from genetically adaptive multimethod search , 2007, Proceedings of the National Academy of Sciences.

[55]  Marco Dorigo,et al.  Ant colony optimization theory: A survey , 2005, Theor. Comput. Sci..

[56]  Thomas Stützle,et al.  PLANTS: Application of Ant Colony Optimization to Structure-Based Drug Design , 2006, ANTS Workshop.

[57]  Teodor Gabriel Crainic,et al.  Parallel Meta-Heuristics , 2010 .