Adoption of an improved PSO to explore a compound multi-objective energy function in protein structure prediction

Abstract The protein structure prediction (PSP) problem, i.e., predicting the three-dimensional structure of a protein from its sequence, remains challenging in computational biology. The inaccuracy of existing protein energy functions and the huge conformation search space make the problem difficult to solve. In this study, the PSP problem is modeled as a multi-objective optimization problem. A physics-based energy function and a knowledge-based energy function are combined to construct the three-objective energy function. An improved multi-objective particle swarm optimization coupled with two archives is employed to execute the conformation space search. In addition, a mechanism based on Pareto non-dominated sorting is designed to properly address the slightly worse solutions. Finally, the experimental results demonstrate the effectiveness of the proposed approach. A new perspective for solving the PSP problem by means of multi-objective optimization is given in this paper.

[1]  M. Karplus,et al.  Effective energy functions for protein structure prediction. , 2000, Current opinion in structural biology.

[2]  P. Zielenkiewicz,et al.  Why similar protein sequences encode similar three-dimensional structures? , 2010 .

[3]  Yang Zhang,et al.  A Novel Side-Chain Orientation Dependent Potential Derived from Random-Walk Reference State for Protein Fold Selection and Structure Prediction , 2010, PloS one.

[4]  Z. Luthey-Schulten,et al.  Ab initio protein structure prediction. , 2002, Current opinion in structural biology.

[5]  John A. W. McCall,et al.  D2MOPSO: MOPSO Based on Decomposition and Dominance with Archiving Using Crowding Distance in Objective and Solution Spaces , 2014, Evolutionary Computation.

[6]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[7]  Carlos A. Coello Coello,et al.  Evolutionary multi-objective optimization: a historical view of the field , 2006, IEEE Comput. Intell. Mag..

[8]  Marco Biasini,et al.  SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information , 2014, Nucleic Acids Res..

[9]  Seyed Naser Razavi,et al.  Empowering particle swarm optimization algorithm using multi agents' capability: A holonic approach , 2017, Knowl. Based Syst..

[10]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[11]  Vincenzo Cutello,et al.  Computational Studies of Peptide and Protein Structure Prediction Problems via Multiobjective Evolutionary Algorithms , 2008, Multiobjective Problem Solving from Nature.

[12]  Richard Bonneau,et al.  Ab initio protein structure prediction: progress and prospects. , 2001, Annual review of biophysics and biomolecular structure.

[13]  Mario Inostroza-Ponta,et al.  APL: An angle probability list to improve knowledge-based metaheuristics for the three-dimensional protein structure prediction , 2015, Comput. Biol. Chem..

[14]  Jianhua Zhang,et al.  Robot path planning in uncertain environment using multi-objective particle swarm optimization , 2013, Neurocomputing.

[15]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction: Progress and new directions in round XI , 2016, Proteins.

[16]  Robert L Jernigan,et al.  How effective for fold recognition is a potential of mean force that includes relative orientations between contacting residues in proteins? , 2005, The Journal of chemical physics.

[17]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[18]  Joshua D. Knowles,et al.  Multiobjective Optimization in Bioinformatics and Computational Biology , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[19]  Frank DiMaio,et al.  Protein structure prediction using Rosetta in CASP12 , 2018, Proteins.

[20]  Narayanan Eswar,et al.  Protein structure modeling with MODELLER. , 2008, Methods in molecular biology.

[21]  Jack Snoeyink,et al.  Scientific benchmarks for guiding macromolecular energy function improvement. , 2013, Methods in enzymology.

[22]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[23]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[24]  Sandra M. Venske,et al.  ADEMO/D: An adaptive differential evolution for protein structure prediction problem , 2016, Expert Syst. Appl..

[25]  Marco Laumanns,et al.  Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..

[26]  Yang Zhang,et al.  The I-TASSER Suite: protein structure and function prediction , 2014, Nature Methods.

[27]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[28]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[29]  Sanghamitra Bandyopadhyay,et al.  Multi-Objective Particle Swarm Optimization with time variant inertia and acceleration coefficients , 2007, Inf. Sci..

[30]  Yang Zhang,et al.  Ab initio protein structure assembly using continuous structure fragments and optimized knowledge‐based force field , 2012, Proteins.

[31]  Wenyin Gong,et al.  An improved multiobjective differential evolution based on Pareto-adaptive epsilon-dominance and orthogonal design , 2009, Eur. J. Oper. Res..

[32]  Daniel W. A. Buchan,et al.  Scalable web services for the PSIPRED Protein Analysis Workbench , 2013, Nucleic Acids Res..

[33]  Carlos A. Coello Coello,et al.  Handling multiple objectives with particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[34]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[35]  A. Kristina Downing Introduction to Protein Architecture: the structural biology of proteins: by Arthur M. Lesk , 2001 .

[36]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[37]  Dong Xu,et al.  Fast algorithm for population‐based protein structural model analysis , 2013, Proteomics.

[38]  George A. Khoury,et al.  Protein folding and de novo protein design for biotechnological applications. , 2014, Trends in biotechnology.

[39]  Shangce Gao,et al.  AIMOES: Archive information assisted multi-objective evolutionary strategy for ab initio protein structure prediction , 2018, Knowl. Based Syst..

[40]  Zbigniew Michalewicz,et al.  A locally convergent rotationally invariant particle swarm optimization algorithm , 2014, Swarm Intelligence.

[41]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[42]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[43]  D. Baker,et al.  Clustering of low-energy conformations near the native structures of small proteins. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Lisa N Kinch,et al.  Evaluation of free modeling targets in CASP11 and ROLL , 2016, Proteins.

[45]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[46]  Jooyoung Lee,et al.  De novo protein structure prediction by dynamic fragment assembly and conformational space annealing , 2011, Proteins.

[47]  Gary G. Yen,et al.  Cultural-Based Multiobjective Particle Swarm Optimization , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[48]  Aurora Trinidad Ramirez Pozo,et al.  Using reference points to update the archive of MOPSO algorithms in Many-Objective Optimization , 2014, Neurocomputing.

[49]  Alexander D. MacKerell,et al.  Development and current status of the CHARMM force field for nucleic acids , 2000, Biopolymers.

[50]  G M Crippen,et al.  Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins. , 1994, Journal of molecular biology.

[51]  Roland L. Dunbrack Rotamer libraries in the 21st century. , 2002, Current opinion in structural biology.

[52]  Jiujun Cheng,et al.  Incorporation of Solvent Effect into Multi-Objective Evolutionary Algorithm for Improved Protein Structure Prediction , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[53]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[54]  Lothar Thiele,et al.  Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[55]  Kiyoshi Tanaka,et al.  Controlling Dominance Area of Solutions and Its Impact on the Performance of MOEAs , 2007, EMO.

[56]  Yaoqi Zhou,et al.  Specific interactions for ab initio folding of protein terminal regions with secondary structures , 2008, Proteins.

[57]  David Corne,et al.  The Pareto archived evolution strategy: a new baseline algorithm for Pareto multiobjective optimisation , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[58]  Yu-Jun Zheng,et al.  Population Classification in Fire Evacuation: A Multiobjective Particle Swarm Optimization Approach , 2014, IEEE Transactions on Evolutionary Computation.

[59]  A. Sali,et al.  Protein Structure Prediction and Structural Genomics , 2001, Science.

[60]  Sushmita Mitra,et al.  Natural computing methods in bioinformatics: A survey , 2009, Inf. Fusion.

[61]  Alexandre C. B. Delbem,et al.  Multiobjective evolutionary algorithm with many tables for purely ab initio protein structure prediction , 2013, J. Comput. Chem..

[62]  Changjun Zhou,et al.  Protein folding optimization based on 3D off-lattice model via an improved artificial bee colony algorithm , 2015, Journal of Molecular Modeling.

[63]  Glennie Helles,et al.  A comparative study of the reported performance of ab initio protein structure prediction algorithms , 2008, Journal of The Royal Society Interface.

[64]  Yue Shi,et al.  A modified particle swarm optimizer , 1998, 1998 IEEE International Conference on Evolutionary Computation Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98TH8360).

[65]  Roland L. Dunbrack,et al.  Nonplanar peptide bonds in proteins are common and conserved but not biased toward active sites , 2011, Proceedings of the National Academy of Sciences.

[66]  Anna Tramontano,et al.  Methods of model accuracy estimation can help selecting the best models from decoy sets: Assessment of model accuracy estimations in CASP11 , 2016, Proteins.

[67]  Anula Khare,et al.  A review of particle swarm optimization and its applications in Solar Photovoltaic system , 2013, Appl. Soft Comput..

[68]  J. Kennedy,et al.  Population structure and particle swarm performance , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[69]  J. Skolnick,et al.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. , 2011, Biophysical journal.

[70]  Jianpeng Ma,et al.  CHARMM: The biomolecular simulation program , 2009, J. Comput. Chem..

[71]  Andrew Wilkinson Compendium of Chemical Terminology , 1997 .

[72]  Peter J. Fleming,et al.  Conflict, Harmony, and Independence: Relationships in Evolutionary Multi-criterion Optimisation , 2003, EMO.

[73]  Andries Petrus Engelbrecht,et al.  A Convergence Proof for the Particle Swarm Optimiser , 2010, Fundam. Informaticae.

[74]  Pei-Chann Chang,et al.  Local search enhanced multi-objective PSO algorithm for scheduling textile production processes with environmental considerations , 2017, Appl. Soft Comput..

[75]  David S. Goodsell,et al.  The RCSB protein data bank: integrative view of protein, gene and 3D structural information , 2016, Nucleic Acids Res..

[76]  K. Dill,et al.  The Protein-Folding Problem, 50 Years On , 2012, Science.