A new distributed modified extremal optimization for optimizing protein structure alignment

Identifying similar structures in proteins has emerged as one of the most attractive research topics in the post-genome era. Protein structure alignment, which is similar to sequence alignment, identifies the structural homology between two protein structures according to their three-dimensional conformation. One of the simplest yet most robust techniques for optimizing protein structure alignment is the contact map overlap maximization problem (the CMO problem). In this paper, we focus on heuristics for the CMO problem. In our previous work, we proposed a bio-inspired heuristic using distributed modified extremal optimization (DMEO) for the CMO problem. DMEO is a hybrid of population-based modified extremal optimization (PMEO) and the island model. DMEO enhances population diversity; however, individual evolution is extremely monotonous because evolutions of it is based on the greedy moving approach. To address this issue, we propose a novel bio-inspired heuristic, i.e., DMEO with different evolutionary strategy (DMEODES). DMEODES is also based on the island model; however, some of the islands, called hot-spot islands, have a different evolutionary strategy. To evaluate DMEODES, we used actual protein structures. Experimental results showed that DMEODES outperforms DMEO.

[1]  Yasuma Mori,et al.  Reducing Crossovers in Reconciliation Graphs with Extremal Optimization , 2007 .

[2]  Adam Godzik,et al.  Flexible algorithm for direct multiple alignment of protein structures and sequences , 1994, Comput. Appl. Biosci..

[3]  Robert D. Carr,et al.  101 optimal PDB structure alignments: a branch-and-cut algorithm for the maximum contact map overlap problem , 2001, RECOMB.

[4]  Stefan Boettcher,et al.  Extremal Optimization: Methods derived from Co-Evolution , 1999, GECCO.

[5]  Lam Fat Yeung,et al.  A similarity matrix-based hybrid algorithm for the contact map overlaps problem , 2011, Comput. Biol. Medicine.

[6]  Keiichi Tamura,et al.  Distributed Modified Extremal Optimization using Island Model for Reducing Crossovers in Reconciliation Graph , 2013 .

[7]  Christos H. Papadimitriou,et al.  Algorithmic aspects of protein structure similarity , 1999, 40th Annual Symposium on Foundations of Computer Science (Cat. No.99CB37039).

[8]  Giuseppe Lancia,et al.  Protein Structure Comparison: Algorithms and Applications , 2003, Mathematical Methods for Protein Structure Analysis and Design.

[9]  Keiichi Tamura,et al.  Optimal protein structure alignment using modified extremal optimization , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[10]  Joel Sokol,et al.  Optimal Protein Structure Alignment Using Maximum Cliques , 2005, Oper. Res..

[11]  Keiichi Tamura,et al.  Bio-inspired heuristic for optimizing protein structure alignment using distributed modified extremal optimization , 2014, 2014 IEEE 7th International Workshop on Computational Intelligence and Applications (IWCIA).

[12]  R. Carr,et al.  Branch-and-Cut Algorithms for Independent Set Problems: Integrality Gap and An Application to Protein Structure Alignment , 2000 .

[13]  Janusz M. Bujnicki,et al.  Prediction of protein structures, functions, and interactions , 2008 .

[14]  S. Balaji,et al.  A Simple Algorithm for Maximum Clique and Matching Protein Structures , 2010, Int. J. Comb. Optim. Probl. Informatics.

[15]  Lam Fat Yeung,et al.  Extremal Optimization for the Protein Structure Alignment , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine.

[16]  Wei Xie,et al.  A Reduction-Based Exact Algorithm for the Contact Map Overlap Problem , 2007, J. Comput. Biol..

[17]  Lena Jaeger,et al.  Introduction To Protein Structure , 2016 .

[18]  Michael Lappe,et al.  Joining Softassign and Dynamic Programming for the Contact Map Overlap Problem , 2007, BIRD.

[19]  Theodore C. Belding,et al.  The Distributed Genetic Algorithm Revisited , 1995, ICGA.

[20]  Keiichi Tamura,et al.  Population-based Modified Extremal Optimization for Contact Map Overlap Maximization Problem , 2013, 2013 Second IIAI International Conference on Advanced Applied Informatics.

[21]  Robert D. Carr,et al.  1001 Optimal PDB Structure Alignments: Integer Programming Methods for Finding the Maximum Contact Map Overlap , 2004, J. Comput. Biol..

[22]  Alberto Caprara,et al.  Structural alignment of large—size proteins via lagrangian relaxation , 2002, RECOMB '02.

[23]  Reiko Tanese,et al.  Distributed Genetic Algorithms , 1989, ICGA.