Hybrid multiobjective artificial bee colony for multiple sequence alignment

Graphical abstractDisplay Omitted HighlightsWe propose multiobjective evolutionary computation for multiple sequence alignment.We optimize two widely-used objective functions in the literature: WSP and TC.Hybridization of ABC and Kalign2 to obtain more accurate alignments.A study between our proposal and 13 aligners from the bioinformatics field. In the bioinformatics community, it is really important to find an accurate and simultaneous alignment among diverse biological sequences which are assumed to have an evolutionary relationship. From the alignment, the sequences homology is inferred and the shared evolutionary origins among the sequences are extracted by using phylogenetic analysis. This problem is known as the multiple sequence alignment (MSA) problem. In the literature, several approaches have been proposed to solve the MSA problem, such as progressive alignments methods, consistency-based algorithms, or genetic algorithms (GAs). In this work, we propose a Hybrid Multiobjective Evolutionary Algorithm based on the behaviour of honey bees for solving the MSA problem, the hybrid multiobjective artificial bee colony (HMOABC) algorithm. HMOABC considers two objective functions with the aim of preserving the quality and consistency of the alignment: the weighted sum-of-pairs function with affine gap penalties (WSP) and the number of totally conserved (TC) columns score. In order to assess the accuracy of HMOABC, we have used the BAliBASE benchmark (version 3.0), which according to the developers presents more challenging test cases representing the real problems encountered when aligning large sets of complex sequences. Our multiobjective approach has been compared with 13 well-known methods in bioinformatics field and with other 6 evolutionary algorithms published in the literature.

[1]  Ruhul A. Sarker,et al.  Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment , 2011, BMC Bioinformatics.

[2]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[3]  Selcuk Aslan,et al.  Alignment of biological sequences by discrete Artificial Bee Colony algorithm , 2015, 2015 23nd Signal Processing and Communications Applications Conference (SIU).

[4]  Chuong B. Do,et al.  ProbCons: Probabilistic consistency-based multiple sequence alignment. , 2005, Genome research.

[5]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[6]  D. Karaboga,et al.  On the performance of artificial bee colony (ABC) algorithm , 2008, Appl. Soft Comput..

[7]  Ali R. Yildiz,et al.  A new hybrid artificial bee colony algorithm for robust optimal design and manufacturing , 2013, Appl. Soft Comput..

[8]  D. Higgins,et al.  SAGA: sequence alignment by genetic algorithm. , 1996, Nucleic acids research.

[9]  Tao Jiang,et al.  On the Complexity of Multiple Sequence Alignment , 1994, J. Comput. Biol..

[10]  Ali R. Yildiz,et al.  Optimization of cutting parameters in multi-pass turning using artificial bee colony-based approach , 2013, Inf. Sci..

[11]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[12]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[13]  Colin G. Johnson,et al.  An ant colony algorithm for multiple sequence alignment in bioinformatics , 2003, ICANNGA.

[14]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[15]  Dervis Karaboga,et al.  A comparative study of Artificial Bee Colony algorithm , 2009, Appl. Math. Comput..

[16]  Aravind Seshadri,et al.  A FAST ELITIST MULTIOBJECTIVE GENETIC ALGORITHM: NSGA-II , 2000 .

[17]  W. A. Beyer,et al.  Some Biological Sequence Metrics , 1976 .

[18]  J. Mixter Fast , 2012 .

[19]  Abdesslem Layeb,et al.  Multiple Sequence Alignment by Immune Artificial System , 2007, 2007 IEEE/ACS International Conference on Computer Systems and Applications.

[20]  Olivier Poch,et al.  BAliBASE 3.0: Latest developments of the multiple sequence alignment benchmark , 2005, Proteins.

[21]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[22]  C. Gondro,et al.  A simple genetic algorithm for multiple sequence alignment. , 2007, Genetics and molecular research : GMR.

[23]  Hao Zhang,et al.  A hybrid multi-objective artificial bee colony algorithm for burdening optimization of copper strip production , 2012 .

[24]  Yongchao Liu,et al.  MSAProbs: multiple sequence alignment based on pair hidden Markov models and partition function posterior probabilities , 2010, Bioinform..

[25]  Cédric Notredame,et al.  3DCoffee: combining protein sequences and structures within multiple sequence alignments. , 2004, Journal of molecular biology.

[26]  R. Doolittle Similar amino acid sequences: chance or common ancestry? , 1981, Science.

[27]  E. Sonnhammer,et al.  Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features , 2008, Nucleic acids research.

[28]  C. Notredame,et al.  Recent progress in multiple sequence alignment: a survey. , 2002, Pharmacogenomics.

[29]  Hasan H Otu,et al.  Objective functions. , 2014, Methods in molecular biology.

[30]  Héctor Pomares,et al.  Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns , 2013, Bioinform..

[31]  Ari Löytynoja,et al.  An algorithm for progressive multiple alignment of sequences with insertions. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Yudong Zhang,et al.  Optimal Multi-Level Thresholding Based on Maximum Tsallis Entropy via an Artificial Bee Colony Approach , 2011, Entropy.

[33]  Yudong Zhang,et al.  Feed‐forward neural network optimized by hybridization of PSO and ABC for abnormal brain detection , 2015, Int. J. Imaging Syst. Technol..

[34]  R. Doolittle,et al.  Progressive sequence alignment as a prerequisitetto correct phylogenetic trees , 2007, Journal of Molecular Evolution.

[35]  N. Grishin,et al.  MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information , 2006, Nucleic acids research.

[36]  Xiujuan Lei,et al.  Artificial bee colony algorithm for solving multiple sequence alignment , 2010, 2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA).

[37]  Dervis Karaboga,et al.  AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION , 2005 .

[38]  Ali Rıza Yıldız,et al.  Structural Damage Detection Using Modal Parameters and Particle Swarm Optimization , 2012 .

[39]  D. Bacon,et al.  Multiple Sequence Alignment , 1986, Journal of molecular biology.

[40]  Dennis R. Livesay,et al.  Probalign: multiple sequence alignment using partition function posterior probabilities , 2006, Bioinform..

[41]  Lior Pachter,et al.  Fast Statistical Alignment , 2009, PLoS Comput. Biol..

[42]  Michael Kaufmann,et al.  DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment , 2008, Algorithms for Molecular Biology.

[43]  Yudong Zhang,et al.  MAGNETIC RESONANCE BRAIN IMAGE CLASSIFICATION BY AN IMPROVED ARTIFICIAL BEE COLONY ALGORITHM , 2011 .

[44]  Javid Taheri,et al.  RBT-GA: a novel metaheuristic for solving the multiple sequence alignment problem , 2009, BMC Genomics.

[45]  Lenan Wu,et al.  UCAV Path Planning by Fitness-Scaling Adaptive Chaotic Particle Swarm Optimization , 2013 .

[46]  Ruhul A. Sarker,et al.  Progressive Alignment Method Using Genetic Algorithm for Multiple Sequence Alignment , 2012, IEEE Transactions on Evolutionary Computation.

[47]  Héctor Pomares,et al.  Optimization of multiple sequence alignment methodologies using a multiobjective evolutionary algorithm based on NSGA-II , 2012, 2012 IEEE Congress on Evolutionary Computation.