A survey on multiple sequence alignment using metaheuristics

Over the past two decades, Multiple Sequence Alignment (MSA) has been the subject of extensive research and has become an important domain in bioinformatics. MSA is an NP-hard problem, and traditional, heuristic, and metaheuristic methods have all been applied to it. Among these, metaheuristics have proven effective at overcoming the bottlenecks of the MSA problem. Various metaheuristic methods and software tools have been developed to address the speed and accuracy problems that arise as the number of sequences increases. In this article, we survey widely used metaheuristic methods and alignment tools applied to the MSA problem. Our review indicates, however, that time complexity remains a major challenge for the MSA problem.
