An Improved Genetic Algorithm for Multiple Sequence Alignment

Multiple sequence alignment (MSA) is one of the most essential tools in bioinformatics. Genetic algorithms are widely applied to the multiple sequence alignment problem, and the initial population and the crossover operator are their most critical components. In this paper, we construct three initial populations and a simple horizontal crossover as an alternative to the vertical crossover. The experimental results show that seeding the initial population with an appropriate proportion of high-quality MAFFT alignments improves the population, and that horizontal crossover reduces computing time and computational complexity. Combining the two methods improves the computational efficiency of multiple sequence alignment.
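The abstract does not define the crossover operators, so the following is only a minimal sketch of one common reading of horizontal crossover: the child takes the first rows (aligned sequences) from one parent alignment and the remaining rows from the other. The function names, the row-list representation, and the gap-padding step are assumptions made for illustration, not the authors' implementation.

```python
GAP = "-"


def pad_rows(rows):
    """Right-pad rows with gaps so that all rows have the same length."""
    width = max(len(r) for r in rows)
    return [r + GAP * (width - len(r)) for r in rows]


def drop_all_gap_columns(rows):
    """Remove columns that contain only gaps (rows must have equal length)."""
    keep = [j for j in range(len(rows[0])) if any(r[j] != GAP for r in rows)]
    return ["".join(r[j] for j in keep) for r in rows]


def horizontal_crossover(parent_a, parent_b, cut):
    """Assumed horizontal crossover: rows [0, cut) from parent_a, the rest from parent_b.

    Both parents must align the same sequences in the same row order.
    Each row is copied whole, so no residue is lost or reordered; only the
    final padding and all-gap column cleanup are needed.
    """
    if len(parent_a) != len(parent_b):
        raise ValueError("parents must contain the same number of rows")
    if not 0 < cut < len(parent_a):
        raise ValueError("cut must split the rows into two non-empty parts")
    child = list(parent_a[:cut]) + list(parent_b[cut:])
    return drop_all_gap_columns(pad_rows(child))


# Two hypothetical alignments of the same three sequences: ACGT, AGT, ACT.
parent_a = ["ACGT", "A-GT", "AC-T"]
parent_b = ["-ACGT", "-A-GT", "--ACT"]
child = horizontal_crossover(parent_a, parent_b, cut=1)
```

Because rows are exchanged whole, this operator needs no per-row bookkeeping of residue positions, which a column-wise (vertical) cut would require to keep every sequence intact. That is one plausible reason for the lower cost reported above. In a genetic algorithm the cut point would typically be drawn at random, for example with `random.randrange(1, len(parent_a))`.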
