Parallel AlineaGA: An island parallel evolutionary algorithm for multiple sequence alignment

Multiple sequence alignment is the base of a growing number of Bioinformatics applications. This does not mean that the accuracy of the existing methods corresponds to biologically faultless alignments. Searching for the optimal alignment for a set of sequences is often hindered by the size and complexity of the search space. Parallel Genetic Algorithms are a class of stochastic algorithms which can increase the speed up of the algorithms. They also enhance the efficiency of the search and the robustness of the solutions by delivering results that are better than those provided by the sum of several sequential Genetic Algorithms. AlineaGA is an evolutionary method for solving protein multiple sequence alignment. It uses a Genetic Algorithm on which some of its genetic operators embed a simple local search optimization. We have implemented its parallel version which we now present. Comparing with its sequential version we have observed an improvement in the search for the best solution. We have also compared its performance with ClustalW2 and T-Coffee, observing that Parallel AlineaGA can lead the search for better solutions for the majority of the datasets in study.

[1]  Erick Cantú-Paz,et al.  A Survey of Parallel Genetic Algorithms , 2000 .

[2]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[3]  L. A. Anbarasu,et al.  Multiple molecular sequence alignment by island parallel genetic algorithm , 2000 .

[4]  Erik L L Sonnhammer,et al.  Quality assessment of multiple alignment programs , 2002, FEBS letters.

[5]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[6]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[7]  Kenneth DeJong,et al.  Learning with genetic algorithms: An overview , 1988, Machine Learning.

[8]  Chunlin Wang,et al.  Genomic multiple sequence alignments: refinement using a genetic algorithm , 2005, BMC Bioinformatics.

[9]  Cheng-Yan Kao,et al.  Using Genetic Algorithms to Solve Multiple Sequence Alignments , 2000, GECCO.

[10]  D. Higgins,et al.  RAGA: RNA sequence alignment by genetic algorithm. , 1997, Nucleic acids research.

[11]  Miguel A. Vega-Rodríguez,et al.  AlineaGA—a genetic algorithm with local search optimization for multiple sequence alignment , 2010, Applied Intelligence.

[12]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[13]  S. Bandyopadhyay,et al.  Evolutionary computation in bioinformatics: a review , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[14]  Andrew Lumsdaine,et al.  Design and implementation of a high-performance MPI for C# and the common language infrastructure , 2008, PPOPP.

[15]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[16]  Kumar Chellapilla,et al.  Multiple sequence alignment using evolutionary programming , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[17]  K. De Jong Learning with Genetic Algorithms: An Overview , 1988 .

[18]  Miguel A. Vega-Rodríguez,et al.  Optimizing Multiple Sequence Alignment by Improving Mutation Operators of a Genetic Algorithm , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[19]  Miguel A. Vega-Rodríguez,et al.  AlineaGA: A Genetic Algorithm for Multiple Sequence Alignment , 2008, New Challenges in Applied Intelligence Technologies.

[20]  Miguel A. Vega-Rodríguez,et al.  An evolutionary approach for performing multiple sequence alignment , 2010, IEEE Congress on Evolutionary Computation.

[21]  D. Higgins,et al.  SAGA: sequence alignment by genetic algorithm. , 1996, Nucleic acids research.

[22]  M. O. Dayhoff,et al.  22 A Model of Evolutionary Change in Proteins , 1978 .

[23]  Olivier Poch,et al.  BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs , 1999, Bioinform..

[24]  Enrique Alba,et al.  A survey of parallel distributed genetic algorithms , 1999, Complex..

[25]  Cédric Notredame,et al.  Recent Evolutions of Multiple Sequence Alignment Algorithms , 2007, PLoS Comput. Biol..