Using a Parallel Team of Multiobjective Evolutionary Algorithms to Solve the Motif Discovery Problem

This paper proposes the use of a parallelmultiobjective evolutionary technique to predict patterns, motifs, in real deoxyribonucleic acid (DNA) sequences. DNA analysis is a very important branch within bioinformatics, resulting in a large number of NP-hard optimization problems such as multiple alignment, motif finding, or protein folding. In this work we study the use of amultiobjective evolutionary algorithms team to solve the Motif Discovery Problem. According to this, we have designed a parallel heuristic that allows the collaborative work of four algorithms, two population-based algorithms: Differential Evolution with Pareto Tournaments and Nondominated Sorting Genetic Algorithm II, and two trajectory-based algorithms: Multiobjective Variable Neighborhood Search and Multiobjective Skewed Variable Neighborhood Search. In this way, we take advantage of the properties of different algorithms, getting to expand the search space covered in our problem. As we will see, the results obtained by our team significantly improve the results published in previous research.

[1]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[2]  William Stafford Noble,et al.  Assessing computational tools for the discovery of transcription factor binding sites , 2005, Nature Biotechnology.

[3]  Graziano Pesole,et al.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes , 2004, Nucleic Acids Res..

[4]  P. D’haeseleer What are DNA sequence motifs? , 2006, Nature Biotechnology.

[5]  J. Dopazo,et al.  Methods and approaches in the analysis of gene expression data. , 2001, Journal of immunological methods.

[6]  Miguel A. Vega-Rodríguez,et al.  A Multiobjective Variable Neighborhood Search for Solving the Motif Discovery Problem , 2010, SOCO.

[7]  Holger Karas,et al.  TRANSFAC: a database on transcription factors and their DNA binding sites , 1996, Nucleic Acids Res..

[8]  Krzysztof A. Cyran,et al.  Advances in Intelligent and Soft Computing , 2009 .

[9]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[10]  Mehmet Kaya,et al.  MOGAMOD: Multi-objective genetic algorithm for motif discovery , 2009, Expert Syst. Appl..

[11]  Miguel A. Vega-Rodríguez,et al.  Solving the motif discovery problem by using Differential Evolution with Pareto Tournaments , 2010, IEEE Congress on Evolutionary Computation.

[12]  G. Church,et al.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation , 1998, Nature Biotechnology.