A multiobjective swarm intelligence approach based on artificial bee colony for reliable DNA sequence design

The design of reliable DNA sequences is crucial in many engineering applications which depend on DNA-based technologies, such as nanotechnology or DNA computing. In these cases, two of the most important properties that must be controlled to obtain reliable sequences are self-assembly and self-complementary hybridization. These processes have to be restricted to avoid undesirable reactions, because in the specific case of DNA computing, undesirable reactions usually lead to incorrect computations. Therefore, it is important to design robust sets of sequences which provide efficient and reliable computations. The design of reliable DNA sequences involves heterogeneous and conflicting design criteria that do not fit traditional optimization methods. In this paper, DNA sequence design has been formulated as a multiobjective optimization problem and a novel multiobjective approach based on swarm intelligence has been proposed to solve it. Specifically, a multiobjective version of the Artificial Bee Colony metaheuristics (MO-ABC) is developed to tackle the problem. MO-ABC takes in consideration six different conflicting design criteria to generate reliable DNA sequences that can be used for bio-molecular computing. Moreover, in order to verify the effectiveness of the novel multiobjective proposal, formal comparisons with the well-known multiobjective standard NSGA-II (fast non-dominated sorting genetic algorithm) were performed. After a detailed study, results indicate that our artificial swarm intelligence approach obtains satisfactory reliable DNA sequences. Two multiobjective indicators were used in order to compare the developed algorithms: hypervolume and set coverage. Finally, other relevant works published in the literature were also studied to validate our results. To this respect the conclusion that can be drawn is that the novel approach proposed in this paper obtains very promising DNA sequences that significantly surpass other results previously published.

[1]  Max H. Garzon,et al.  Good encodings for DNA-based solutions to combinatorial problems , 1996, DNA Based Computers.

[2]  Rui Zhang,et al.  Research on the DNA Sequence Design Based on GA/PSO Algorithms , 2008, 2008 2nd International Conference on Bioinformatics and Biomedical Engineering.

[3]  J. Wetmur DNA probes: applications of the principles of nucleic acid hybridization. , 1991, Critical reviews in biochemistry and molecular biology.

[4]  Anne Condon,et al.  Strand design for biomolecular computation , 2002, Theor. Comput. Sci..

[5]  Masami Hagiya,et al.  Improving Sequence Design for DNA Computing , 2000, GECCO.

[6]  Laura F. Landweber,et al.  PUNCH: An Evolutionary Algorithm for Optimizing Bit Set Selection , 2001, DNA.

[7]  Masahito Yamamoto,et al.  Towards a general-purpose sequence design system in DNA computing , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).

[8]  David Corne,et al.  The Pareto archived evolution strategy: a new baseline algorithm for Pareto multiobjective optimisation , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[9]  Robert Penchovsky,et al.  DNA Library Design for Molecular Computation , 2003, J. Comput. Biol..

[10]  Yanfeng Wang,et al.  DNA codewords design using the improved NSGA-II algorithms , 2009, 2009 Fourth International on Conference on Bio-Inspired Computing.

[11]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[12]  Yuren Zhou,et al.  Multiobjective Optimization and Hybrid Evolutionary Algorithm to Solve Constrained Optimization Problems , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Max H. Garzon,et al.  Biomolecular computing and programming , 1999, IEEE Trans. Evol. Comput..

[14]  Anne Condon,et al.  From RNA Secondary Structure to Coding Theory: A Combinatorial Approach , 2002, DNA.

[15]  L M Adleman,et al.  Molecular computation of solutions to combinatorial problems. , 1994, Science.

[16]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[17]  Lothar Thiele,et al.  Comparison of Multiobjective Evolutionary Algorithms: Empirical Results , 2000, Evolutionary Computation.

[18]  David Harlan Wood,et al.  A PCR-based Protocol for In Vitro Selection of Non-crosshybridizing Oligonucleotides , 2002, DNA.

[19]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[20]  J. SantaLucia,et al.  A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation) , 2006 .

[22]  Mohd Saberi Mohamad,et al.  DNA sequence design for DNA computation based on binary particle swarm optimization , 2012 .

[23]  Marzuki Khalid,et al.  Evaluation of Ordering Methods for DNA Sequence Design Based on Ant Colony System , 2008, 2008 Second Asia International Conference on Modelling & Simulation (AMS).

[24]  H. Lindman Analysis of variance in complex experimental designs , 1974 .

[25]  Seri Mastura Mustaza,et al.  A modified computational model of ant colony system in DNA sequence design , 2011, 2011 IEEE Student Conference on Research and Development.

[26]  Satoshi Kobayashi,et al.  DNA sequence design using templates , 2002, New Generation Computing.

[27]  Junghuei Chen,et al.  A Software Tool for Generating Non-crosshybridizing Libraries of DNA Oligonucleotides , 2002, DNA.

[28]  Max H. Garzon,et al.  Reliability and Efficiency of a DNA-Based Computation , 1998 .

[29]  Kalyanmoy Deb,et al.  Controlled Elitist Non-dominated Sorting Genetic Algorithms for Better Convergence , 2001, EMO.

[30]  Zuwairie Ibrahim,et al.  Sequence design for Direct-Proportional Length-Based DNA Computing using Population-based Ant Colony Optimization , 2009, 2009 ICCAS-SICE.

[31]  D. Gifford,et al.  Automated constraint-based nucleotide sequence selection for DNA computation. , 1999, Bio Systems.

[32]  Byoung-Tak Zhang,et al.  Multiobjective evolutionary optimization of DNA sequences for reliable DNA computing , 2005, IEEE Transactions on Evolutionary Computation.

[33]  Dervis Karaboga,et al.  A survey: algorithms simulating bee swarm intelligence , 2009, Artificial Intelligence Review.

[34]  Qiang Zhang,et al.  DNA Word Set Design Based on Minimum Free Energy , 2010, IEEE Transactions on NanoBioscience.

[35]  Andries Petrus Engelbrecht,et al.  A Model to Optimize DNA Sequences Based on Particle Swarm Optimization , 2008, 2008 Second Asia International Conference on Modelling & Simulation (AMS).

[36]  Zhang Hongyan,et al.  Improved Genetic Algorithm for Designing DNA Sequences , 2009, 2009 Second International Symposium on Electronic Commerce and Security.

[37]  Guangzhao Cui,et al.  The Optimization of DNA encodings based on Modified PSO/GA Algorithm , 2010, ICCD 2010.

[38]  Wolfgang Banzhaf,et al.  DNASequencesGenerator: A Program for the Construction of DNA Sequences , 2001, DNA.

[39]  Jin Xu,et al.  DNA Sequence Design Based on Template Strategy , 2003, J. Chem. Inf. Comput. Sci..

[40]  Masahito Yamamoto,et al.  Developing Support System for Sequence Design in DNA Computing , 2001, DNA.

[41]  Zhen Cheng,et al.  DNA Sequences Optimization Based on Gravitational Search Algorithm for Reliable DNA Computing , 2011, 2011 Sixth International Conference on Bio-Inspired Computing: Theories and Applications.

[42]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[43]  L F Landweber,et al.  Molecular computation: RNA solutions to chess problems , 2000, Proc. Natl. Acad. Sci. USA.

[44]  Amit Marathe,et al.  On combinatorial DNA word design , 1999, DNA Based Computers.

[45]  In-Hee Lee,et al.  Evolutionary sequence generation for reliable DNA computing , 2002, Proceedings of the 2002 Congress on Evolutionary Computation. CEC'02 (Cat. No.02TH8600).